Gene Arth_1043 details


Gene Information

Locus tag: Arth_1043
Symbol:
ID: 4446478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1120021
End bp: 1121265
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 68%
IMG OID: 639688846
Product: hypothetical protein
Protein accession: YP_830537
Protein GI: 116669604
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4692] Predicted neuraminidase (sialidase)
TIGRFAM ID:
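
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the record for Arth_1043.
length = gene_length_bp(1120021, 1121265)
print(length, protein_length_aa(length))  # prints "1245 414"
```

Under those assumptions the result matches the Gene Length (1245 bp) and Protein Length (414 aa) fields.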


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAACA CCACCACCGA CGCGTACAGC ACCATCACCC CGGACGGGAC CGTAAAGCGG 
GCGGACGGGG CGGACTTCGC CTACCTCCCG GCCCCCACCG TGCAGAGCCA CGCCGCCAAC
CTTCTCACGC TGCCGGACGG CCGGCTGGGC TGCGTCTGGT TCGGCGGCAC CCAGGAAGGC
GTGCCGGACA TCTCCATCTG GTTCTCCGCC CTGGAACCCG GCAGCAGCCA GTGGTCCGAA
CCGCAGCAGC TCTCCAACGA CTCCACCCGG TCCGAACAGA ACCCCATCCT CTTCACTGCG
CCGAATGGCG CGCTGTGGCT GCTGTACACC GCGCAGAAGG CGGGCAACCA GGACACCGCC
GAGGTCCGCC GTCGTACGTC CATGGACGGC GGCCGCACCT GGGGCGAGGT GGAGACGCTG
TTCCCCGCGA ACGAAACCGG CGGCGTGTTC GTCCGGCAGC TCCCGGTAGT GTTGCCGTCC
GGCCGCCTGA TCGTGCCGAT CTTCCGGTGC ATCACCACGC CGGGGGAGAA ATGGGTGGGC
AACAGCGACG ACAGCGCCGT GATGATCTCC GACGACGCCG GCGCCACGTG GACCGAGCAC
GTGCTGCCCG GGAGCCTGGG CTGCGTCCAC ATGAACATCC AGCCCGTGGC CGACGGCACC
CTGCTGGCAC TGTTCCGGAG CCGCTGGGCG GACTCGATCT ACGAATCCCG CTCCACCGAC
GACGGCTCCA CCTGGAGCGA GCCGGTCCCC ACCGAGCTGC CCAACAACAA CTCCTCCATC
CAGTTCACCG CGCTCGCCGA CGGACGCCTG GCACTCGTGT ACAACCACAG CCGGGCGGAG
GCTTCCACTG AGCGCCGGTT GTCCCTCTAC GACGAAATTG ACGACGACGG CCTGGCCGAG
GAGCAGGGCC AGGTGGCCGA GCCGGACGCT TCCGCTTTCT CTGAGGACGA TGGTTCGCGG
AAGGCATTCT GGGGCACCCC GCGCTCGCCG ATGACCCTGG CCATCTCCGA GGACTCCGGC
CGCTCGTGGC CCATCCGCCG GAACCTGGAT GTGGGGGACG GGTACTGCCT GTCCAACAAC
TCCCGCGACG GGCTGAACCG TGAGTACTCC TACCCGTCCA TCCACCAGGG CCCGGACGGT
TCCCTGAACA TCGCGTACAC GTACTTCCGG CAGGCCATCA AGTTCGTCCG GGTGGACCCG
CAGTGGGCGT ACGAAGGCAC CACCACTCCT GGCGGGGACG AATGA
 
Protein sequence
MTNTTTDAYS TITPDGTVKR ADGADFAYLP APTVQSHAAN LLTLPDGRLG CVWFGGTQEG 
VPDISIWFSA LEPGSSQWSE PQQLSNDSTR SEQNPILFTA PNGALWLLYT AQKAGNQDTA
EVRRRTSMDG GRTWGEVETL FPANETGGVF VRQLPVVLPS GRLIVPIFRC ITTPGEKWVG
NSDDSAVMIS DDAGATWTEH VLPGSLGCVH MNIQPVADGT LLALFRSRWA DSIYESRSTD
DGSTWSEPVP TELPNNNSSI QFTALADGRL ALVYNHSRAE ASTERRLSLY DEIDDDGLAE
EQGQVAEPDA SAFSEDDGSR KAFWGTPRSP MTLAISEDSG RSWPIRRNLD VGDGYCLSNN
SRDGLNREYS YPSIHQGPDG SLNIAYTYFR QAIKFVRVDP QWAYEGTTTP GGDE
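
The two sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the method used to produce this record: the helper names are hypothetical, and it assumes the sequence is pasted in the 10-base blocks shown here. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so a standard codon table is used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGACAAACA CCACCACCGA CGCGTACAGC"
print(translate(first_blocks))  # prints "MTNTTTDAYS"
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and with the GC content field listed in the record.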