Gene Arth_1941 details

Gene Information

Locus tag: Arth_1941
Symbol:
ID: 4445525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2189594
End bp: 2191207
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 67%
IMG OID: 639689751
Product: hypothetical protein
Protein accession: YP_831423
Protein GI: 116670490
COG category: [N] Cell motility
COG ID: [COG5492] Bacterial surface proteins containing Ig-like domains
TIGRFAM ID:
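
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2189594, 2191207)
print(length)                     # 1614
print(protein_length_aa(length))  # 537
```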


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGGCC TTGGTGCTCT TCGACCTGGC CGCCGCTCCC GGGCATCCGA GCCCCGGCTC 
CTGGCGGTTT TACTTGTGCT CGCACTCGGC GTGCTGTTCG CTGCACCCGC CCGAGCAGTC
GACTACGGAC ATGACGTTTC GTGGCCACAG TGTCCGGGCG GTCTTCCGAT GCCGCCGGAA
GACACGGAGT TCGTGGTCGT GGGGCTGACC AACGGACTCG CATTCACCGA GAACCCCTGC
CTGGGCGGGC AGTTCCAGTG GGTGCTCGAC CGGGGCGTCC GGGCTCAGGC CTACGCCATG
GCTACGTTCC CCACCACCGC GCAATACGAA ACCTACGGCG ACGACGGTCC ATGGCCCGCA
AGCACCACGC CGGACCGGCT GCGCAATGTC GGGTACGCCG AGGGCCGCGC AGCTCTCGCG
TCTCTCGATG AGGTGGGATG GCGACCGGAG AGAATCTGGG TCGACGTCGA GCCCCGCCCG
CAACAGCCCT GGCCCACCTC GACAGCAGCC CAGCGGCAGG AGAACCGGTA CGTCATCTCC
GGGCTCCTGG CCGCACTGGC GGACGCCGGG TACCCGCACG GGATCTATTC CTATTCGAGC
GGTTGGGAGG CCATTACCGG ATCGTGGCAG CTTCCCGACG TCCCCGTCTG GTCACCGGCA
GGGCGTCTCG ACTTCGCCAG TGAAGCATCC GACCTCTGTG TGAATCGCAG CTTCTCCGGT
GGAGCCGTAC ACATCTCGCA ATGGACCGAC GGCACCTACG ACTACGACAT GACGTGCATC
GGGGTCTACC AGGCCCACGT CGCGACCATC GGCTGGCAGT CGAGCGTCTC GGACGGCGCC
ACCGCAGGAA CGACCGGCCG GTCACTGCCG ATGGAGGCCT TGCGCCTGTC GGTGGCAGGA
GACCGCCTGT CGGGCGACAT TCTGTGGAGG GGGCATGTGC AGAACATCGG CTGGCAGTCC
TGGACGACGT CGGCGTCCCC GATCGGAACG ACCGGGCGCG GTCTGCGCCT GGAGGCGTTC
GAACTGCGGT TGACGGGGGA TCTGGCCTCT CAGTACAGCA TCAGGTATCG CGCCCACGTG
CAGAACGTCG GCTGGCAGCC GTACGGGATC GACGGAGCCA CGGCCGGCAC CGTCGGGCAA
GGTCTGCGGG TGGAGGCCGT TACGATCGAG TTGGTTCCGA AGGTCGCACC AGCATTCACT
GCCGTGTACG CCGCCCACGT TCAGAACCTC GGCTGGATGG CGAACGTTTC GGATGGGACC
GTCGCCGGGA CCACGGGTCG GGCCCTTCGG GTCGAGGCGC TGCGCCTCAA CGTGTCCAGC
ACGGCTTATT CCGGGGACAT CGAGTGGCGG GGGCATGTGC AGTCGATCGG CTGGCAGCCG
TGGACATCCT CGGCCAATCC CATCGGCACG GCCGGGCAGG GGCTACGGTT GGAAGCGTTT
GAAATCAGGC TGACCGGTGA GCTGGCCAAC CACTACAGGA TCCACTACCG CGCCCACGTG
CAAGATTTGG GCTGGCAGTC ATGGGTCGCC GACGGCGGAA CGGCGGGCAC GTCGGGTATG
GGCAAACGGA TGGAGGCCGT GCAGATCCTC CTCGCACCCA AAACCGGCGG CTAG
 
Protein sequence
MAGLGALRPG RRSRASEPRL LAVLLVLALG VLFAAPARAV DYGHDVSWPQ CPGGLPMPPE 
DTEFVVVGLT NGLAFTENPC LGGQFQWVLD RGVRAQAYAM ATFPTTAQYE TYGDDGPWPA
STTPDRLRNV GYAEGRAALA SLDEVGWRPE RIWVDVEPRP QQPWPTSTAA QRQENRYVIS
GLLAALADAG YPHGIYSYSS GWEAITGSWQ LPDVPVWSPA GRLDFASEAS DLCVNRSFSG
GAVHISQWTD GTYDYDMTCI GVYQAHVATI GWQSSVSDGA TAGTTGRSLP MEALRLSVAG
DRLSGDILWR GHVQNIGWQS WTTSASPIGT TGRGLRLEAF ELRLTGDLAS QYSIRYRAHV
QNVGWQPYGI DGATAGTVGQ GLRVEAVTIE LVPKVAPAFT AVYAAHVQNL GWMANVSDGT
VAGTTGRALR VEALRLNVSS TAYSGDIEWR GHVQSIGWQP WTSSANPIGT AGQGLRLEAF
EIRLTGELAN HYRIHYRAHV QDLGWQSWVA DGGTAGTSGM GKRMEAVQIL LAPKTGG
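
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a minimal sketch like the one below. It assumes the sequences are pasted in as plain strings with the 10-character block spacing shown on this page. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon table is used here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGCCGGCC TTGGTGCTCT TCGACCTGGC"
print(translate(first_blocks))  # MAGLGALRPG
```

Applied to the full gene sequence, `translate` can be compared against the listed protein sequence, and `gc_fraction` against the GC content field.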