Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1941 |
Symbol | |
ID | 4445525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2189594 |
End bp | 2191207 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639689751 |
Product | hypothetical protein |
Protein accession | YP_831423 |
Protein GI | 116670490 |
COG category | [N] Cell motility |
COG ID | [COG5492] Bacterial surface proteins containing Ig-like domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGGCC TTGGTGCTCT TCGACCTGGC CGCCGCTCCC GGGCATCCGA GCCCCGGCTC CTGGCGGTTT TACTTGTGCT CGCACTCGGC GTGCTGTTCG CTGCACCCGC CCGAGCAGTC GACTACGGAC ATGACGTTTC GTGGCCACAG TGTCCGGGCG GTCTTCCGAT GCCGCCGGAA GACACGGAGT TCGTGGTCGT GGGGCTGACC AACGGACTCG CATTCACCGA GAACCCCTGC CTGGGCGGGC AGTTCCAGTG GGTGCTCGAC CGGGGCGTCC GGGCTCAGGC CTACGCCATG GCTACGTTCC CCACCACCGC GCAATACGAA ACCTACGGCG ACGACGGTCC ATGGCCCGCA AGCACCACGC CGGACCGGCT GCGCAATGTC GGGTACGCCG AGGGCCGCGC AGCTCTCGCG TCTCTCGATG AGGTGGGATG GCGACCGGAG AGAATCTGGG TCGACGTCGA GCCCCGCCCG CAACAGCCCT GGCCCACCTC GACAGCAGCC CAGCGGCAGG AGAACCGGTA CGTCATCTCC GGGCTCCTGG CCGCACTGGC GGACGCCGGG TACCCGCACG GGATCTATTC CTATTCGAGC GGTTGGGAGG CCATTACCGG ATCGTGGCAG CTTCCCGACG TCCCCGTCTG GTCACCGGCA GGGCGTCTCG ACTTCGCCAG TGAAGCATCC GACCTCTGTG TGAATCGCAG CTTCTCCGGT GGAGCCGTAC ACATCTCGCA ATGGACCGAC GGCACCTACG ACTACGACAT GACGTGCATC GGGGTCTACC AGGCCCACGT CGCGACCATC GGCTGGCAGT CGAGCGTCTC GGACGGCGCC ACCGCAGGAA CGACCGGCCG GTCACTGCCG ATGGAGGCCT TGCGCCTGTC GGTGGCAGGA GACCGCCTGT CGGGCGACAT TCTGTGGAGG GGGCATGTGC AGAACATCGG CTGGCAGTCC TGGACGACGT CGGCGTCCCC GATCGGAACG ACCGGGCGCG GTCTGCGCCT GGAGGCGTTC GAACTGCGGT TGACGGGGGA TCTGGCCTCT CAGTACAGCA TCAGGTATCG CGCCCACGTG CAGAACGTCG GCTGGCAGCC GTACGGGATC GACGGAGCCA CGGCCGGCAC CGTCGGGCAA GGTCTGCGGG TGGAGGCCGT TACGATCGAG TTGGTTCCGA AGGTCGCACC AGCATTCACT GCCGTGTACG CCGCCCACGT TCAGAACCTC GGCTGGATGG CGAACGTTTC GGATGGGACC GTCGCCGGGA CCACGGGTCG GGCCCTTCGG GTCGAGGCGC TGCGCCTCAA CGTGTCCAGC ACGGCTTATT CCGGGGACAT CGAGTGGCGG GGGCATGTGC AGTCGATCGG CTGGCAGCCG TGGACATCCT CGGCCAATCC CATCGGCACG GCCGGGCAGG GGCTACGGTT GGAAGCGTTT GAAATCAGGC TGACCGGTGA GCTGGCCAAC CACTACAGGA TCCACTACCG CGCCCACGTG CAAGATTTGG GCTGGCAGTC ATGGGTCGCC GACGGCGGAA CGGCGGGCAC GTCGGGTATG GGCAAACGGA TGGAGGCCGT GCAGATCCTC CTCGCACCCA AAACCGGCGG CTAG
|
Protein sequence | MAGLGALRPG RRSRASEPRL LAVLLVLALG VLFAAPARAV DYGHDVSWPQ CPGGLPMPPE DTEFVVVGLT NGLAFTENPC LGGQFQWVLD RGVRAQAYAM ATFPTTAQYE TYGDDGPWPA STTPDRLRNV GYAEGRAALA SLDEVGWRPE RIWVDVEPRP QQPWPTSTAA QRQENRYVIS GLLAALADAG YPHGIYSYSS GWEAITGSWQ LPDVPVWSPA GRLDFASEAS DLCVNRSFSG GAVHISQWTD GTYDYDMTCI GVYQAHVATI GWQSSVSDGA TAGTTGRSLP MEALRLSVAG DRLSGDILWR GHVQNIGWQS WTTSASPIGT TGRGLRLEAF ELRLTGDLAS QYSIRYRAHV QNVGWQPYGI DGATAGTVGQ GLRVEAVTIE LVPKVAPAFT AVYAAHVQNL GWMANVSDGT VAGTTGRALR VEALRLNVSS TAYSGDIEWR GHVQSIGWQP WTSSANPIGT AGQGLRLEAF EIRLTGELAN HYRIHYRAHV QDLGWQSWVA DGGTAGTSGM GKRMEAVQIL LAPKTGG
|
| |