Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1517 |
Symbol | |
ID | 5084766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1555551 |
End bp | 1556588 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640483076 |
Product | cobalamin biosynthesis protein CobW |
Protein accession | YP_001167716 |
Protein GI | 146277557 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | [TIGR02475] cobalamin biosynthesis protein CobW |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0241447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0776458 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC TTGCCAAGAT CCCCGTCACG GTGATCACGG GTTTCCTCGG CGCCGGAAAG ACCACGCTGA TCCGGCACCT GATGCAGAAC CTGGGCGGGC GCCGCCTTGC GGTCCTGGTC AACGAGTTCG GCACCGTGGG CGTCGATGGC GACCTGATCC GCGCCTGCGC CGACGAGAAC TGCCCCGACG AGGCGATCGT GGAGCTGGCG AACGGCTGCC TCTGCTGCAC CGTGGCCGAC GAGTTCATCC CCACCATCGA GGCGCTGATG GCGCTGCCCA GGCGCCCGGA TCACATCCTG ATCGAGACCT CGGGCCTTGC GCTGCCGAAG CCGCTCCTGA AGGCCTTCGA CTGGCCCGCC ATCCGCTCGC GCATCACGGT CGATGGCGTG ATCGCTCTGG CCGATGCCGA GGCCGTTGCC GCGGGCCGCT TTGCCCCCGA TGCCGACGCC GTGGCCGCGC AGGCCCAGGC CGAGGGCGCC GATCACGAGA CCCCGCTTTC GGAGGTGTTC GAGGATCAGC TCGCCTGTGC CGACCTCGTG CTTCTGACCA AGGCCGATCT CGCGGGCGAG GCGGGCCTTG CCGTCGCCCG CGCGGTGGTC GAGGCGGAAT CGCCGCGGCC GATCCCGATC CTCGCCGTGA CCGAGGGCGC GGTCGATCCG CAGGTGATCC TCGGGATCGA GGCCGCGGCC GAGGACGATC TCGCCGCCCG CCCCTCGCAC CATGACGGGG CCGACGATCA CGAGCACGAC GATTTCGCCT CGACCGTCGT CGATCTGCCC GAGATCGCCG ATCCCGAGCG TCTGGCCGAG GCGATCCGGG CGCTCGCGAC CGAGCGCAAC GTCCTCCGCG TGAAGGGCCA TGTGGCGGTT CAGGGCAAGC CGATGCGGCT TCTCGTGCAG GCGGTGGGTG CGCGCGTCCG CCACCAGTTC GACCGGCCCT GGAACGGCGC GCGGCAGAGC CGTCTCGTGA TCATTGCCGA GCGCGGCGAT CTGGACGAGG CCGCGATCCG GCAGGATCTT CTGGCGCGGA TCGGCTGA
|
Protein sequence | MTDLAKIPVT VITGFLGAGK TTLIRHLMQN LGGRRLAVLV NEFGTVGVDG DLIRACADEN CPDEAIVELA NGCLCCTVAD EFIPTIEALM ALPRRPDHIL IETSGLALPK PLLKAFDWPA IRSRITVDGV IALADAEAVA AGRFAPDADA VAAQAQAEGA DHETPLSEVF EDQLACADLV LLTKADLAGE AGLAVARAVV EAESPRPIPI LAVTEGAVDP QVILGIEAAA EDDLAARPSH HDGADDHEHD DFASTVVDLP EIADPERLAE AIRALATERN VLRVKGHVAV QGKPMRLLVQ AVGARVRHQF DRPWNGARQS RLVIIAERGD LDEAAIRQDL LARIG
|
| |