Gene Rsph17025_1517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1517 
Symbol 
ID5084766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1555551 
End bp1556588 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content70% 
IMG OID640483076 
Productcobalamin biosynthesis protein CobW 
Protein accessionYP_001167716 
Protein GI146277557 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID[TIGR02475] cobalamin biosynthesis protein CobW 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0241447 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0776458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC TTGCCAAGAT CCCCGTCACG GTGATCACGG GTTTCCTCGG CGCCGGAAAG 
ACCACGCTGA TCCGGCACCT GATGCAGAAC CTGGGCGGGC GCCGCCTTGC GGTCCTGGTC
AACGAGTTCG GCACCGTGGG CGTCGATGGC GACCTGATCC GCGCCTGCGC CGACGAGAAC
TGCCCCGACG AGGCGATCGT GGAGCTGGCG AACGGCTGCC TCTGCTGCAC CGTGGCCGAC
GAGTTCATCC CCACCATCGA GGCGCTGATG GCGCTGCCCA GGCGCCCGGA TCACATCCTG
ATCGAGACCT CGGGCCTTGC GCTGCCGAAG CCGCTCCTGA AGGCCTTCGA CTGGCCCGCC
ATCCGCTCGC GCATCACGGT CGATGGCGTG ATCGCTCTGG CCGATGCCGA GGCCGTTGCC
GCGGGCCGCT TTGCCCCCGA TGCCGACGCC GTGGCCGCGC AGGCCCAGGC CGAGGGCGCC
GATCACGAGA CCCCGCTTTC GGAGGTGTTC GAGGATCAGC TCGCCTGTGC CGACCTCGTG
CTTCTGACCA AGGCCGATCT CGCGGGCGAG GCGGGCCTTG CCGTCGCCCG CGCGGTGGTC
GAGGCGGAAT CGCCGCGGCC GATCCCGATC CTCGCCGTGA CCGAGGGCGC GGTCGATCCG
CAGGTGATCC TCGGGATCGA GGCCGCGGCC GAGGACGATC TCGCCGCCCG CCCCTCGCAC
CATGACGGGG CCGACGATCA CGAGCACGAC GATTTCGCCT CGACCGTCGT CGATCTGCCC
GAGATCGCCG ATCCCGAGCG TCTGGCCGAG GCGATCCGGG CGCTCGCGAC CGAGCGCAAC
GTCCTCCGCG TGAAGGGCCA TGTGGCGGTT CAGGGCAAGC CGATGCGGCT TCTCGTGCAG
GCGGTGGGTG CGCGCGTCCG CCACCAGTTC GACCGGCCCT GGAACGGCGC GCGGCAGAGC
CGTCTCGTGA TCATTGCCGA GCGCGGCGAT CTGGACGAGG CCGCGATCCG GCAGGATCTT
CTGGCGCGGA TCGGCTGA
 
Protein sequence
MTDLAKIPVT VITGFLGAGK TTLIRHLMQN LGGRRLAVLV NEFGTVGVDG DLIRACADEN 
CPDEAIVELA NGCLCCTVAD EFIPTIEALM ALPRRPDHIL IETSGLALPK PLLKAFDWPA
IRSRITVDGV IALADAEAVA AGRFAPDADA VAAQAQAEGA DHETPLSEVF EDQLACADLV
LLTKADLAGE AGLAVARAVV EAESPRPIPI LAVTEGAVDP QVILGIEAAA EDDLAARPSH
HDGADDHEHD DFASTVVDLP EIADPERLAE AIRALATERN VLRVKGHVAV QGKPMRLLVQ
AVGARVRHQF DRPWNGARQS RLVIIAERGD LDEAAIRQDL LARIG