Gene Rsph17029_3566 details


Gene Information

Locus tag: Rsph17029_3566
Symbol:
ID: 4898121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 657090
End bp: 658289
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 69%
IMG OID: 640114175
Product: cobalamin synthesis protein, P47K
Protein accession: YP_001045429
Protein GI: 126464316
COG category: [R] General function prediction only
COG ID: [COG0523] Putative GTPases (G3E family)
TIGRFAM ID:
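
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated on this page, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 657090
end_bp = 658289
gene_length_bp = 1200
protein_length_aa = 399

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which adds one
# codon that does not appear in the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)
```

Under these assumptions the span is 1200 bp, and 400 codons give 399 residues plus a stop, matching the listed gene and protein lengths.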


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.107446
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0714729
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGATA CCCGCCTTCC CGTGACCGTG CTCTCCGGCT TCCTCGGAGC CGGCAAGACC 
ACGCTTCTGA ACCATATTCT GGCCAACCGC GAGGGGCTGC GCGTGGCCGT CATCGTCAAT
GACATGTCGG AGGTGAACAT CGACGCCGAT CTGGTCCGCG CCGGCGGCAG CCTCTCGCGT
GGCGAGGAAC GGCTGGTCGA ACTGACCAAC GGCTGCATCT GCTGCACGCT GCGCGACGAC
CTGCTGACCG AGGTGCGCCG GCTGGCCGAG GAGGGGCGCT TCGACTATCT GCTGATCGAA
TCCACCGGCG TGTCCGAGCC GCTGCCCGTC GCCGCCACCT TCGAGTTCCG CGACGAGGAG
GGCCGGTCGC TCTCGGACAT CGCGCGGCTC GACACGATGG TGACGGTGGT CGATGCGGCC
AACCTCACCC GCGACTTTTC GGCGCAGGAT TTCCTGAAGG ACCGCGGTGC GGCGATGGCG
GAGGAGGACG AGCGCAGCCT TGTCCAGCTT CTGACCGAGC AGATCGAGTT CGCCGACGTG
ATCGTGCTGA ACAAGGTCTC GAGCGCGACG CCCGAACAGC TCGCCAGCGC ACGCGCCATC
CTGCGGGCGC TGAATGCCGA TGCCGCGATC CTCGAGACCG ATCACGGCCG CGCCCCGCCG
CGCGCCATCG TCGGCACCGG GCGGTTCAGC TTCGAGGCCG CGCACCGCCA TCCGACCTGG
GTCAAGGAGC TCTACGGCCA TGCCGACCAT GTGCCCGAGG ATGCCGAATA CGGCATCACC
AGCTTCGTCT GGCGGGCCGA ACGGCCGTTC GATCCGACGC GGATCGCCGA CTTCTTCGAC
ACGCCGCTGC CCGGCGTGAT CCGCGCCAAG GGCCATTTCT GGATCGCCAC GCGCCCCGAC
TGGGCGGGCG AGTTCTCGGT GGCGGGGCCG ATGGTCGAGG TGAAGGGGCT GGGCCTCTGG
TGGGCCGCCG TGCCCGAGGC GCACTGGCCG CCCGAGGCGC AGGAGCGGAT GGCTGCGCGC
GTGGCCGGAG AGTTCGGCGA CCGGCGGCAG GAGATGGTCT TCATCGGCAT GCTCGGGCAG
ATGAACCGGG CGCGGATCTC GGCCATGCTC GAGCGCTGCC TTGTCCCCGA AACCCGGTTC
GCGCCCGAGG CATGGACGGC GCTGGAAGAT CCGTTCCCGC GCTGGGGCCG CGCGGCATGA
 
Protein sequence
MTDTRLPVTV LSGFLGAGKT TLLNHILANR EGLRVAVIVN DMSEVNIDAD LVRAGGSLSR 
GEERLVELTN GCICCTLRDD LLTEVRRLAE EGRFDYLLIE STGVSEPLPV AATFEFRDEE
GRSLSDIARL DTMVTVVDAA NLTRDFSAQD FLKDRGAAMA EEDERSLVQL LTEQIEFADV
IVLNKVSSAT PEQLASARAI LRALNADAAI LETDHGRAPP RAIVGTGRFS FEAAHRHPTW
VKELYGHADH VPEDAEYGIT SFVWRAERPF DPTRIADFFD TPLPGVIRAK GHFWIATRPD
WAGEFSVAGP MVEVKGLGLW WAAVPEAHWP PEAQERMAAR VAGEFGDRRQ EMVFIGMLGQ
MNRARISAML ERCLVPETRF APEAWTALED PFPRWGRAA
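
As a spot check on translation table 11, the first 30 bases of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch rather than a full translator: it assumes the codon-to-amino-acid assignments of table 11 match the standard code, and that the first codon is read as methionine even when it is an alternative start such as GTG. The function names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses the same codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the initiator codon is read as methionine,
        # even when it is an alternative start such as GTG.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three groups of the gene sequence, as printed above.
first_30 = "GTGACCGATA CCCGCCTTCC CGTGACCGTG"
print(translate(first_30))
```

The result, MTDTRLPVTV, matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a figure that can be compared with the listed GC content of 69%.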