Gene Rsph17029_3679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3679 
Symbol 
ID4898485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp782257 
End bp783474 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content70% 
IMG OID640114287 
Productcobalamin synthesis protein, P47K 
Protein accessionYP_001045541 
Protein GI126464428 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.191921 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.393021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCCG ATCCCGCCCG TTCCGGCAAA CTGCCCACCG CGATCCTTTC GGGCTTTCTG 
GGTGCGGGAA AGACGACCCT GCTCAACCAC CTGCTGCGCA ACCGCAGCGG CCTGCGCGTG
GCCGTGATCG TGAACGACAT GTCCGAGATC TGCATCGACG CCGAGCTCGT GGCCTCGGGC
GGCGCCTCGC TCAGCCACAC CGAGGAGGAG CTGGTCGAGA TGTCGAACGG CTGCATCTGC
TGCACGCTGC GCGACGATCT GCTGCGCGAG GTGCGCCGTC TGGCGCTCGA GGGGCGGTTC
GATTACCTGC TGATCGAGGC GACCGGCATC TCCGAGCCGC TGCCCATCGC CGCCACCTTC
GAATTCTGCG ACGGCGCCGA ATCGAGCCTG AGCGACGTGG CGCGGCTCGA TGCGATGGTC
ACGGTGGTGG ATGCGGTCAA TCTCACGCAG GATTACCTGA GCCGCGACCT TCTGCGGGAC
CGCGGCGAGG TCCGCGGCAC GGACGACCAG CGCACGCTGG TCGAGCTGCT GGTGGACCAG
ATCGAATTCG CCGACATCGT GATCCTCAAC AAGACCCGCA CGGCGGGTCC CGAGCGCACC
GCGGCGGCGC GGCGCATCGT GCGGGCGCTC AATCCCGATG CGAAGCTGAT CGAGACCGAA
CAGAGCGAGG TCGATCCGCG CGAGATCCTC GACACCGGGC TCTTCGATCC GGCCCGTGCC
CGCAGCCATC CGCGCTGGCT GCAGGAGCTC TACGGTTTCG CGGCGCACAA GCCCGAGGAT
CAGGAATACG GCATCACCTC CTTCTGCTTC CGCGCGCGGG CGCCCTTCGA CGGCAAGCGC
ATCCGCGACG TGCTGACGGG CGAGCTGCCG GGGGTGATCC GGGCCAAGGG GCATTTCTGG
ACCGCCGACC ACCCCGACCG CGTGCTCCAG TTCAGCCAGG CGGGCAGCCT GCGCACCATC
GCGCAGAGCG GGCGCTGGTG GGCGGCCACG CCGCGCAGCG ACTGGCCCGC GGATCGGCGC
GTGATCGAGC GGATCGCGCG GCACTGGCGC CCGCCCTACG GCGACCGGCG TCAGGAGCTC
GTGTTCATCG GCACGCGCGA GATGGACTAC GGCAGGATCC ATCCGCTGAT CGACGCCTGC
CTGCTGCGGG GCACCCCGGT GCGCCGGCCG GCCATGACGG CCGCGGGCCT CGCGTCTCAG
GGCGCCCGAC GGGGCTAG
 
Protein sequence
MSSDPARSGK LPTAILSGFL GAGKTTLLNH LLRNRSGLRV AVIVNDMSEI CIDAELVASG 
GASLSHTEEE LVEMSNGCIC CTLRDDLLRE VRRLALEGRF DYLLIEATGI SEPLPIAATF
EFCDGAESSL SDVARLDAMV TVVDAVNLTQ DYLSRDLLRD RGEVRGTDDQ RTLVELLVDQ
IEFADIVILN KTRTAGPERT AAARRIVRAL NPDAKLIETE QSEVDPREIL DTGLFDPARA
RSHPRWLQEL YGFAAHKPED QEYGITSFCF RARAPFDGKR IRDVLTGELP GVIRAKGHFW
TADHPDRVLQ FSQAGSLRTI AQSGRWWAAT PRSDWPADRR VIERIARHWR PPYGDRRQEL
VFIGTREMDY GRIHPLIDAC LLRGTPVRRP AMTAAGLASQ GARRG