Gene Rsph17025_3248 details


Gene Information

Locus tag: Rsph17025_3248
Symbol:
ID: 5085997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 116117
End bp: 117559
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 69%
IMG OID: 640484820
Product: hypothetical protein
Protein accession: YP_001169437
Protein GI: 146279279
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
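
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a complete CDS ending in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(116117, 117559)
print(length_bp, protein_length(length_bp))  # 1443 480
```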


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.713925
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.212703
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTGG CTCCGCCCGA AAGCGTCGCC GGCGAGCATT TCGACGTGGT GATCGTGGGC 
TCGGGCTTCG GCTCTTCCTT TTTCCTGCAC AGGCTGATGC GGCAGCCCGG CCGGCGGGTG
CTCGTCCTTG AATGGGGCGG TCACGCGACG CATGACTGGC AGCTTGACGA GGGCCGGAAC
TCGTCGGTTG CCGACAGCGA CACCTATGCC ACCAACTCCG ACAAGCCCTG GAACTTCACC
GTCGGGTTCG GGGGCGGCAC GAACTGCTGG TTTGCCCAGA CGCCCCGGCT TCATCCGGCC
GATTTCCGGC TGGGAACCGA TCATGGCGTG GCGCCGGACT GGCCGATCAC CTACGACGAT
CTCGAGACCT ACTGGTGCGA CGCGGAAGAG ATCATGGCGG TCTCCGGCGA TCCCGACATG
GCGCGCGTCA TGCCGCGCTC GCGTCCCTTC CCGCAGCCGC CCCATCGGAT GCCCGATCCC
GACCGGCTGA TGAAGGCGGC CCGCCCCGAC AGCCACTTCG TCATGCCGAC GGCGCGGGCC
CGGATCGCCA CCGAGACGCG GGCCGCCTGC TGCGCGTCGC TGCGTTGCCA GATCTGCCCC
GCCGACGCCA AGTTCACCGC CAACAACTCG CTCGTGCCGC TCTATGAGGC CGAGGGCGTC
ACGCTCTGTC TTGAGGCCGA GGTGCGCCGG TTCGAGGCGG CGGGCTCGTC CATCTCGGCC
GCCGTGTTCC GGGGCTCGGA CGGGCGCGAG CACCGCGTGA CGGGGGATCT CTTCGTCCTT
GGCGCCAATG CGATCCACAG CCCCGCGATC CTTCTCCGGT CCGATCTGGG GGGCGGGCTG
ACGGGTGTGG GGCTGCATGA ATCCTACGGC TGGTCGATGG AGGCCTGGCT CGACGGTGTG
GAGAATTTCG GCGGCAGCAC CATCACGACC GGCCTCGACT TCGGCCTCTA TGACGGGCCG
CACCGCAAGA CCGAGGGCGC CGCGCTGGTC TATTTCGAGA ACCGCTGGTC GCACGGGATG
CGCCTTGGCG CCGAGCGGAT GCGCCAGACG CTGCCGCTGG TGATCGTGAC CGAGGATCTG
CCGGAAAACA GGAACCGCGT GACGCTCGAC GGTGAGGGCG GGGCCTTCGT CGAGTATCAC
GGGCCGTCGG ACTATGCGCT GCGCGGGATG GAGCGGGCGA AGGCCGCGCT GCCGGATCTG
TTGGCGCCGC TGCCGGTCGA GCGGATCCTC GACCACGGCA TCCGCGAGAC GGAGTCGCAT
CTGCAGGGCA CGCTGCGGAT GGGCCACGAT CCGGCCACCT CGGTCGTCGA TGCGGGGCTC
GTGCATCACC GGCTGCGCAA TCTCGTCGTG GTGGGGACGA GCACCTTCCC CACCTGCTCG
GCCGCCAATC CCTCGCTGAC CGCCGCGGCG CTTTCGCTGC GCGCGGCCGA CCTGCTGATC
TGA
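
The GC content reported for this gene can in principle be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as whitespace-separated blocks as shown in this record; `gc_content` is a hypothetical helper name, and the example runs on the first two blocks only rather than the full gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence, as an illustration.
first_blocks = "ATGAACCTGG CTCCGCCCGA"
print(f"{gc_content(first_blocks):.2f}")
```

Passing the complete gene sequence to the same function would give the value to compare against the GC content field.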
 
Protein sequence
MNLAPPESVA GEHFDVVIVG SGFGSSFFLH RLMRQPGRRV LVLEWGGHAT HDWQLDEGRN 
SSVADSDTYA TNSDKPWNFT VGFGGGTNCW FAQTPRLHPA DFRLGTDHGV APDWPITYDD
LETYWCDAEE IMAVSGDPDM ARVMPRSRPF PQPPHRMPDP DRLMKAARPD SHFVMPTARA
RIATETRAAC CASLRCQICP ADAKFTANNS LVPLYEAEGV TLCLEAEVRR FEAAGSSISA
AVFRGSDGRE HRVTGDLFVL GANAIHSPAI LLRSDLGGGL TGVGLHESYG WSMEAWLDGV
ENFGGSTITT GLDFGLYDGP HRKTEGAALV YFENRWSHGM RLGAERMRQT LPLVIVTEDL
PENRNRVTLD GEGGAFVEYH GPSDYALRGM ERAKAALPDL LAPLPVERIL DHGIRETESH
LQGTLRMGHD PATSVVDAGL VHHRLRNLVV VGTSTFPTCS AANPSLTAAA LSLRAADLLI
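
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged illustration, the sketch below builds a codon table from the standard NCBI amino-acid string (TCAG base order), which table 11 shares with the standard code for codon-to-residue assignments, and translates short excerpts of the gene. The `translate` helper is an assumption for this example, not a tool used by the source database, and it does not model alternative start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGAACCTGG CTCCGCCCGA AAGCGTCGCC"))  # MNLAPPESVA
```

The printed residues match the first block of the protein sequence; translating the final bases of the gene (`GACCTGCTGATCTGA`) likewise ends at the TGA stop codon after `DLLI`.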