Gene Rsph17029_3452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3452 
Symbol 
ID4898292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp523832 
End bp525274 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content69% 
IMG OID640114049 
Producthypothetical protein 
Protein accessionYP_001045317 
Protein GI126464204 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0700524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.443587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGG CTCCGCCCGA GAGCGTCGCC GGCGAACATT TCGATGTGAT TATTGTGGGC 
TCGGGCTTCG GCTCCTCGTT CTTCCTGCAT CGGCTGCTGC GCCAGCCGGG GCGGCGGGTC
CTCGTCCTGG AATGGGGTCG GCATTCGACA CACGACTGGC AGCTGGAGGA AGGGCGGAAT
TCGCCCGTCA CCGACAGCGA CACCTACGCC ACCAACTCCG AGAAGCCTTG GAACTTCACC
ATCGGATTCG GCGGAGGCAC CAACTGCTGG TTCGCCCAGA CGCCGCGCCT GCATCCGGCG
GATTTCCGGC TCGGGTCCGA TCACGGCGTG GCGCAGGACT GGCCCGTCTC CTACGACGAT
CTCGAACCCT ACTGGTGCGA GGCCGAGGAG ATCATGGCGG TTTCGGGCGA TCCCGACATG
GCGCAGGTCA TGCCGCGCTC GCGTCCCTTT CCGCAGCCGC CGCATGTGAT GCCCGATCCC
GACCGGCTGA TGAAGGCGGC GCGGCCCGAC AGCCATTTCG TGATGCCGAC CGCCCGGGCC
CGCATCGCCA CCGAGACGCG CGCGGCCTGC TGCGCCTCGC TGCGCTGCCA GATCTGCCCG
GCGGATGCGA AATTCACCGC GAACAATTCG CTCGTGCCGC TCTACGAGAC GCCGGGCGTG
ACCCTCTGCC TCGAGACGGA GGTGCGCCGG TTCGAGGCGG CGGGCTCGTC GATCTCGGCG
GCGGTGATCC GCGGCCCCGA CGGGCGCGAG CATAAGGTGA CGGGCGATCT CTTCGTGCTC
GGCGCCAACG CGATCCACAG CCCGGCGATC CTCCTGCGCT CGGATCTGGG CGGCGGGCTG
ACCGGCGTGG GGCTGCACGA ATCCTACGGC TGGTCGATGG AGGCCTGGCT CGACGGGGTC
GACAATTTCG GCGGCAGCAC CATCACGACG GGGCTCGACT TCGGCCTCTA CGACGGGCCG
CACCGCAAGG ATCGGGGCGC GGCGCTGGTC TATTTCGAGA ATCGCTGGTC GCACGGGATG
CGGCTCGGGG CCGAGCGGAT GCGCCAGACC CTGCCGCTCG TGATCGTGAC CGAGGACCTG
CCCGAGGACC GCAACCGCGT GACGCTGGAT GGCGAGGGGC GGGCCTTCAT CGACTATCAC
GGACCTTCGG ATTATGCGCT GCGCGGGATG GAGCGGGCCA AGGCCGCGCT GCCCGAGCTG
CTCGCGCCGC TGCCGGTCGA GAAGATCCTC GACCACGGCA TCCGCGAGAC GGAAAGCCAC
CTGCAGGGCA CGCTGCGGAT GGGATCCGAT CCGGCCACGT CCGTGGTGGA CGCGGGCCTT
GTCCATCACC GGCTGCGCAA TCTGGTGGTG GTGGGCACCA GCACCTTCCC CACCTGCTCG
GCCGCCAACC CTTCGCTCAC CGCCGCCGCG CTGTCGCTGC GCGCCGCCGA CCTTCTGATC
TGA
 
Protein sequence
MNLAPPESVA GEHFDVIIVG SGFGSSFFLH RLLRQPGRRV LVLEWGRHST HDWQLEEGRN 
SPVTDSDTYA TNSEKPWNFT IGFGGGTNCW FAQTPRLHPA DFRLGSDHGV AQDWPVSYDD
LEPYWCEAEE IMAVSGDPDM AQVMPRSRPF PQPPHVMPDP DRLMKAARPD SHFVMPTARA
RIATETRAAC CASLRCQICP ADAKFTANNS LVPLYETPGV TLCLETEVRR FEAAGSSISA
AVIRGPDGRE HKVTGDLFVL GANAIHSPAI LLRSDLGGGL TGVGLHESYG WSMEAWLDGV
DNFGGSTITT GLDFGLYDGP HRKDRGAALV YFENRWSHGM RLGAERMRQT LPLVIVTEDL
PEDRNRVTLD GEGRAFIDYH GPSDYALRGM ERAKAALPEL LAPLPVEKIL DHGIRETESH
LQGTLRMGSD PATSVVDAGL VHHRLRNLVV VGTSTFPTCS AANPSLTAAA LSLRAADLLI