Gene Rsph17025_3593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3593 
Symbol 
ID5085745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp483274 
End bp484986 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content69% 
IMG OID640485151 
Producthypothetical protein 
Protein accessionYP_001169767 
Protein GI146279609 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.185956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.226258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCGG ATCCGACTTT CGACGTCGCC ATCGTGGGCA GCGGTCCCGC CGGTCTGGCC 
GTTGCCAGCC GCCTTGTCGG CCGCGGCCTC TCTCTCGTTC TGATCGAGGC TGGCAACGCC
GATCGCGACC GGCAGGACGA ACAGGACAGC CTCTCCGCCG AGAATGAGGT GGGGCCGAAA
CATCCGGCTG CGCATCTCTA CCGCCGGCGG ATGCTGGGGG GATCCTCGAC CGTCTGGGGC
GGGCGCTGCA TTCCGTTCGA TCAGGCCGAC TATCTCATCG GCGGCGACGG GACCGGCTGG
CCCATCGACC CCGCCGAGAT CGAGCGCCAC CAGCGGGTGG CGGCAGAGTT CCTCGACTGC
GGCGAGCCGC AGTTCGACGA GGCGGCCTTC GAGACGCCCG CATGGTGGAG CCGCTCTCCG
CGGATCGACC TCGACCTTGA CCTGATCGAA CGGTTCTCGC GCCCGACAAA CCTCTGGCGC
AAGATGCGCG ACAGCCTGCG CGCGCGGAGC GACCTCAGGC TCCTCGCGGA TCATGTGGTG
GTGCGGGTGG ACCTCTCGTC CGATGGCACC CGCGTCGAGG GGCTGCGGAC GATCGACCGC
CGCTCGGGAT CCGGCGACCT TCTGCGCGCA CGTCATGTGG TTCTTGCCTG CGGAGGGATC
GAGACCACCC GGCTGCTCCT GGCCTCGAGG AACGTCCAGC CGCGCGGGAT CGGCAACCAC
AGCGATCAGC TTGGCCGCCA CTACATGACG CATCTCATCG GCGATGTGGG CGAGCTGGAC
CTTTCGCCCG CCTTCGATCA GGCGCGCATC GATTACCGCC GGACCCGGGA CGGGATCTAT
GCCCGCAGCC TGATCCGGCT CTCGCCGGCC CTCCGCCTGA GGGAGCGCCT GCCGAATGCG
GTCTGGCGCC CGGTGTCGCC GCCCTTCTGG AACCCGTCCC ATCACGATCC GATCCTGTCG
GCGGTCCATC TGGCCAAGGC GATCCTTCCC AAGGAGTACC ACGGCCATCC CGCCGAGGCG
CTCGCGCAGC GGAACGGCTG GCGCGACCCG GCCGCCCATG TCGCCAACAT CCTGCGCCAT
CCGGGCACTC TGGCCGCCTA TGTGCCGGTG ATCATGGCAA AGCGCATTCT CGCCCGGCGC
AAGCTGCCCT CGGTGTTCCT GCTTCGCCCG GACCGTCACT ACCGTCTGGA GATCAATGCC
GAGCAGCTCC CGGATCCGTC CTCGCGCATC ACGCTGGGCG ACAGCCGGGA CCGCTGGGGC
ATGCCGCGGA TCCGCCTCGA CTGGCAGGTG AACGGTGCAA CCCTCGAAGG CGTGCGCCAG
AGCCTCGCCC ACCTCGCCGG GCTGATGCCG AAACACGGGG TTGGCCGCCT TCTCATGCAG
CCCGACCAGG TCGCCGAGGG GCTGGTCTCG CAGGGCGGTC ACCACATCGG CACAACGCGG
ATGGGCAAGT CCCCGGAGAG GGGCGTGGTG GACAGCGATT GCACCGTCTT CGGCGTGCCG
AACCTCCATA TTGCGGGGGC ATCGGTCTTC CCGACTTCGG GGGCGGTGAA CCCGACGCTG
CTGCTGACCT GCGTGGCCTT CCGCCTCGCC GATCATCTGC TTGCCCGACT TGCGCCCGCA
CCCGTCCTTG CGCTTGCCAC CTCCGAACCC GCCCCGCCGC GTCCGCTCGC CGAGCTGCCC
CCGGTCGCGG CCGCCGTGCA GGCCCTACCC TGA
 
Protein sequence
MEADPTFDVA IVGSGPAGLA VASRLVGRGL SLVLIEAGNA DRDRQDEQDS LSAENEVGPK 
HPAAHLYRRR MLGGSSTVWG GRCIPFDQAD YLIGGDGTGW PIDPAEIERH QRVAAEFLDC
GEPQFDEAAF ETPAWWSRSP RIDLDLDLIE RFSRPTNLWR KMRDSLRARS DLRLLADHVV
VRVDLSSDGT RVEGLRTIDR RSGSGDLLRA RHVVLACGGI ETTRLLLASR NVQPRGIGNH
SDQLGRHYMT HLIGDVGELD LSPAFDQARI DYRRTRDGIY ARSLIRLSPA LRLRERLPNA
VWRPVSPPFW NPSHHDPILS AVHLAKAILP KEYHGHPAEA LAQRNGWRDP AAHVANILRH
PGTLAAYVPV IMAKRILARR KLPSVFLLRP DRHYRLEINA EQLPDPSSRI TLGDSRDRWG
MPRIRLDWQV NGATLEGVRQ SLAHLAGLMP KHGVGRLLMQ PDQVAEGLVS QGGHHIGTTR
MGKSPERGVV DSDCTVFGVP NLHIAGASVF PTSGAVNPTL LLTCVAFRLA DHLLARLAPA
PVLALATSEP APPRPLAELP PVAAAVQALP