Gene Rsph17029_3624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3624 
Symbol 
ID4898603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp716615 
End bp717763 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content68% 
IMG OID640114232 
Productgalactonate dehydratase 
Protein accessionYP_001045486 
Protein GI126464373 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.694628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.369645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA CTGCGCTCAA GACCTTCATC GTGCCGCCGC GCTGGCTGTT CCTGAAGATC 
GAGACCGATG AGGGGATCGT CGGCTGGGGC GAGCCGGTGG TCGAGGGCAA GGCGCTGACG
GTCGAGGCCG CGGTGCACGA GATGGCCGAC TATCTGATCG GGCAGGACCC GCTCAGGATC
GAGGACCACT GGCAGGTGCT CTACCGCGGC GGCTTCTACC GCGGCGGTCC CGTGCTGATG
AGCGCGCTGG CGGGCATCGA TCAGGCGCTG TGGGACATCA AGGGCCGCGC GGCGGGCCTG
CCGGTGCATC AGATGCTGGG CGGCGCCTGC CGAGACCGGA TCCGCGTCTA TTCCTGGATC
GGCGGGGACC GCCCCGAGGA TGTGGCGCAA GGCGCGCGCG AGGCCGTGGC GCGCGGCTTC
ACCGCGATCA AGCTGAACGG GGCGGAAGAG CTTCAGATCG TCGACGGGCA CGACAAGATC
GACCGGATCG TGGAGACCAT CGGCGCCGTG CGCGACGCGG TCGGGCCGCA TGTGGGGATC
GGCGTCGACT TCCACGGCCG GGTGCAGAAG CCGATGGCCA AGGTGCTGAT CCATGCCCTC
GATCCCTTCC GGCTGATGTT CATCGAAGAG CCGGTGCTGT CGGAGAACCT CGAGGCCCTG
CCCGAGATCA CCCGCGGCAC CTCGACGCCC ATCGCTCTGG GCGAGCGGCT CTATTCCCGC
TGGGATTTCA AGCGCGTGTT CGAAACGCGC TGCGTCGACA TCATCCAGCC CGACCTCAGC
CACGCGGGCG GCATCACCGA ATGCCGCAAG ATCGCGGCGA TGGCCGAGGC CTACGACATC
GGCGTGGCCT TCCATTGTCC GCTCGGACCC ATCGCGCTGG CCGCCTGCCT GCAGGTCGAT
GCGGTCTCGC ACAATGCCTT CATCCAGGAG CAGAGCCTGG GCATCCACTA CAATGCCGGA
AGCGACCTCC TGGATTATCT CGTGGACCCG TCGGTGTTCC GCTACGAGGC GGGCGCCGTC
GAGATCCCGA CCGGCCCGGG CCTCGGGATC GAGATCGACG AGGGCGCCGT CCTTCGGGCT
GCCGAGACCG GCCACCGCTG GCGCAACCCG CTCTGGCGCC ACGCGGACGG CTCGGTGGCG
GAATGGTGA
 
Protein sequence
MKITALKTFI VPPRWLFLKI ETDEGIVGWG EPVVEGKALT VEAAVHEMAD YLIGQDPLRI 
EDHWQVLYRG GFYRGGPVLM SALAGIDQAL WDIKGRAAGL PVHQMLGGAC RDRIRVYSWI
GGDRPEDVAQ GAREAVARGF TAIKLNGAEE LQIVDGHDKI DRIVETIGAV RDAVGPHVGI
GVDFHGRVQK PMAKVLIHAL DPFRLMFIEE PVLSENLEAL PEITRGTSTP IALGERLYSR
WDFKRVFETR CVDIIQPDLS HAGGITECRK IAAMAEAYDI GVAFHCPLGP IALAACLQVD
AVSHNAFIQE QSLGIHYNAG SDLLDYLVDP SVFRYEAGAV EIPTGPGLGI EIDEGAVLRA
AETGHRWRNP LWRHADGSVA EW