Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3624 |
Symbol | |
ID | 4898603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 716615 |
End bp | 717763 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640114232 |
Product | galactonate dehydratase |
Protein accession | YP_001045486 |
Protein GI | 126464373 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.694628 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.369645 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA CTGCGCTCAA GACCTTCATC GTGCCGCCGC GCTGGCTGTT CCTGAAGATC GAGACCGATG AGGGGATCGT CGGCTGGGGC GAGCCGGTGG TCGAGGGCAA GGCGCTGACG GTCGAGGCCG CGGTGCACGA GATGGCCGAC TATCTGATCG GGCAGGACCC GCTCAGGATC GAGGACCACT GGCAGGTGCT CTACCGCGGC GGCTTCTACC GCGGCGGTCC CGTGCTGATG AGCGCGCTGG CGGGCATCGA TCAGGCGCTG TGGGACATCA AGGGCCGCGC GGCGGGCCTG CCGGTGCATC AGATGCTGGG CGGCGCCTGC CGAGACCGGA TCCGCGTCTA TTCCTGGATC GGCGGGGACC GCCCCGAGGA TGTGGCGCAA GGCGCGCGCG AGGCCGTGGC GCGCGGCTTC ACCGCGATCA AGCTGAACGG GGCGGAAGAG CTTCAGATCG TCGACGGGCA CGACAAGATC GACCGGATCG TGGAGACCAT CGGCGCCGTG CGCGACGCGG TCGGGCCGCA TGTGGGGATC GGCGTCGACT TCCACGGCCG GGTGCAGAAG CCGATGGCCA AGGTGCTGAT CCATGCCCTC GATCCCTTCC GGCTGATGTT CATCGAAGAG CCGGTGCTGT CGGAGAACCT CGAGGCCCTG CCCGAGATCA CCCGCGGCAC CTCGACGCCC ATCGCTCTGG GCGAGCGGCT CTATTCCCGC TGGGATTTCA AGCGCGTGTT CGAAACGCGC TGCGTCGACA TCATCCAGCC CGACCTCAGC CACGCGGGCG GCATCACCGA ATGCCGCAAG ATCGCGGCGA TGGCCGAGGC CTACGACATC GGCGTGGCCT TCCATTGTCC GCTCGGACCC ATCGCGCTGG CCGCCTGCCT GCAGGTCGAT GCGGTCTCGC ACAATGCCTT CATCCAGGAG CAGAGCCTGG GCATCCACTA CAATGCCGGA AGCGACCTCC TGGATTATCT CGTGGACCCG TCGGTGTTCC GCTACGAGGC GGGCGCCGTC GAGATCCCGA CCGGCCCGGG CCTCGGGATC GAGATCGACG AGGGCGCCGT CCTTCGGGCT GCCGAGACCG GCCACCGCTG GCGCAACCCG CTCTGGCGCC ACGCGGACGG CTCGGTGGCG GAATGGTGA
|
Protein sequence | MKITALKTFI VPPRWLFLKI ETDEGIVGWG EPVVEGKALT VEAAVHEMAD YLIGQDPLRI EDHWQVLYRG GFYRGGPVLM SALAGIDQAL WDIKGRAAGL PVHQMLGGAC RDRIRVYSWI GGDRPEDVAQ GAREAVARGF TAIKLNGAEE LQIVDGHDKI DRIVETIGAV RDAVGPHVGI GVDFHGRVQK PMAKVLIHAL DPFRLMFIEE PVLSENLEAL PEITRGTSTP IALGERLYSR WDFKRVFETR CVDIIQPDLS HAGGITECRK IAAMAEAYDI GVAFHCPLGP IALAACLQVD AVSHNAFIQE QSLGIHYNAG SDLLDYLVDP SVFRYEAGAV EIPTGPGLGI EIDEGAVLRA AETGHRWRNP LWRHADGSVA EW
|
| |