Gene Information |
Locus tag | Ent638_3241 |
Symbol | |
ID | 5112955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 3532985 |
End bp | 3534328 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640493445 |
Product | D-glucarate dehydratase |
Protein accession | YP_001177956 |
Protein GI | 146312882 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
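The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS of this length, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3532985, 3534328)
print(length)                     # 1344
print(protein_length_aa(length))  # 447
```

Both results agree with the Gene Length (1344 bp) and Protein Length (447 aa) fields. The strand ("-") does not affect the length calculation, only the orientation in which the sequence is read from the replicon.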
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0695422 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATA CGCAATCCAG CCCAGTAATT ACCGATATGC AGGTCATTCC GGTGGCCGGC TACGACAGTA TGCTGCTCAA TATTGGCGGC GCGCATAATG CGTATTTCAC GCGCAATATC GTGATTCTCA CCGACAGCGC CGGGAACACC GGCGTAGGGG AAGCGCCAGG CGGTGAAGTG ATTCTCCAGA CGCTGCTTGA TGCCATTCCG ATGGTCATCG GTCGGGAAAT TGCCCGACTG AACAACGTGG TGCAGCAAGT GCACAAGGGC AATCAAGCCG CCGATTTCGA TACGTTCGGC AAAGGCGCCT GGACGTTTGA ACTGCGCGTG AACGCTGTCG CCGCGCTAGA AGCGGCGCTG CTGGACCTTC TCGGAAAAAC GCTGAACGTC CCGGTGTGTG AACTGCTCGG CCCGGGCAAA CAGCGCGATG CGGTGACGGT GCTGGGGTAT CTCTTCTACG TGGGCGATCG CACTAAAACC GATCTGCCGT ATCTGGAAAA CACTGCGGGC AATCATGACT GGTACCACTT GCGTCACCAG GAGGCACTGA CCCGCGACGC GGTGGTTCGT CTCGCAGAAG CCGCACAGGA TCGCTATGGG TTTAAAGACT TCAAACTGAA GGGCGGCGTA CTGCCGGGTG AGCAGGAAAT CGAGGCCGCT CGCGCACTCA AGAAACGTTT CCCGGATGCG CGTATTACCG TCGATCCCAA CGGGGCGTGG CTCCTGGATG AAGCCATTTC CCTGTGCAAA GGGCTGAACG ATGTGCTGAC CTATGCGGAA GATCCTTGTG GCGCGGAACA GGGTTTTTCA GGACGCGAAG TGATGGCTGA ATTCCGTCGC GCAACAGGAT TGCCGGTGGC GACCAATATG ATCGCCACCA ACTGGCGAGA AATGGGTCAC GCGGTCATGC TGAATTCAGT CGATATTCCG TTGGCCGATC CGCATTTCTG GACGCTTTCA GGCGCAGTGC GTGTCGCGCA ACTCTGCGAC GACTGGGGCC TGACCTGGGG TTGCCACTCC AATAATCATT TTGATATCTC ACTGGCGATG TTCACGCACG TTGGTGCCGC TGCGCCGGGT ACGCCAACGG CCATCGACAC CCACTGGATT TGGCAGGAGG GCGACGCGCG CCTGATCAAA AATCCCCTTG AGATTAAAAA CGGCAAAATC GCCGTACCGG ATGCACCGGG ATTAGGCGTC GAACTGGACT GGGATCAAAT CCATAAGGCG CATGAGCTTT ACAAGAAGTT GCCAGGCGGC GCGCGTAATG ACGCTGGTCC GATGCAATAT CTTGTTCCCG GCTGGACATT TGACCGCAAA CGCCCTGCGT TCGGTCGCCA CTGA
|
Protein sequence | MMNTQSSPVI TDMQVIPVAG YDSMLLNIGG AHNAYFTRNI VILTDSAGNT GVGEAPGGEV ILQTLLDAIP MVIGREIARL NNVVQQVHKG NQAADFDTFG KGAWTFELRV NAVAALEAAL LDLLGKTLNV PVCELLGPGK QRDAVTVLGY LFYVGDRTKT DLPYLENTAG NHDWYHLRHQ EALTRDAVVR LAEAAQDRYG FKDFKLKGGV LPGEQEIEAA RALKKRFPDA RITVDPNGAW LLDEAISLCK GLNDVLTYAE DPCGAEQGFS GREVMAEFRR ATGLPVATNM IATNWREMGH AVMLNSVDIP LADPHFWTLS GAVRVAQLCD DWGLTWGCHS NNHFDISLAM FTHVGAAAPG TPTAIDTHWI WQEGDARLIK NPLEIKNGKI AVPDAPGLGV ELDWDQIHKA HELYKKLPGG ARNDAGPMQY LVPGWTFDRK RPAFGRH
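The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch, not the database's own method, of how the GC content and the protein translation could be re-derived from the gene sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; since this gene begins with ATG, that difference does not matter here. All function and variable names are hypothetical.

```python
# Codon table in the conventional TCAG ordering; codon-to-amino-acid
# assignments are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
prefix = "ATGATGAATA CGCAATCCAG CCCAGTAATT"
print(translate(prefix))  # MMNTQSSPVI
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence string to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.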