Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0275 |
Symbol | |
ID | 4270493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 316674 |
End bp | 317750 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638125000 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_741120 |
Protein GI | 114319437 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.447479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.950957 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATC GTACTGTGGA CGACCTGAAC GTCGAGAAGA TCGAGCACCT CCCCACCCCC GCCGAGATCA AGGCCCAGCT GCCGCTCAGC GAGCAAGCCC GCCGTCTGGT GGTCGAGGGT CGGGAGACCG TTCGCAACAT CCTGGATGGC AAGGATCACC GGCTGTTGGT GGTGGTGGGG CCCTGTTCCA TCCACGACCC CAAAGCGGCG CTGGACTATG CCAAGCAACT GAAAGCGCTG AGTGATCAGG TGGGCGACAG TCTGTTCATC GTCATGCGGG TCTATTTTGA GAAGCCGCGC ACGGTGACCG GGTGGAAGGG GCTGATCAAC GACCCCGACA TGGACGACTC CTTCCGGATC GACAATGGCC TGTTCCAGGC GCGCAAACTG CTGCTGGACC TGGCTGAGAT GGGACTGCCC ACAGCCACCG AAGCGCTCGA CCCGATCATC CCGCAGTACC TGCAGGACCT GATCACCTGG ACGGCCATTG GTGCCCGCAC CACCGAATCG CAGACCCACC GCGAGATGGC CAGCGGCCTC TCCACGCCGG TCGGATTCAA GAATGGCACC GACGGCAGCC TGGACGTGGC CATCAACGCC ATGAAGTCCG CCGCCCATCC GCACAGCTTC CTGGGCATCA ACTCCCGCGG CGAGTGCAGT ATCATCCGGA CCCGCGGCAA CAGCTACGGC CACGTGGTGC TGCGCGGCGG CCATGGCCAG CCCAATTACG ACAGCGTGCA CATTGCCCTG TGCGAGCAGG AGCTGGAAAA GGCGGGGCTG CCCGCGCGGA TCGTGGTCGA CTGCAGCCAC GCCAACTCCA ACAAGGACCC GGCGCTGCAG CCCATGGTGC TGAAGGACCT GGTGCACCAG ATCCTGGAGG GCAACCAGTC GCTGGTGGGC GTCATGCTGG AGAGCAACCT GGGCTGGGGC AACCAGAAGC TGGGGGCCGA TCCCGCTGCC CTCGACTACG GGGTCTCCAT CACCGATGCC TGTATCGACT GGCCGACCAC CGAGCAGGGT CTGCTGGAGG CGGCGGAAAA GCTGCGCGAG GTGTTGCCCC GGCGGGCCGC GGCCTGA
|
Protein sequence | MSDRTVDDLN VEKIEHLPTP AEIKAQLPLS EQARRLVVEG RETVRNILDG KDHRLLVVVG PCSIHDPKAA LDYAKQLKAL SDQVGDSLFI VMRVYFEKPR TVTGWKGLIN DPDMDDSFRI DNGLFQARKL LLDLAEMGLP TATEALDPII PQYLQDLITW TAIGARTTES QTHREMASGL STPVGFKNGT DGSLDVAINA MKSAAHPHSF LGINSRGECS IIRTRGNSYG HVVLRGGHGQ PNYDSVHIAL CEQELEKAGL PARIVVDCSH ANSNKDPALQ PMVLKDLVHQ ILEGNQSLVG VMLESNLGWG NQKLGADPAA LDYGVSITDA CIDWPTTEQG LLEAAEKLRE VLPRRAAA
|
| |