Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1775 |
Symbol | |
ID | 3832441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1830419 |
End bp | 1831435 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829700 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_430619 |
Protein GI | 83590610 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCTA GGTCAATCCA CATTGTCGAT ACAACCCTGC GTGACGGTAG TCACGCCGTC AGCCACCAGT TTACGGCCGA GCAAATAGCC GCCATTGCCG GCGGCCTGGA CGCGGCCGGA GTGGAGTATA TCGAGGTTTC CCACGGTGAC GGCCTGGCCG GTTCCTCTTA CAACTACGGC TGGGCAGCCC TGGGGGATGA GGAAATGCTA AAAGCCGCTA GCGCGGCGAT AAAAAAAGGC AAACTAACTG TCCTCTTACT CCCCGGCATT GGCACCGTCG AGGACCTGAA GATGGCGGCC GACTGCGGCG CCAAAGTGGT CCGGGTGGCC ACCCACGTCA CCGAGGCCGA TATCGGCGAG CAGCATATCG GCATGGCCAA GAAGCTGGGT ATGATGGCTG TTGGTTTCCT TATGATGTGC CACATGGCGC CGCCGGAAAA GGTAGTGGAA CAGGCCAAAC TCTTTGAGTC CTACGGCGCC GACTATATCA ACATCGCCGA CTCCGCGGGG GCTATGTTAC CGGAAGATGT CAAAGCCCGG GTAGGTGCCG TGGTCGAAGC AGTAAAAGTA CCTGTTGGGT TCCACGCCCA CAACAACCTG ACCATGGCTA CGGCCAATGC CCTGGCGGCG GTAGAGGCTG GCGCGACTTT CCTGGACGGT GCCTGTCGTG GCCTGGGGGC CGGAGCCGGC AATGCCCAAA CAGAAGCTTT AGTCGGTGTG CTTGACAAGC TGGGCTACCG GACGGGTGTC GATTTTTACA AAGTCATGGA CGTAGCCGAA GACATCGTCG AGCCTATCAT GCACCGGCCC CAGGTGGTGC GTAACGCACC CTTAATGCTG GGCTACGCCG GTGTTTACTC CAGTTTCTTA CTCCACACCT ACCGGGCGGC CGAGAAATTC AACCTCGACC CCCGGGACAT CCTGGTGGAA CTGGGAAGGA GGCGCATGGT CGGCGGGCAG GAAGACATGA TTGTCGATGT GGCTTACCAG TTAGCACAAA AGAGAGGAGG AAACTAA
|
Protein sequence | MSARSIHIVD TTLRDGSHAV SHQFTAEQIA AIAGGLDAAG VEYIEVSHGD GLAGSSYNYG WAALGDEEML KAASAAIKKG KLTVLLLPGI GTVEDLKMAA DCGAKVVRVA THVTEADIGE QHIGMAKKLG MMAVGFLMMC HMAPPEKVVE QAKLFESYGA DYINIADSAG AMLPEDVKAR VGAVVEAVKV PVGFHAHNNL TMATANALAA VEAGATFLDG ACRGLGAGAG NAQTEALVGV LDKLGYRTGV DFYKVMDVAE DIVEPIMHRP QVVRNAPLML GYAGVYSSFL LHTYRAAEKF NLDPRDILVE LGRRRMVGGQ EDMIVDVAYQ LAQKRGGN
|
| |