Gene Information |
Locus tag | Moth_2251 |
Symbol | |
ID | 3830746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2354553 |
End bp | 2356154 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637830171 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_431081 |
Protein GI | 83591072 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
|
|
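The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2354553
end_bp = 2356154

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)       # 1602
print(gene_length % 3)   # 0, a whole number of codons
print(protein_length)    # 533
```

Under these assumptions the computed values agree with the reported Gene Length (1602 bp) and Protein Length (533 aa).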
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGA AGATCCTTAT CTACGATACC ACCCTCCGGG ACGGCAGCCA GGGCGAGGGT ATCAGCCTGT CGGTAGAGGA TAAACTAAAG ATTGCCTCCC GCCTGGACCG GCTGGGCGTA GATTATATCG AGGGTGGCTG GCCCTGGGCC AATCCCAAGG ATATGGAGTT TTTCCTCCGG GCACGGGAGG TCATCTGGCG CCAGGCCAGG CTGGTGGCCT TCGGCCGTAC CCGCAAGCCC GGCCAGGCCG CCGCCGAGGA CGCCAACCTG CTGGCCATCA AGCGGGCCGG AGTAAAGGTG GCCACTATTT TTGGCAAATC ATGGGATCTT CATGTCACGG CGGCCCTGGG GACCACCCTG GCCGAAAACC TGGCCATGAT CGGCGACAGC GTGGCCTTCC TGGTGGACCA GGGCCTGGAA GTAATCTATG ACGCCGAACA CTTTTTTGAC GGCTTCAAGG CCAACCCGGA TTATGCCCTG GAAACCCTGA AGGCGGCGGC AAAGGCCGGG GCCAGCTGGA TTGTCTTGTG TGACACCAAT GGCGGTTGCC TGCCATGGGA GATTGAGGAG GCGGTAGCCA GGGTACGCCA GGAGATCCAG GTGCCGGTGG GTATTCACGC CCATAACGAC GGCGACCTGG CCGTGGCCAA CACCCTGGCG GCGGTGACCG CCGGGTGCCG CCAGGTCCAG GGGACCATCA ACGGCTTTGG CGAGCGCTGC GGCAACGCCG ACCTGTGCTC GGTAATGCCC AACCTGGAAC TCAAGATGGG CTACCAGTGC CTGCCGCCGG GACAACTGGC CTTTCTCACG GAAGTCTCCC GTTATGTCAG CGAGATTGCC AACGTCGTCC CTGCCGGCAA CCAGCCCTTT GTCGGCTATA GCGCCTTTGC CCATAAAGGC GGCATCCACG TCAGCGCCGT TTTGAAGGCA CCGGATACCT ACGAGCATAT CCGGCCCCAG CAGGTAGGCA ACGAGCGGCG GGTGTTAATG TCGGACCAGG CCGGGGCCAG CAACCTGCGG TGCAAAGCGG AGGAGATGGG GCTGGAGTTG AACCCGGAGC GGGAACGAGG CATCATAGAG GGAATCAAGG AACTGGAACG CCAGGGCTAC CAGTTCGAGG GAGCCGATGC CTCCCTGGAG CTTTTCCTGC GGAAGACGAC GGGCGAATAC CGGCAGCAGT TTGAAGTCGA GTATGTCAAA GCCCTGGTAG AAAAGAGGGC CGGGCAGGAG GCCATATCGG AAGCCATAGT CAAGCTGCGG GTGGGCGACC AGGTGGTCCA TACGGCCGCC GAAGGCAACG GCCCTGTAAA CGCCATGGAT AACGCCCTGC GGAAAGCCCT GGAAGAAGTC TTCCCGGCTA TTCGGCACAT GCGCCTGACT GACTACAAAG TACGCGTCCT TGATGAAAAG GATGCCACCA GCGCCCGCGT CAGGGTACTC ATTGAATCCC GGGACGGCAG CAATTCCTGG AATACTGTCG GCGTCTCCAC CAATATTATC GAAGCCAGCT GGGAGGCCCT TCTGGACAGT ATGGAGTACG CCCTCCTTAA ACAACAGCAG GAGTTAAATA AGCGGGCGGC AGCCCCCTGT GAACCTTATT AG
|
Protein sequence | MAEKILIYDT TLRDGSQGEG ISLSVEDKLK IASRLDRLGV DYIEGGWPWA NPKDMEFFLR AREVIWRQAR LVAFGRTRKP GQAAAEDANL LAIKRAGVKV ATIFGKSWDL HVTAALGTTL AENLAMIGDS VAFLVDQGLE VIYDAEHFFD GFKANPDYAL ETLKAAAKAG ASWIVLCDTN GGCLPWEIEE AVARVRQEIQ VPVGIHAHND GDLAVANTLA AVTAGCRQVQ GTINGFGERC GNADLCSVMP NLELKMGYQC LPPGQLAFLT EVSRYVSEIA NVVPAGNQPF VGYSAFAHKG GIHVSAVLKA PDTYEHIRPQ QVGNERRVLM SDQAGASNLR CKAEEMGLEL NPERERGIIE GIKELERQGY QFEGADASLE LFLRKTTGEY RQQFEVEYVK ALVEKRAGQE AISEAIVKLR VGDQVVHTAA EGNGPVNAMD NALRKALEEV FPAIRHMRLT DYKVRVLDEK DATSARVRVL IESRDGSNSW NTVGVSTNII EASWEALLDS MEYALLKQQQ ELNKRAAAPC EPY
|
| |
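The relationship between the gene sequence and the protein sequence above can be illustrated with a small standard-library sketch. The listed gene sequence begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and are not part of any database tool. The codon-to-amino-acid assignments follow the standard genetic code, which translation table 11 shares for codons after the start codon; alternative start codons are not handled here, which does not matter for a gene that starts with ATG.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence listed above.
# The last 12 bases start at offset 1590, which is a multiple of 3,
# so they are in frame.
first_blocks = "ATGGCCGAGA AGATCCTTAT CTACGATACC"
last_blocks = "GAACCTTATT AG"

print(translate(first_blocks))  # MAEKILIYDT
print(translate(last_blocks))   # EPY
```

The two fragments translate to the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content of 61%.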