Gene Moth_2251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2251 
Symbol 
ID3830746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2354553 
End bp2356154 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content61% 
IMG OID637830171 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_431081 
Protein GI83591072 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGA AGATCCTTAT CTACGATACC ACCCTCCGGG ACGGCAGCCA GGGCGAGGGT 
ATCAGCCTGT CGGTAGAGGA TAAACTAAAG ATTGCCTCCC GCCTGGACCG GCTGGGCGTA
GATTATATCG AGGGTGGCTG GCCCTGGGCC AATCCCAAGG ATATGGAGTT TTTCCTCCGG
GCACGGGAGG TCATCTGGCG CCAGGCCAGG CTGGTGGCCT TCGGCCGTAC CCGCAAGCCC
GGCCAGGCCG CCGCCGAGGA CGCCAACCTG CTGGCCATCA AGCGGGCCGG AGTAAAGGTG
GCCACTATTT TTGGCAAATC ATGGGATCTT CATGTCACGG CGGCCCTGGG GACCACCCTG
GCCGAAAACC TGGCCATGAT CGGCGACAGC GTGGCCTTCC TGGTGGACCA GGGCCTGGAA
GTAATCTATG ACGCCGAACA CTTTTTTGAC GGCTTCAAGG CCAACCCGGA TTATGCCCTG
GAAACCCTGA AGGCGGCGGC AAAGGCCGGG GCCAGCTGGA TTGTCTTGTG TGACACCAAT
GGCGGTTGCC TGCCATGGGA GATTGAGGAG GCGGTAGCCA GGGTACGCCA GGAGATCCAG
GTGCCGGTGG GTATTCACGC CCATAACGAC GGCGACCTGG CCGTGGCCAA CACCCTGGCG
GCGGTGACCG CCGGGTGCCG CCAGGTCCAG GGGACCATCA ACGGCTTTGG CGAGCGCTGC
GGCAACGCCG ACCTGTGCTC GGTAATGCCC AACCTGGAAC TCAAGATGGG CTACCAGTGC
CTGCCGCCGG GACAACTGGC CTTTCTCACG GAAGTCTCCC GTTATGTCAG CGAGATTGCC
AACGTCGTCC CTGCCGGCAA CCAGCCCTTT GTCGGCTATA GCGCCTTTGC CCATAAAGGC
GGCATCCACG TCAGCGCCGT TTTGAAGGCA CCGGATACCT ACGAGCATAT CCGGCCCCAG
CAGGTAGGCA ACGAGCGGCG GGTGTTAATG TCGGACCAGG CCGGGGCCAG CAACCTGCGG
TGCAAAGCGG AGGAGATGGG GCTGGAGTTG AACCCGGAGC GGGAACGAGG CATCATAGAG
GGAATCAAGG AACTGGAACG CCAGGGCTAC CAGTTCGAGG GAGCCGATGC CTCCCTGGAG
CTTTTCCTGC GGAAGACGAC GGGCGAATAC CGGCAGCAGT TTGAAGTCGA GTATGTCAAA
GCCCTGGTAG AAAAGAGGGC CGGGCAGGAG GCCATATCGG AAGCCATAGT CAAGCTGCGG
GTGGGCGACC AGGTGGTCCA TACGGCCGCC GAAGGCAACG GCCCTGTAAA CGCCATGGAT
AACGCCCTGC GGAAAGCCCT GGAAGAAGTC TTCCCGGCTA TTCGGCACAT GCGCCTGACT
GACTACAAAG TACGCGTCCT TGATGAAAAG GATGCCACCA GCGCCCGCGT CAGGGTACTC
ATTGAATCCC GGGACGGCAG CAATTCCTGG AATACTGTCG GCGTCTCCAC CAATATTATC
GAAGCCAGCT GGGAGGCCCT TCTGGACAGT ATGGAGTACG CCCTCCTTAA ACAACAGCAG
GAGTTAAATA AGCGGGCGGC AGCCCCCTGT GAACCTTATT AG
 
Protein sequence
MAEKILIYDT TLRDGSQGEG ISLSVEDKLK IASRLDRLGV DYIEGGWPWA NPKDMEFFLR 
AREVIWRQAR LVAFGRTRKP GQAAAEDANL LAIKRAGVKV ATIFGKSWDL HVTAALGTTL
AENLAMIGDS VAFLVDQGLE VIYDAEHFFD GFKANPDYAL ETLKAAAKAG ASWIVLCDTN
GGCLPWEIEE AVARVRQEIQ VPVGIHAHND GDLAVANTLA AVTAGCRQVQ GTINGFGERC
GNADLCSVMP NLELKMGYQC LPPGQLAFLT EVSRYVSEIA NVVPAGNQPF VGYSAFAHKG
GIHVSAVLKA PDTYEHIRPQ QVGNERRVLM SDQAGASNLR CKAEEMGLEL NPERERGIIE
GIKELERQGY QFEGADASLE LFLRKTTGEY RQQFEVEYVK ALVEKRAGQE AISEAIVKLR
VGDQVVHTAA EGNGPVNAMD NALRKALEEV FPAIRHMRLT DYKVRVLDEK DATSARVRVL
IESRDGSNSW NTVGVSTNII EASWEALLDS MEYALLKQQQ ELNKRAAAPC EPY