Gene Moth_1104 details


Gene Information

Locus tag: Moth_1104
Symbol:
ID: 3833070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1131553
End bp: 1132680
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 55%
IMG OID: 637829032
Product: L-threonine O-3-phosphate decarboxylase
Protein accession: YP_429961
Protein GI: 83589952
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01140] L-threonine-O-3-phosphate decarboxylase
            [TIGR01141] histidinol-phosphate aminotransferase
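
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene length counts one terminal stop codon. The helper names span_length and coding_codons are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1131553
end_bp = 1132680
gene_length_bp = 1128
protein_length_aa = 375


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1128
print(coding_codons(gene_length_bp))   # 375
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.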


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCATG AAAGGGCAAG CAACAGCAAT GGCCGGGAAC TGACCGGGGT CATTCACGGC 
GGAGACTGGC AGGGAGCCGT AGATCGCTAC GGCTGGAAGG TGGAAGAGAT ACTGGACTTC
AGCGCCAACA TCAATCCCCT GGGGCCGCCG GTCGGGGTAC TGGAAACCTT GAAGGAAAAC
TTGCCGGCCG TTCAGCGTTA CCCTGACCCG GCAAGCCGGC GTTTGAAAGA AGCACTGGCA
GATCAACTGC ATGTAGATAC CGGCGCTATA ATTATTGCTA ACGGGGCGGT AGAATTAATT
TACCTTATAA TGCAGGTTCT AAAGCCTGAT CGGGTGCTGG TAGTCGAACC CACCTTCGGC
GAATACCGCC GGGCAGCTAC CATCGCCGGG GCCGAAGTGT TGCCTGTCTA CCTTGATCCC
GCTACCGGTT TTACCTTTGA TTTCGACCGC TGGCGCCCGG AACTGCAGCG GTCGCAGGTA
GCCTTTATTT GCAATCCCAA TAATCCCACC GGCCGCCTCC TGAACCCGGA TATTTTACAT
CGGGCAGCAA GCCTATGCCG GGAACAGGGG GTTTTCCTGG TGATGGATGA ATCCTTTCTG
GACTTTGTCC CCGACGGGGA TAAGTTTTCT CTGGTACCCC AGGCGGCCGC CGGGCCGGGG
ATATTTATCC TGCACTCTTT GACGAAGATT TTTGCCCTGC CGGGATTGCG CCTTGGTTAC
GGTGTCGGCT GTCCGGATAT GGTACGCAGG CTGGAGAACA GCCGGGACCC CTGGAGCGTC
AATATCCTGG CCCAGATGGC AGGCGTAGCC GCCCTGGCCG ATAAGGAGTA TTTAAAGAAA
ACCCGGGAAC TAATCAAGCG GGAAAAGGAG TATCTTTTCC ATAACCTGTC CAGGCTGGCA
GGATTCCGGC CCTATTACCC CGAAGTTAAT TTTATCCTGA CTAACATTCA GAACGGCTGC
CTGACGGTAT CCCGGTTGGC TGAACTCCTG GCCCGGAAGC GCATCTTAAT CCGTGACTGT
TCTTCTTTTC CCGGTCTTGG ACCGGCCTAC TTCCGGGTTG CCGTACGTGA CCACCGGGCC
AATAAGAGAC TGGTGGCTGC CTTAAAGGAG ATAATGGAGG AGAGTTAA
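
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The function names clean_sequence and gc_fraction are hypothetical, and only the first and last lines of the sequence are used here as a short demo, so the printed fraction is for that excerpt, not for the whole gene.

```python
def clean_sequence(text):
    """Drop spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGTTCATG AAAGGGCAAG CAACAGCAAT GGCCGGGAAC TGACCGGGGT CATTCACGGC"
last_line = "AATAAGAGAC TGGTGGCTGC CTTAAAGGAG ATAATGGAGG AGAGTTAA"

excerpt = clean_sequence(first_line + "\n" + last_line)
print(len(excerpt))                      # 108
print(excerpt[:3], excerpt[-3:])         # ATG TAA
print(round(gc_fraction(excerpt), 2))    # GC fraction of the excerpt only
```

Pasting all 19 sequence lines into clean_sequence in place of the two-line excerpt would give the full 1128 bp gene.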
 
Protein sequence
MVHERASNSN GRELTGVIHG GDWQGAVDRY GWKVEEILDF SANINPLGPP VGVLETLKEN 
LPAVQRYPDP ASRRLKEALA DQLHVDTGAI IIANGAVELI YLIMQVLKPD RVLVVEPTFG
EYRRAATIAG AEVLPVYLDP ATGFTFDFDR WRPELQRSQV AFICNPNNPT GRLLNPDILH
RAASLCREQG VFLVMDESFL DFVPDGDKFS LVPQAAAGPG IFILHSLTKI FALPGLRLGY
GVGCPDMVRR LENSRDPWSV NILAQMAGVA ALADKEYLKK TRELIKREKE YLFHNLSRLA
GFRPYYPEVN FILTNIQNGC LTVSRLAELL ARKRILIRDC SSFPGLGPAY FRVAVRDHRA
NKRLVAALKE IMEES
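
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a minimal illustration using the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs in which codons may act as starts, and this gene starts with ATG). The names CODON_TABLE and translate are hypothetical, and only the first and last lines of the gene sequence are translated as a demo.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGTTCATG AAAGGGCAAG CAACAGCAAT GGCCGGGAAC TGACCGGGGT CATTCACGGC"
last_line = "AATAAGAGAC TGGTGGCTGC CTTAAAGGAG ATAATGGAGG AGAGTTAA"

print(translate(first_line))  # MVHERASNSNGRELTGVIHG
print(translate(last_line))   # NKRLVAALKEIMEES
```

Both outputs match the start and end of the protein sequence listed above. The last line can be translated on its own only because the 1080 bases before it are a multiple of 3, so it begins in frame.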