Gene Moth_0727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0727 
Symbol 
ID3831003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp758881 
End bp759903 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content50% 
IMG OID637828658 
Product4-hydroxythreonine-4-phosphate dehydrogenase 
Protein accessionYP_429588 
Protein GI83589579 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1995] Pyridoxal phosphate biosynthesis protein 
TIGRFAM ID[TIGR00557] 4-hydroxythreonine-4-phosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000339412 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.641149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAAC CCTTAATTGC TATTACTGTA GGTGACCCTT GCGGCATTGG GCCTGAGATT 
ACTGCTAAAG CCCTGGCCAT ACCAGAGATT TATAATCTAT GCCGGCCTCT GGCTATAGCC
GATGCCGGCC TGATGGGCGA AGCTATCAAG ATCGCGGGAG TTAACCTGTC CGTTCGAGCC
GTGACCAGTC CTGGTGAAGG CCGGTATGAG TATGGCACCA TCGATGTCCT GGACATGCAA
AATGTTGACC TGAACCAGTT GCAGTACGGC AAAGTAACCC GCATGGGGGG TGAAGCCAGT
TTCCAGTATA TAACCCGAGC CATTGAACTC GCCCTGGCCG GGGAAGTTGA TGCCGTCACC
ACCGGTCCCA TTAATAAAGA AGCTATCAAC CTTGCCGGAC ATCATTACTC CGGGCATACG
GAGATCTTCG CCGACCTGAC GAAAACGCAG GACTACTGCA TGATGCTCGT TGACAAGAAT
TTTCGGGTTT CCCATGTGAC TACCCATGTG GCTTTCAGTC AGGTACCATC ACTGATAAAA
AAGGAGCGGG TACTTACAGT AATCAAATTG ACTAACGATG CTCTTTTAAA AATGGGAATT
TCAATGCCGA GAATTGCGGT CGCTGGACTC AACCCCCATG CCGGCGAGGA TGGTCTCTTC
GGCCGCGAGG AAATCGAGGA AATCGGTCCG GCTATCACCG CCGCCAGGGA GCAGGGAATC
CAGGTAGATG GCCCGGTACC TCCAGATACC ATTTTTGTTA AACTCCAGGG TGGCCAGTAT
GATGCTGTAG TAGCTATGTA CCATGATCAG GGCCATATTC CAACCAAATT AATCGGCTTT
AAATATGACA ATGCCACCGG CAAGTGGGGA TCGGTTGCCG GGATAAACAT CACTTTAGGA
TTACCAATAA TACGGACCTC AGTTGATCAT GGTACCGCTT TTGGTAAAGC TGGAAAGGGG
ACAGCCAACC CCGAAAGTAT GGTGGATGCC TTGAAAATGG GGGCAATAAT GACGCGAACC
TAA
 
Protein sequence
MVKPLIAITV GDPCGIGPEI TAKALAIPEI YNLCRPLAIA DAGLMGEAIK IAGVNLSVRA 
VTSPGEGRYE YGTIDVLDMQ NVDLNQLQYG KVTRMGGEAS FQYITRAIEL ALAGEVDAVT
TGPINKEAIN LAGHHYSGHT EIFADLTKTQ DYCMMLVDKN FRVSHVTTHV AFSQVPSLIK
KERVLTVIKL TNDALLKMGI SMPRIAVAGL NPHAGEDGLF GREEIEEIGP AITAAREQGI
QVDGPVPPDT IFVKLQGGQY DAVVAMYHDQ GHIPTKLIGF KYDNATGKWG SVAGINITLG
LPIIRTSVDH GTAFGKAGKG TANPESMVDA LKMGAIMTRT