Gene Moth_1998 details


Gene Information

Locus tag: Moth_1998
Symbol:
ID: 3832331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2082267
End bp: 2083511
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 60%
IMG OID: 637829927
Product: L-threonine synthase
Protein accession: YP_430837
Protein GI: 83590828
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
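
The start, end, gene length and protein length fields above are consistent with each other. A minimal Python sketch of that check follows, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2082267, 2083511)
print(length_bp)                  # 1245
print(protein_length(length_bp))  # 414
```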


Plasmid Coverage information

Num covering plasmid clones: 51
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATA TAACCCAACT ACGCTGTATA AACTGCAGCC GTACCTATCA ACCCCGGCCC 
GGCTTTTATA CCTGCCCCAT CTGCGGGTCG ACGGACGGCA TCCTGGATGT CGAATACGAT
TACGATTATA TAAACAAATC CATCTCCAGG CAGAGCCTGG CCAGCAACCG GGAACATTCC
CTCTGGCGTT ACCGTCCCTT CTTGCCGGTG GCCCAGGAGG GACCGCTGCC CCCTCTGCGG
GTGGGTTGGA GCCCCCTGTA CCGGGCCCGG CGCCTGGGGG ACGAACTGGG GTTGAAAGAG
CTATATATCA AGGACGATGG GATTAATCCC ACGGGGTCTT TAAAAGACAG GCCTTCGGCC
GTGGCGGTGG CCCGAGCCCT GGCCGAAGGA GCTAAGGTGG TCGCCTGCTC TTCAACCGGC
AACGCTGCTT CCTCCCTGGC GGGGGCAGCA GCTTCCGTGG GCCTGAAGGC GGTAATCTTT
GTACCGGGAC GGGCACCGCA AGGGAAGGTA GCCCAACTCT TGATTTTCGG GGCCACCGTT
ATCAGTGTTC AGGGATCCTA TGAGGATGCC TTCAAGCTTT CGGCGGCGGC CATAGCCGAG
CACGGCTGGT ACAACCGCAA TGCCGCCATT AACCCTTATC TGGTAGAAGG CAAAAAGACA
GTTTGCCTGG AAGTGGCGGA ACAGCTAAAC TGGGAGGTAC CCGACTGGGT GGTCCTTTCC
GTGGGCGACG GTTGTACCAT GGCCGGTGCC TGGAAGGCCT GGGTGGACCT GAAAAAGGCC
GGTTGGATTG ACAAACTGCC GCGGATGCTG GGGGTCCAGG CCGAGGGTTG CTGCCCCATT
ACCAGGGCCT TCCGGGAGGG AACCAGGGTA AAACCCATGC CCGAAAATAC CCTGGCGGAC
AGCATCGCCG TCGGCGTGCC CCGCAATCCA GAAAAGGCTC TGCGGGCCGT AAGGGATTCC
GGGGGTACGA TGATCAACGT CAGCGATGAA GAAATCCTTG CAGCCATGCG CACCCTGGGG
CGGACCAGCG GCATCTTCGG CGAACCTGCC GGTGCAGCCG GGACCGCCGG CCTTATCAAA
GCCGTCCAGG AAGGAATAAT TAAATCCGGA GATAAGGTGG TCGTCCTGGT AACAGGCAAC
GGCCTCAAGG ATGTGGCCAA TGCCATCAAA GCGGCCGGAG AGCCTATCCG GGTGGAGCCC
TCCCTGGAGG CCTTAAAGGA AGCCCTGGCT CGGTACGGGA AATGA
 
Protein sequence
MSNITQLRCI NCSRTYQPRP GFYTCPICGS TDGILDVEYD YDYINKSISR QSLASNREHS 
LWRYRPFLPV AQEGPLPPLR VGWSPLYRAR RLGDELGLKE LYIKDDGINP TGSLKDRPSA
VAVARALAEG AKVVACSSTG NAASSLAGAA ASVGLKAVIF VPGRAPQGKV AQLLIFGATV
ISVQGSYEDA FKLSAAAIAE HGWYNRNAAI NPYLVEGKKT VCLEVAEQLN WEVPDWVVLS
VGDGCTMAGA WKAWVDLKKA GWIDKLPRML GVQAEGCCPI TRAFREGTRV KPMPENTLAD
SIAVGVPRNP EKALRAVRDS GGTMINVSDE EILAAMRTLG RTSGIFGEPA GAAGTAGLIK
AVQEGIIKSG DKVVVLVTGN GLKDVANAIK AAGEPIRVEP SLEALKEALA RYGK
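
The sequences above are printed in blocks of ten characters, so they need whitespace removed before any processing. The Python sketch below shows one way to clean such a block, compute its GC fraction, and translate it. It uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The helper names `clean`, `gc_content` and `translate` are illustrative and do not come from the source database.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order of the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = clean(
    "ATGAGCAATA TAACCCAACT ACGCTGTATA AACTGCAGCC GTACCTATCA ACCCCGGCCC"
)
print(translate(first_line))  # MSNITQLRCINCSRTYQPRP
```

Passing the full gene sequence through `clean` and `translate` in the same way should yield a string that can be compared against the protein sequence listed in the record.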