Gene Moth_1367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1367 
Symbol 
ID3832290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1412471 
End bp1413928 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content57% 
IMG OID637829303 
ProductAlpha,alpha-trehalose-phosphate synthase (UDP-forming) 
Protein accessionYP_430223 
Protein GI83590214 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAA AGCAGCCTGC TAAAATTGTC ATGGTTTCCA ACCGAGGTTC ATATAATTTG 
CAGGCAACCC GGGACGGTAT CCAGGTTGTA CCGGCCATCA GCGGACTGGT TTCGGCCGTG
GAACCCTTCT TAAGGGAAAA AGGTGGCGTC TGGGTTGCCT GGGGTGGCCG GGAGGCGCCC
CAGGAGAACA GTCCCGGTTT ACGTTTACCG GTACCCGAGG GTAATCCGGC CTATACTTTT
CGTGAAGTTC CCCTTACCGC CGATGAAATC AACCTCTATT ACCATGGCTT TACCAATAGC
GCCTTGTGGC CCCTATGCCA TTACTTTTTA GAAAAATGTC GTTATGAAGT CCGGGAATGG
TCGGCGTATG TCGATGTTAA TGCCAAATTT GCCGCAGCCA CCCTGGAAGA AGCCGGGGAA
AGGGATACAG TCTGGGTAAA TGATTATCAC CTGGCCCTGG TACCGGCCCA CCTCCGCCAC
CGCAGGCCCC AATTAAAACA ATTCTTCTTC TGGCATATAC CTTTCCCCCA CCATGACCTG
CTGGCGACCC TACCCTGGGC AACCCATATC CTGCGCGGTC TGCTAGGAAC CGATGTCCTG
GGTTTTCACC TTCAAGCCTA CGTTGACAAT TTTTTGCAGG CCGTGGCGCA CATGCTGGGA
GCCAGGGTCG ATTTTGAAGC CGGTATTGTC TTCTGGGAGC AGCGGCGCAT TCATGTTGGT
GCCTGGCCTA TGGGCATAGA TTATCAAGCC TTCCAGCGCC AGGCTGCCAG TCCAGCCACC
ATGGCCAGGG CTCAAAAACT GCGGGAACAA ATTGGTGTCG AGCGCCTGGC GTTAAGCGTG
GAGCGCCTCG ATTATACCAA GGGCATCCTG GAAAGGCTCC TGGCGTGGGA ACGTCTGCTG
GAAGAAGCCC CCGAATGGCG CGGCCGGGCC GCCCTGATCC AGGTCGCCGT TCCCAGCCGG
ACAGCTGTAC CGGCCTATCG TCAGTTAAAG GAGCAGGTGG AAGCCACTGT AGGCCGCATC
AACGGCCGTT TTAGTGACGG CAACTACCAA CCGGTATACT ATTTCTGGCG CGGCCTCCCC
CGGCGGGAGC TGGTAGCCTA TTACCTGGCA GCCGATGTGA TGCTGGTCAC ACCCTTAAGA
GACGGCTTGA ACCTGGTGGC CAAGGAATAT GTAGCTTCGC GGCGTGATCA GACCGGCGTC
CTGGTCTTGA GCCGTTTCGC CGGTGCCGCC CAGGAATTGA AGGGTGCCGT CCTGGTGAAC
CCCTATGACA TCGACGGGAT GGCCATGATC TTTAATACGG CCCTGGGGAT GGGCCGGGCA
GAGCAAAGCA AAAGGTTGCA ACTGTTACAG GAAAGGGTAC GGCGCCATGA TGTCCACTGG
TGGATGAATT GTTTTCATCA GGCCATGGCC GTCAGGGAAG GAGAAACCGG TGATGTCTCC
GGATCAGCTG GCAGCTAG
 
Protein sequence
MSQKQPAKIV MVSNRGSYNL QATRDGIQVV PAISGLVSAV EPFLREKGGV WVAWGGREAP 
QENSPGLRLP VPEGNPAYTF REVPLTADEI NLYYHGFTNS ALWPLCHYFL EKCRYEVREW
SAYVDVNAKF AAATLEEAGE RDTVWVNDYH LALVPAHLRH RRPQLKQFFF WHIPFPHHDL
LATLPWATHI LRGLLGTDVL GFHLQAYVDN FLQAVAHMLG ARVDFEAGIV FWEQRRIHVG
AWPMGIDYQA FQRQAASPAT MARAQKLREQ IGVERLALSV ERLDYTKGIL ERLLAWERLL
EEAPEWRGRA ALIQVAVPSR TAVPAYRQLK EQVEATVGRI NGRFSDGNYQ PVYYFWRGLP
RRELVAYYLA ADVMLVTPLR DGLNLVAKEY VASRRDQTGV LVLSRFAGAA QELKGAVLVN
PYDIDGMAMI FNTALGMGRA EQSKRLQLLQ ERVRRHDVHW WMNCFHQAMA VREGETGDVS
GSAGS