Gene Moth_1543 details


Gene Information

Locus tag: Moth_1543
Symbol:
ID: 3831929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1586839
End bp: 1588158
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 60%
IMG OID: 637829475
Product: hypothetical protein
Protein accession: YP_430395
Protein GI: 83590386
COG category:
COG ID:
TIGRFAM ID:
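The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and are not part of the database.

```python
# Figures copied from the record above; function names are hypothetical.
START_BP = 1586839
END_BP = 1588158


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1320 439
```

Under these assumptions the result agrees with the listed Gene Length (1320 bp) and Protein Length (439 aa).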


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.124081
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATTG GTAACTATGA GCTCAGCCGC CGGGAAGTGA GGTTGCTGGG TATAATGGCG 
GCAGTTTTAC TTGCTTTTTT CTTTTATCAC GTGGTCTGGG GAAAACAGCT TCCCGCCTAC
AGGGAGGTCA GGAGCCGCCT TCTGGCGGAC CAGGCACGCC TGGCTGCCGC CCAGCAGGCA
GCGGCTACGG CACCATCCCT GACAAAAAAT GCCGAACAGG CCCGGGCCGC CTGGGAGGCT
ACCAGACAGC GACTGGGCTT TACCCTCCAG GGTACCTCAG CCTTTCTGGA TGCTGCTCAA
CCCCGTGACC CGGCCCTCCG GATCCTGGTC TTCAAGCCCC TCCCGGTTGA AAAAAGAGAT
CCCTTTCAGG TATATCCCTA TGAAGTCACC GTCAGCGGCC CTTATCCGGC GTTACAGGAT
TATATAAGCC AGCTAGAATC CCTGCCGGCC CTGACGGCCA TCCACAATTT GAAGATCCTT
GCACGCCAGG GAAACGCTGC AACAGTCGAA GCCAGCTTTA TTATCGACCT CTACGACCTG
GGCGAGACTG TACCGGTACC GGCTGCGGTG GCCCTTTTCC CCGGCGGCAG GGCCGACAGC
TTTGCCCCAC CGCCAGGGGT TGCCTCGCCG GTAGGGGGAG GGACGGTAAC GGCCTCCCCC
GGCCAGGGAG TCCAGGTAAA CAGCAGCCAG GGCAGCCAGC AGGCACCAAC AAAACCCGGT
CCGGCAGGCA GCCCGCCACC GGCGTCTTCC CAACCTTCGG CCCGGACGTC ACCTTCCGCA
GCTCCGTCCC AGCCGGCTGA GGAGATTGGC GGGACGCCGG CCTATACCCT GCCCCGCCAG
CAGGGGGGAC GGCTGGTTCC GGGTCCGGCT TTTACGGACC CTCCGGCCGG CAACGGTGAT
GTCTGGCTGG ACGAGCTGCG CGTGTTGCGG AACGTCGGTC CTTTCTTCGT CCTCTCCAGG
CCGGCAGCCC TGGCGGGTAT GAACCTGGGC CGCAGTATAG GCGTAAATTT AAGCAAAGGT
CAAACTAAAG CCGAATTAAA GGTCGATCTC CGCGGCCGGT ACACCCGTCT CCAGGGGTAT
ACCGGCATCG ACGATAGCTT TGCCAACAGC AGCGGCAAAG TTAAAGTAAC CATTTTTGCC
GATGGCCGCC AGATTTATCA GGGGGAAATC AAGCCGGGGG ATTACCCCCG GTACCTGGAG
TTACCCCTCT TTCTAGTCCG GCAACTGACC TTCAGCCTGG AATGGCAGGC TGGTGATACT
GGTAGCTACG ATCAATTACT GGCTACCCTG GCCAGCATTC ATTTTTCTCG CCAGCCCTAG
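
The GC content field reported for this gene (60%) can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over all bases; the helper name `gc_percent` is illustrative. It strips the spaces and line breaks used in the 10-base block layout before counting.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace in the input."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100 * gc / len(bases)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence above.
    print(gc_percent("ATGAAGATTG GTAACTATGA"))  # prints: 30.0
```

Passing the full gene sequence to the same function, then rounding to the nearest percent, gives a value that can be compared against the reported figure.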
 
Protein sequence
MKIGNYELSR REVRLLGIMA AVLLAFFFYH VVWGKQLPAY REVRSRLLAD QARLAAAQQA 
AATAPSLTKN AEQARAAWEA TRQRLGFTLQ GTSAFLDAAQ PRDPALRILV FKPLPVEKRD
PFQVYPYEVT VSGPYPALQD YISQLESLPA LTAIHNLKIL ARQGNAATVE ASFIIDLYDL
GETVPVPAAV ALFPGGRADS FAPPPGVASP VGGGTVTASP GQGVQVNSSQ GSQQAPTKPG
PAGSPPPASS QPSARTSPSA APSQPAEEIG GTPAYTLPRQ QGGRLVPGPA FTDPPAGNGD
VWLDELRVLR NVGPFFVLSR PAALAGMNLG RSIGVNLSKG QTKAELKVDL RGRYTRLQGY
TGIDDSFANS SGKVKVTIFA DGRQIYQGEI KPGDYPRYLE LPLFLVRQLT FSLEWQAGDT
GSYDQLLATL ASIHFSRQP
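
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a minimal illustration and not the database's own pipeline: it builds the codon table from the NCBI base order (TCAG) and amino-acid string, which for internal codons is the same in table 11 as in the standard code. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# NCBI codon order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace is ignored and a trailing partial codon is dropped.
    Alternative start codons are not converted to M in this sketch.
    """
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGAAGATTG GTAACTATGA GCTCAGCCGC"))  # prints: MKIGNYELSR
```

The first ten residues match the start of the protein sequence, and the final codon of the gene (TAG) translates to a stop, which is why the protein is one residue shorter than the codon count.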