Gene Moth_0189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0189 
Symbol 
ID3832262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp184654 
End bp186300 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content64% 
IMG OID637828125 
Producthypothetical protein 
Protein accessionYP_429067 
Protein GI83589058 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTGCGGA GGGTTTCTAC GAGGGTTTTA ATCTGCTGGA CGTTGTTGTT CTTGATGCTG 
TTGTCCCTGG TGGGCGGGGC GGTCGCCCTG GCCGCGGGGC TGGACCTCCC CGGCCTTTAC
GACCAGTTGA AGAATGATCC CACCTACCAG CCCTATCGCC AGGAACTGCT GAATGAATAC
CTTAAGTTAA ATAGCAACGA GCAGGCCATG GATAACGACC TGCGGTCTTT CCTGGCCGAT
GTGCAGGCCC GGCTTTCCCA GGCTGATACG AGCAGCCTCA AGGATGAAGC CGGCGTGAAC
GCCCTGGTCA TGAAAACGGC GGCGGAAGTG CTGTTTACGG ATAAGAAATA CACTACCCTG
GCAAATGCCT TGGCGGCAAC TTCGGACATT CAATCCATCC TGAAGGGCAA TTTCCCGCCG
TCCCTGGAGC CTGTCCGGAA AGAGGTTGTG AACGCTTTGC TGGGGAGCGG CGGTCAGGCC
GCGGCCGGCG GGGGCGGGAC CGCGGCGCCG TCGGCCGGTA CGGAAGAAGA GATCCAGCGC
GACACTGCCG CCGGGGTTGT TACCTGGCGG GTGAACCCGG AGGCGGCCGC CGGGATAAAG
GATAACAAGC TGGTCCTTTC CCTGGCCGGG GAGACCGCTC CCACCCGGAC CTTCTCCCTG
CCGCCGGCTC TCCTCCAGGA CCTGGGGCAA AAAAAGGCGG ACCTGGAGCT GGACTACGGC
CCGGTGGTCC TGACCCTGCC GGCAGCCTCC CTGGCGGATT TGAGCAGCTC CGGGGGAGCA
GACCTGACGT TTAGGCAGGA GAGCCTGGAC CCGGCGCGGG TAAAAGACGT CAACGGGCCG
GGCTACAATG GGGCCGGCAT GGTTTATACC CTCACCGCCG GGGATAAGGG AGGGCTCCCG
GCCGGCGTCC CCGCTAGGAT AAAATATGAA GAGCGGCAGG GGTTGGACCG GGACCTCCTG
GGCGTTTACC AGGTGAAAAG CGATGGTTCT CTGGTTTACC TCGGCGGTCA TGGGGTGGAC
GGGAGTCCCT ACCTGGAATT TTCGTTACCG GGTGATGGCA GTTATGCCGT TTTGGAGTAC
CGGGCTGATT TTGCCGACCT GGCGGGCCAC TGGGCGGCCA GGGATGTCCA GGTCATGGCG
GCCAGGCATA TCGCCGCCGG CGTGGGCGAG GGGCGGTTTG AGCCCGACCG AGACATTACC
CGGGCCGAAT TTACGAGCCT CCTGCAGCGG GTCCTGGGCC TGCCGGTAAA AGGAACGGTA
ACCGGCTTTA GCGATGTGCC GGCCGATGCC TGGTACGCTC CCTCCGTGGC CGCCGCCGTC
CGGGCGGGGC TGGTGCACGG GTATGAGGAT AGTACCTTTA AACCCGATAA TCCGGTGACC
AGGGCCGAGA TGGCGGCCAT GCTGGGCAAC GCCCTGGCCT TGCAGGGCCT GGCCGTGAAG
GTGGAGCCTG GTCAGGTAGA AGCCGTCCTG CAACCCTATC GGGACCAGGC GGCCGTACCT
TCCTGGGCCC GGCCGGCCAT GGCCGCGGCT GTGACGGCCG GTATTGTCGG CGGCCGCGAA
GGCGGCCTGG CGCCCCTTGA GCGCGCCACC AGGGCCGAGG CCATAGTGAT GCTTGAGCGG
CTGATGGATA AAGCCGGCTG GAGATAG
 
Protein sequence
MLRRVSTRVL ICWTLLFLML LSLVGGAVAL AAGLDLPGLY DQLKNDPTYQ PYRQELLNEY 
LKLNSNEQAM DNDLRSFLAD VQARLSQADT SSLKDEAGVN ALVMKTAAEV LFTDKKYTTL
ANALAATSDI QSILKGNFPP SLEPVRKEVV NALLGSGGQA AAGGGGTAAP SAGTEEEIQR
DTAAGVVTWR VNPEAAAGIK DNKLVLSLAG ETAPTRTFSL PPALLQDLGQ KKADLELDYG
PVVLTLPAAS LADLSSSGGA DLTFRQESLD PARVKDVNGP GYNGAGMVYT LTAGDKGGLP
AGVPARIKYE ERQGLDRDLL GVYQVKSDGS LVYLGGHGVD GSPYLEFSLP GDGSYAVLEY
RADFADLAGH WAARDVQVMA ARHIAAGVGE GRFEPDRDIT RAEFTSLLQR VLGLPVKGTV
TGFSDVPADA WYAPSVAAAV RAGLVHGYED STFKPDNPVT RAEMAAMLGN ALALQGLAVK
VEPGQVEAVL QPYRDQAAVP SWARPAMAAA VTAGIVGGRE GGLAPLERAT RAEAIVMLER
LMDKAGWR