Gene Moth_0209 details

Gene Information

Locus tag: Moth_0209
Symbol:
ID: 3831360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 204139
End bp: 205698
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 60%
IMG OID: 637828145
Product: hypothetical protein
Protein accession: YP_429087
Protein GI: 83589078
COG category: [S] Function unknown
COG ID: [COG1297] Predicted membrane protein
TIGRFAM ID:
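
The coordinate and length fields above are mutually consistent, and the arithmetic is easy to check. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon codes for one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(204139, 205698)
print(length)                  # 1560
print(protein_length(length))  # 519
```

Both results match the Gene Length and Protein Length values reported for Moth_0209.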


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAAAGC ACCCCAGGGC ATTTGAACCA GCCGTCCTCA TCCTCAATAT TATTCTCTCG 
GTATTGGGGT CGATTATCGG TCTGCAGATC CTTACTACCC TGGGGGTCAC ACCCAACACG
GCCATCATTG GGGTACTGGT AGCCCTCGCC CTGTCGCGCA TCCCGGGAGG GTGGATGGCC
AAATACCGTT CCATTCACCG CCAGAACCTG GTCCAGTCGA CCATTTCCGG CGCCACCTTC
GGGGCAGCCA ACTCCCTGCT GCTGCCTATC GGCATTCCCT ACCTCTTCGG CCGTCCCGAC
CTGGTTGTGC CCATGCTCAT AGGAGCGACC ATGGGCATGT TCATTGACTG GGCCATGCTG
TACTGGTTCT TCGATTCCCG GATTTTCCCC GGCCAGGCCG CCTGGCCGCC GGGTGTGGCG
GCAGCCGAAG CCATCTATGC CGGTGATGAA GGCGGCAAGA GGGCCTGGTT ACTCGTCTGG
GGCACCATTA TCGGGATCAT TGGTTCTTAC TTCAAGGTTT CCATGTCCGC CTTCGGGGTG
GCCTTTATCG GCAATGTCTG GGCTTTGACC ATGTTTGGCC TTGGGCTGCT GCTGAGGGGT
TATTCTGTAA AACTCTTTGG CTTTGATATT GATAAACTCT ATATCCCCCA CGGGATGATG
ATTGGAGCCG GTCTGGTGGC CGGAATCCAG ATATTGCTCA TTCTTCTAAA GGGAAGGAAA
GAAACGACGG CTTCCGGGGA TGCCCCTGCG GCCGCTAACT ATACGCGGAG TGAAAAACAG
GTCGCTAAAG GCCTGGCCCG GGGTTTTGGC CTCTATATCG TCGCAGCCCT GGTCCTGGCC
ATGCTGGGCG GCCTCTACAC CAGCATGCCG GCGTGGCAGT TGCTGTTCTG GGCCGTTTTC
GCCGCCGTTT CCTGCATCCT GGCTGAGTTT ATCGTGGGGC TTTCAGCCAT GCACGCCGGC
TGGTTCCCGG CCTTTGCCAC GGCCCTGATT TTCCTGGTCA TCGGCATGGC CCTCGGCTTC
CCGGCCCCGG CCCTGGCCCT GCTGGTGGGC TTTGTCGCTT CGGGGGGGCC GGCCTTTGCC
GACGCCGGTT ATGATTTTAA GGCCGGCTGG ATCCTCAGGG GTGAAGGCCG GGATCGCGGT
TTTGAGCTTG ACGGCCGCTG GCAGCAGTTC CTGGCAGGTG CTTCAGGCCT GGTCGTGGCC
TGGGCCATGG TCACCCTGAC CCACGGTATC TATTTCCGCC AGGGCCTCTT CCCGCCGGTG
GATAAGGTTT ACGCGGCGAC CATCAAAGCG GGGGTAGACG CGGCTATCAT CAAAAATCTG
GTCCTGTGGG CCATACCCGG AGCCCTGATT CAGGCCCTGG GCGGTTCTGA AAAACAGCTG
GGCATCATGC TGGCCACCGG CCTTTTAATC CTCAATCCCC TGGCGGGTTA TGCCGTGCTG
GCGGGGATTT TGATCCGCAC CCTGGTTTTG AAGTTTAAGG GGCGGGAAGC GGAGACCCCC
ATGACCATCC TGGCAGCTGG CTTTATCGCC GGCGATGCCC TCTACGGTTT CTTTAACTAG
 
Protein sequence
MEKHPRAFEP AVLILNIILS VLGSIIGLQI LTTLGVTPNT AIIGVLVALA LSRIPGGWMA 
KYRSIHRQNL VQSTISGATF GAANSLLLPI GIPYLFGRPD LVVPMLIGAT MGMFIDWAML
YWFFDSRIFP GQAAWPPGVA AAEAIYAGDE GGKRAWLLVW GTIIGIIGSY FKVSMSAFGV
AFIGNVWALT MFGLGLLLRG YSVKLFGFDI DKLYIPHGMM IGAGLVAGIQ ILLILLKGRK
ETTASGDAPA AANYTRSEKQ VAKGLARGFG LYIVAALVLA MLGGLYTSMP AWQLLFWAVF
AAVSCILAEF IVGLSAMHAG WFPAFATALI FLVIGMALGF PAPALALLVG FVASGGPAFA
DAGYDFKAGW ILRGEGRDRG FELDGRWQQF LAGASGLVVA WAMVTLTHGI YFRQGLFPPV
DKVYAATIKA GVDAAIIKNL VLWAIPGALI QALGGSEKQL GIMLATGLLI LNPLAGYAVL
AGILIRTLVL KFKGREAETP MTILAAGFIA GDALYGFFN
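
The sequences above are printed in blocks of ten characters, so they must be joined before any analysis. The gene begins with GTG, which translation table 11 accepts as an initiation codon, so the protein still begins with M. The following is a minimal sketch, assuming the sequence text has been copied into a Python string. The helper names are hypothetical, and the short input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two 10-base blocks of the gene sequence, used as a small example.
fragment = clean_sequence("GTGGAAAAGC ACCCCAGGGC")
print(len(fragment))         # 20
print(gc_content(fragment))  # 0.65
```

Applied to the full 1560 bp gene sequence, the same two functions should give a value comparable to the GC content reported in the Gene Information section.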