Gene Moth_1462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1462 
Symbol 
ID3831348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1511892 
End bp1513142 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content59% 
IMG OID637829395 
Productspore germination B3 GerAC like 
Protein accessionYP_430315 
Protein GI83590306 
COG category 
COG ID 
TIGRFAM ID[TIGR02887] germination protein, Ger(x)C family 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAAT TGAAGAGGTT GGTGCTCTTT ACATCTATCC TGGGGGCATT TGCGCTGGCG 
TTTTCTGCCG GCGGCTGCTG GGATCGCCGG GAGATAAACG AGCTCGCTTT CCTTTCCTGC
GCAGCCTTTG ACCTGGAGGG CGGTAATCGC GTCCTGACAT CTGAGTTCAT CCGGCCCTCC
GCAGCCGGTG GGGGAGAAAG GGGCGGTGGG GAGACCTTGC CCCAGCGACA GGCCCTCATA
GTGAGTAACC GGAACAAGAC CTTCCTGGCC ATCGGGCGGG AAAAAGCCCT GGGGCTGCCC
CGTCGGGCCT ACCTGGCCCA TACTGCCGCC GTCCTGGTGG GGGAGGAGAT GGCCCGGTAC
GGCATAAAAG AAGTCCTGGA TTTTGTCGAC CGGAACCCGG AGATACGCCG CACCACCCTG
ATTTTACTTA CCCGCGGCCC GGCGCGGGAG GTGCTGGTCA GGGCCCAGAG CGGCTTGGAA
AAAACCCTGG GAAGGGAAAT AACCGGCCTT CATAAATGGG TCCAGGTCAG CGGTTACGGA
TATATCCCCA ACATTAACGA TATTTTTTTC GATTTGTCCG GTGATGCGGG AACAACCGTC
CTGCCGGTGC TGGAATTAAG TCCCCAGCCG TTCCCGCCCA TTCTCGGCCC CGCTACCGCG
ACGGGCGGCG GCATTCCCGC CGGGAGGGCT GGAGAACCGG AAACGCTAAT GACGGCGCGC
CTGAACGGCG CCGGGCTGTT CTACCATGAC AAATTGGTGG CGTGGCTGGA TCAAGAACAA
ACCCGCGGCT GGGCCTGGGT ACGCAATAAA GTTAAAAGTG CCATGCTGGC TTTACCCCGC
CAGGAAAACA GCCTGGTATC CGTAAATATC ATCTCTTCCC GGGCCGAGGC CGCTATCGAC
ATGCAAGGAG GCCGGCCCCA GGGCAAAATC AAGATCAAGG TGGAGGGCGA TCTCCTGGAA
GAGCAGTGCT ACCAGGACTT TACCAAAGAA GAAGCTGTTA AATCGCTGGA AAGCCGGATG
GCAGCCCAGA TCACGTCTGA AATCAGCAGC GCCCTCAATC AGGCCAAGAT GGCTGGTACG
GATGTTTTTG GCTTCGGCGG CGCCCTCCAC CGCCGGTACC CAGAAGTGTG GCGCCAGTTA
GAAGGACGTT GGAACGAGGA ATTCAAAAAG TTGCCTGTTA CCATTAGCGT CGAAGCCAAA
CTACGACGTA CCGGGATGAC TGGACGTCCC TGGCAGCCCG GGGCGCGCTA G
 
Protein sequence
MPKLKRLVLF TSILGAFALA FSAGGCWDRR EINELAFLSC AAFDLEGGNR VLTSEFIRPS 
AAGGGERGGG ETLPQRQALI VSNRNKTFLA IGREKALGLP RRAYLAHTAA VLVGEEMARY
GIKEVLDFVD RNPEIRRTTL ILLTRGPARE VLVRAQSGLE KTLGREITGL HKWVQVSGYG
YIPNINDIFF DLSGDAGTTV LPVLELSPQP FPPILGPATA TGGGIPAGRA GEPETLMTAR
LNGAGLFYHD KLVAWLDQEQ TRGWAWVRNK VKSAMLALPR QENSLVSVNI ISSRAEAAID
MQGGRPQGKI KIKVEGDLLE EQCYQDFTKE EAVKSLESRM AAQITSEISS ALNQAKMAGT
DVFGFGGALH RRYPEVWRQL EGRWNEEFKK LPVTISVEAK LRRTGMTGRP WQPGAR