Gene Moth_2040 details


Gene Information

Locus tag: Moth_2040
Symbol:
ID: 3831186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2128780
End bp: 2130204
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 62%
IMG OID: 637829969
Product: spore germination B3 GerAC like
Protein accession: YP_430879
Protein GI: 83590870
COG category:
COG ID:
TIGRFAM ID: [TIGR02887] germination protein, Ger(x)C family
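
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2128780
end_bp = 2130204

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1425 474"
```

Both values agree with the Gene Length (1425 bp) and Protein Length (474 aa) fields.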


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTGGT GCCGGCGGGT ACTGGCTGCT GTCTTGCTGA TCCTGGGCCT GCCCACCCTA 
TCTGGTTGTT GGAGCCGGCA TGAGATTGAG GAACTGGCTA TTATCATGGC GGTAGCCCTG
GATAGTGCTC CCGGGGGCCA GGTACGCCTG ACTACCTTTG TAGTTATCCC CAGGGCCGTA
GGCGGCGGCC CCTCCATGGG CGGCGGTGGT GGCGGTGAAC CCAAAAAAGC GGGCGAGGTC
GTCAGCGCCG TTGGTAGCGA CTTTGCCCAG GCGACCAGGC GCCTGGAAGG ACAGGTGCCG
CGGCGCCTCT TTTGGGCCCA GAACAGGGTG ATAATTATAG GCGAGGACCT GGCCCGGCGC
GGCCTCAATT ACTTAGACCT CTTCCTCCGT GATCGCCAGA TGCGCCTGAC AACGCCTATT
TTAATCACCA AGGGCGAAGC GCGTAAGGTG CTGGACCTGC CGCCGGGGAT CGAGCTGAAT
CCCGGTACCA TCCTGACGGG GATCTTGCGT AATCGCACCA CGTTTAAAGT GGAGTTAAAG
GACCTCCTGG CTATGTGGGA GGCGCCCGGA GACAACCCGG CCCTGCCGGA GGTGGTTATG
ACGCCGACAC CGGAGCAGGA GAGCAAGGAA ACAGGCGGCG AGGGTCGGGA CGCTAAAGGA
GGCGGCAACG AAGGAGGAGG GAAAGGGAAA ACACAACCCG AGGCTGTGGT AATTAAAGGC
GCCGGCGCCT TCCGTTACGA CCGCCTGGCT GGCTGGCTCG ACGAAAAGGA GGCTAACGGG
CTGATGTGGC TGCGGGGAGA GGTAACGGAG GGGGTTGTTA CGGTACCCCT GCCCGCCGGA
GCCACGGGAA CGGGGCAGGG TACGGCTGGT GGCCTAAGTT CAACCACGGG TGAGGGAGGC
AGTCCCGGTC AGGGACAGAC CCCCGGCGCG GAGAAGGGGG GTAACCGGGA TCTGCCCCTG
GCGCAAACCA CCCTCTCCGC GCCGGCTCAG GTTTCCGTCG TCTTTCACCG CGTTAGCGCC
AAAACCAAGG CCCAGGTTCA GGGCCAGCAG GTTACCTTTA TGGTGGGGAT CCGTGGTCAA
GGAGACTTGA TAGAGGTGAC GGGTAACTTT AATCCCGATA ACCAGGAACA GTTGCTCTCT
CTCCAGGAGG CCGTCAACAA GGCCATCGAG GAACGCGTTC TTCTCGCCCT GCGTAAAGCC
CAGCAGGAGT TCCAGACGGA TATCTTCGGT TTCGGCGAGA TCCTGCACCG GAGCAACCCC
CGTTACTGGC ACCAGGTCCA GGATCATTGG AACGAAGAAT TTACCCGGGC GCGAGTACAG
GTGCAGGTGG AGTTACAGCT ACGAAGGACG GGCATGACCA ACAGGACCCC TGGTAGAGTG
GGGGCCGTGG GCGGTAAAGT CTCCACCCAG GGCATGACGA GATAG
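
The gene sequence is displayed in blocks of 10 bases separated by spaces, so it has to be joined before any analysis. The sketch below assumes the sequence text has been copied into a Python string; the helper names `clean_sequence` and `gc_percent` are hypothetical and chosen for illustration only.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first two blocks of the gene sequence are used here;
# paste the full sequence text to analyse the whole gene.
first_blocks = clean_sequence("ATGATCTGGT GCCGGCGGGT")
print(len(first_blocks), gc_percent(first_blocks))
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the percentage with the GC content field.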
 
Protein sequence
MIWCRRVLAA VLLILGLPTL SGCWSRHEIE ELAIIMAVAL DSAPGGQVRL TTFVVIPRAV 
GGGPSMGGGG GGEPKKAGEV VSAVGSDFAQ ATRRLEGQVP RRLFWAQNRV IIIGEDLARR
GLNYLDLFLR DRQMRLTTPI LITKGEARKV LDLPPGIELN PGTILTGILR NRTTFKVELK
DLLAMWEAPG DNPALPEVVM TPTPEQESKE TGGEGRDAKG GGNEGGGKGK TQPEAVVIKG
AGAFRYDRLA GWLDEKEANG LMWLRGEVTE GVVTVPLPAG ATGTGQGTAG GLSSTTGEGG
SPGQGQTPGA EKGGNRDLPL AQTTLSAPAQ VSVVFHRVSA KTKAQVQGQQ VTFMVGIRGQ
GDLIEVTGNF NPDNQEQLLS LQEAVNKAIE ERVLLALRKA QQEFQTDIFG FGEILHRSNP
RYWHQVQDHW NEEFTRARVQ VQVELQLRRT GMTNRTPGRV GAVGGKVSTQ GMTR
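
The protein sequence corresponds to a translation of the gene sequence using translation table 11. The sketch below builds a codon table from the NCBI amino-acid string for table 11, whose amino-acid assignments match the standard code; the tables differ in permitted start codons, which this sketch ignores. The function name `translate` is hypothetical, and the sketch assumes the input contains only A, C, G and T.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, first position slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError for non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGATCTGGT GCCGGCGGGT ACTGGCTGCT"))  # prints "MIWCRRVLAA"
```

The printed residues match the first block of the protein sequence shown above.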