Gene Moth_1373 details


Gene Information

Locus tag: Moth_1373
Symbol:
ID: 3831620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1418789
End bp: 1420375
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 53%
IMG OID: 637829309
Product: GerA spore germination protein
Protein accession: YP_430229
Protein GI: 83590220
COG category:
COG ID:
TIGRFAM ID:
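
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1418789
end_bp = 1420375
gene_length_bp = 1587
protein_length_aa = 528

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under those assumptions, all three checks hold for this record: 1420375 - 1418789 + 1 = 1587, and 1587 / 3 - 1 = 528.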


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATTT CCGGCGACAT TGTAAAAGTT GATCCCAATT TAGATGTTAA CAATAACTGG 
ATTATGAAGG AACTGGGCGT TGGGCAGAGT TTTGACGTTA TCAGGCGGGA TATAACCATT
GCCGGGCGCC GGGCTACTAT GTTTTATATT AATGGTATGG CCCGGGAGGA TGTCCTGGTA
TATATCCTTA CCTCCCTCTC CAAGCTGAAA CGTGAGGACG TAACCCCCGA CGCATATACC
AAGATCTTAA ACCAGTATAT TAGCTACCTG CAGGTCGAAG CCCTGGACGA TCTGCATAAA
GTCGTTGATA AAATCCTGTC CGGGGGCATT GTCATCCTTA TAGATGGCTT TAAAAAGGTT
ATCCAGCTCG ATGCCCGGAA CCACCCGGTC CGCAATCCGC AGGAATCCGA CCTGGAGCGG
GTGGTACGCG GCTCCCGCGA CAATTTTGTC GAAACCCTGT TATTTAACGT CAATCTCCTG
CGTCGCCGGG TGCGGGACCC CAAACTGCGG ACGGAAATCC TTCAGGTGGG TAGTCGTTCC
AAAACTGATG TGGCCGTCGT TTATGTTCAA GATATCGCCA ACCCGCGGCT GGTAGATACA
ATTAAAGAAC GAATCAAGGC CATAAAGATG GACGGCCTGC CCATGGCGGA GAAATCGCTG
GAGGAGTTTA TTAGCCCGGG GAGCTTCTGG AATCCCTTCC CACGGGTACG CTACACCGAG
CGGCCGGATG TAGCTGCCGC CCATCTCTTT GAAGGTCACG TACTTGTCAT GGTGGATACT
TCCCCCAGCG TAATGATCCT GCCAGCGACT ATTTTCCACC ACCTGCAGCA CGCCGAGGAG
TTTCGCCAAG CGCCCCTTAT TGGCACTTTT TTACGCGTGG TACGCTTTAT CGGAGTATTA
CTTTCTTTAT TTCTGCCTCC CGTATGGCTC CTGGCCGCTT TGCAGCACGA TCTACTCCCG
CCGAACCTGG CCTGGATCGG CCCCAAACAG CTGGGATACA TTCCCCTGGT GTGGCAGTTT
CTCTTTGCCG AACTGGGCAT CGATTTAATG CGCCTGGCGG CCATCCATAC CCCCACCTCC
CTGGCTACCG CCCTGGGTTT GATTGCCGCC GTTCTGATTG GCCAGATCGC AGTGGCAGTA
GGTTTCTTTA ACCCTGAGGT TATTCTCTAT ATGGCCATTG CCGCCGTGGG TATCTTCGCC
ACCCCGAGCT ATGAACTGGG GATGGCCAAC ACCCTGGTAC GGATTGCTCT GTTAATCGGC
GTGGGGCTGC TCCGTTTGCC CGGTTTTGTG GCCGTTACCA TGGGTATCTT TTTATTGCTG
CTGACCACAA AGTCCTTCGG CCTGCCCTAC CTGTGGCCCC TGATACCCTT TAACGCCAGC
GCCCTGAAGG ACGTCCTGGT CCGTCCACCG GTTCCTTTAC TGAAACTGCG GTCGAAAGCA
ATGCATCTTC TGGATATGGA CCGGCAACCC TATCCGGCTC CGGCCCGCAA ACCGTTAAAA
CGCAGTCTTA AAGAGAAAAA GGAAGAGGAG AAACGCATTT GGCCCGAACT TGAAAAGAAA
AAGGAAGATG ATGGTAGTGA AGGTTAA
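
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. As a hedged illustration of how a GC content figure could be derived from such text, here is a small sketch. The helper name `gc_percent` is hypothetical, and the record does not state how its own GC value was calculated or rounded.

```python
def gc_percent(formatted_sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks used for block formatting)
    is ignored. Hypothetical helper, not taken from the source page.
    """
    # Remove all whitespace and normalise case.
    bases = "".join(formatted_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Usage on the first line of the gene sequence only; pasting the
# full sequence text in its place gives the whole-gene figure.
first_line = "ATGACCATTT CCGGCGACAT TGTAAAAGTT GATCCCAATT TAGATGTTAA CAATAACTGG"
print(f"GC content of first 60 bases: {gc_percent(first_line):.1f}%")
```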
 
Protein sequence
MTISGDIVKV DPNLDVNNNW IMKELGVGQS FDVIRRDITI AGRRATMFYI NGMAREDVLV 
YILTSLSKLK REDVTPDAYT KILNQYISYL QVEALDDLHK VVDKILSGGI VILIDGFKKV
IQLDARNHPV RNPQESDLER VVRGSRDNFV ETLLFNVNLL RRRVRDPKLR TEILQVGSRS
KTDVAVVYVQ DIANPRLVDT IKERIKAIKM DGLPMAEKSL EEFISPGSFW NPFPRVRYTE
RPDVAAAHLF EGHVLVMVDT SPSVMILPAT IFHHLQHAEE FRQAPLIGTF LRVVRFIGVL
LSLFLPPVWL LAALQHDLLP PNLAWIGPKQ LGYIPLVWQF LFAELGIDLM RLAAIHTPTS
LATALGLIAA VLIGQIAVAV GFFNPEVILY MAIAAVGIFA TPSYELGMAN TLVRIALLIG
VGLLRLPGFV AVTMGIFLLL LTTKSFGLPY LWPLIPFNAS ALKDVLVRPP VPLLKLRSKA
MHLLDMDRQP YPAPARKPLK RSLKEKKEEE KRIWPELEKK KEDDGSEG