Gene Moth_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1014 
Symbol 
ID3833317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1042638 
End bp1044239 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content49% 
IMG OID637828942 
ProductGerA spore germination protein 
Protein accessionYP_429871 
Protein GI83589862 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000120979 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGCGAA AATCCTGGTT TTATTTACGC AGAAAAAAGA AGGCCAGACC TAGCCCGGGG 
AAGGATCAAT TGGCGTCTCC CCTGGTAGAT GAAGGTAAAG CTATCGCTCT TCCTTTCCAG
CGTTCTTTAG AAGCCAACTT AAAAATCTTA CAAGAAATTT TTAGCGATTG CCAGGATGTC
GTCTACCGCA GGATGAAAAT CGGTGGTACA GGTGGATTGG CAGCTCTATT AGTTTATGTA
GATGGCCTGG TAGATAACGA GATTATCGAT ACCCAAATTG TCAAGTCTCT CTTGCTTGAA
GCCCACCAGG CAACCCCTGC CTTCACTCCA GATGGGAAGT TCTTCTACCA GGTGCGTGAT
TCCTTGTTAA CAGTTGCCAA TGTCCGGGAG ATAGCCAATT TCCGTGAAGC TGCCTTTTGT
ATTTTAAGTG GCTTTGCTCT CTTGATCCTG GAAGGCAGCA ATAAAGGTTT GGCCCTGGAG
GTTTGCGGTT GGGAACACCG TGGGGTTGAA GAACCTCAAA ACGAGGCGGT GATCCGTGGC
CCCCGGGATG GTTTTTCCGA GAGCTTCCGT ACTAATACGG CGCTTTTACG GCGGCGCCTC
CGCGATCCCC AGCTCAAGCT AAAGACCTAT TTTCTAGGGC GGCGCAGCCA GACTGCAGTG
GGACTATTGT ACTTGAAAGG TGTAGTTGAT CCTGAGTTGG TGGCGGAAGT AGAGGGACGA
CTTAAAAATA TTGATATTGA CGGCATCCTG GATAGCGGTT TTATCGAACA GTTAATTGAG
GACTGCTGGT ACTCTCCTTT TGCTCAAATC CAGAGTACAG AAAGACCTGA CGAGGCGGCT
GCCGCCCTCC TTGAAGGCAG GGTCATAGTC TTAACAGATA ATACCCCTTT TGCCCTGCTC
ATTCCCGCTA CCTTTAATTC CCTGATGCAC AGCCCGGAGG ACTTTTATCA TCGCTGGTTG
ATTTCATTCC TTATCCGGAG CTTGCGTTTT TTGGGAACTG TCCTGGCGCT CATTCTCCCC
TCCCTATATA TTGCCATGGT TTCCTTTTCT CCGGAAATGA TACCCACTTC CCTGGCCATT
TCTATAGGTG CCGGGCGGGA AGGGTTGCCT TTTCCCTCAA TAGTAGAAGC TTTAATAATG
GAGGTTGTCC TGGAGCTTCT CCGAGAAGCA GGGATCCGTT TACCCGGTCC TTTAGGCCAG
ACCCTGGGCG TAGTGGGAGC CCTGATCCTC GGCCAGGCGG CAGTTCAGGC CAACATCGTC
AGTTCTGCCA TGGTCATCGT CGTAGCCTTG ACAGCTATAA GTGGCTTTGT TGCCCCTAGA
TTTGATGCCG CCATAGCTTT ACGGATTCTC CGTTTCTTTC TTATGGTGAT GGCGGCATTT
TTAGGACTCT ACGGCATCAT ACTTGGGCTG ATGTTAATCC TGGCCCACCT GGCTTCCCTA
AAAAGTTTTG GTGTGCCCTA CCTTCAACCA TGGGCACCGT TACGGGTACC TGATCTTAAA
GACAGTGTTT ACCGTGCCGC TTTTTTTAAA CAACGCTGGC GCCCTTTCTA CCTTAAACCA
CGTGAAGTCA AAAGAATTGG TGACAGGGTG AAGAAAAATT GA
 
Protein sequence
MLRKSWFYLR RKKKARPSPG KDQLASPLVD EGKAIALPFQ RSLEANLKIL QEIFSDCQDV 
VYRRMKIGGT GGLAALLVYV DGLVDNEIID TQIVKSLLLE AHQATPAFTP DGKFFYQVRD
SLLTVANVRE IANFREAAFC ILSGFALLIL EGSNKGLALE VCGWEHRGVE EPQNEAVIRG
PRDGFSESFR TNTALLRRRL RDPQLKLKTY FLGRRSQTAV GLLYLKGVVD PELVAEVEGR
LKNIDIDGIL DSGFIEQLIE DCWYSPFAQI QSTERPDEAA AALLEGRVIV LTDNTPFALL
IPATFNSLMH SPEDFYHRWL ISFLIRSLRF LGTVLALILP SLYIAMVSFS PEMIPTSLAI
SIGAGREGLP FPSIVEALIM EVVLELLREA GIRLPGPLGQ TLGVVGALIL GQAAVQANIV
SSAMVIVVAL TAISGFVAPR FDAAIALRIL RFFLMVMAAF LGLYGIILGL MLILAHLASL
KSFGVPYLQP WAPLRVPDLK DSVYRAAFFK QRWRPFYLKP REVKRIGDRV KKN