Gene Moth_2506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2506 
Symbol 
ID3832778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2612099 
End bp2613085 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content52% 
IMG OID637830429 
Productspore germination protein-like 
Protein accessionYP_431331 
Protein GI83591322 
COG category[R] General function prediction only 
COG ID[COG5401] Spore germination protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0523043 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACTTT TCACAGTCCT GCTGTTAAGC CTGGTTTTAG TTATTAATGG TTGTGGTAGG 
AAAGCTCAAG CTCCTGCAGC AATTAATCCT GGTGCCAGCG AAGGTCCCAA GGAAATCAGA
CCGTTGCTCT ATGACCGGGA GCTGGACCGG GTCATCGTCT ATTACCTGAC CGGAGATGGT
CGCTACCTGG TACCGGTGAC CGTTAATTTT AATCCGACCA GGGAAGTTGC CAAGATAGCA
GTAGAGAAGT TATTGGCCGG CCCCCAGGGT GACGGGCTGA AACCGGTCTT TCCTGAGGGT
GTCAAGCTTC AAGATATCTA TCTTTTAAAC AACCAGCAAA CTGTCTATGT AAATTTGACC
AGGGAATTTC TGGATATCAA AGATGCCAGG CAGGCGGACC TAGCCGTTAA AGCCCTGGTT
CTAACGATGA CGAACCTCAC CAACGTCAAG GAAGTTCAGA TCCTGGTAGA GGGCAATAAA
GTACCTGAGG TAGCAGGGGT AAAATTGGAT GCCCCTCTAC ACCGCCCTGA CAGTGTTAAT
AGCCTGCTGA AGGACGGCAA TCAAAAGGGG GTTCAGGTGT TTTTCAACGA TGCTGATGCC
CGCTTTTTCG TCCCGGTAAC AGTTGCTATG CCGCCGGGGT CCAGTGCAGA TAATCTACCC
CGGGCGGCAG TTCTGGCCCT CTTGGCCGGT CCTCCTGCAG ATAGCGGTCT TATTCGGACT
ATCTGGCCCG GGACCAGGCT CCTGGACTTT AAGGTTGAGG GAGGCCTGGC CACGGTTAAT
TTCAGCCGCC AGGTCACCGG TTACGGCGGA GGCAGTGCCG CCGAGACCGC CCTGCTAAAA
TCCCTCCTCT TTACCCTGAC CCAGTTCCCA GATATTGACC GGGTACAGAT TCTTATTGAC
GGGAAGAAGA AGGAGTATTT ACCTGAAGGT ACGGCTATCG ATAAACCCCT GTCCAGGCCG
GAACTCCTCA ATCCCCTTAA TCACTAA
 
Protein sequence
MVLFTVLLLS LVLVINGCGR KAQAPAAINP GASEGPKEIR PLLYDRELDR VIVYYLTGDG 
RYLVPVTVNF NPTREVAKIA VEKLLAGPQG DGLKPVFPEG VKLQDIYLLN NQQTVYVNLT
REFLDIKDAR QADLAVKALV LTMTNLTNVK EVQILVEGNK VPEVAGVKLD APLHRPDSVN
SLLKDGNQKG VQVFFNDADA RFFVPVTVAM PPGSSADNLP RAAVLALLAG PPADSGLIRT
IWPGTRLLDF KVEGGLATVN FSRQVTGYGG GSAAETALLK SLLFTLTQFP DIDRVQILID
GKKKEYLPEG TAIDKPLSRP ELLNPLNH