Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2042 |
Symbol | |
ID | 3831188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2131302 |
End bp | 2132984 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829971 |
Product | GerA spore germination protein |
Protein accession | YP_430881 |
Protein GI | 83590872 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.480991 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCGGTC CTTACCGTAG ACCCCTGCGG GCGGCCCTTT TCCGCCGCCG GCAAAAGCCG ACCCAACAGG GCAAGCAAGA TAACAATAAA GATAAACAGG GCCAGCCTGG CAGCGGGAAT AAACACACGG CGCCTTACCC GCCCCTGGAA GAGTACCGGC TTACGGCCGA GCTCAAGCTC AACCTGGAGC AGCTACAAAA TATATTCGCC AAGTGCAGCG ATGTGGTCCG TCGCGATTTT TACCTGGGCC AGGCCGTCAA GGTAGCCCTT TTTTATATCG ACGGCCTGGC TGATAAAAAG CTGATCGAGG ACACTCTGGT GCGGACTCTG CAGCAGGCAC CTGCGGAACT GGCCCGCTCC CTGGCGGAAA GAACCCGGGA CCTGACGGCG GTTATGGAGA TGGTCATGCC TATGACGGAC GTGGCCGTGG TTGACAACAT GGGGAGTTTG GTTAAGCACG TCCTCAACGG TGATACAGTC TTGATCCTAG ACGGGTTTAA CAAGGGCCTG GCCTTGAGCA CCAGGGCCTG GGAGCACCGT GCCGTCGAAG AACCCATTAC CGAAAGCGTA GTTCGTGGCC CCAGGGAGAG TTTTATCGAG AGCCTGCGCA CCAACACCTC CCTTATCCGC CGGCGCCTGA AAACCCCGTC CCTGAAGATG GAGACTGTCT GCCTGGGTCG CTACACCATG ACCAACATTG TCATTACTTA CATAGAGGAC CTGGCTGACC CGGCGGTGGT GCGGGAGGTG CGCCGGCGCC TGGAGCGCAT TCAAATTGAC GGCGTCCTGG AGAGCGCCTA TATTGAGGAA CTGATCGAAG ACAACCCGGA TTCCCCTTTT CCCCAGATCG ACCACACGGA GCGACCGGAC AAAGTGGCTG CCGCCCTGTT GGAGGGGCGG GTGGGCATCC TTATTGACGG CACGCCCTTC GCCCTCATTG TGCCCACCGT TTTTGTCCAG TTCCTTCAGG CCAGCGAGGA TTACTATGAA CGCTATTATC TTTCCAGTTT CTTACGTTTA GCCCGATTCG TAGCCCTGAA TATCGCCCTC TTACTGCCGG CTGTGTACGT GGCCGTCACT ACCTTTCACC AGGAGATGCT GCCTACTAAC CTGCTGGTGA AGGTGGCTTC CCAGCGAGAA GGGATTCCCT TCCCTGCCTT GGTGGAGGCC CTGTTGATAG AGCTGACCTT TGAACTCCTG CGGGAGGCCG GCGTGCGCCT GCCGCGTCCG GTGGGCCAGG CGGTGAGCAT TGTCGGCGCC TTGGTCATCG GCGAGGCGGC AGTCAGCGCT AGCCTGGTTT CCCCGGCCAT GGTCATTGTC GTGGCTATGA CGGGGATCAG CTCCTTTGCC ACACCCTCTT ACTCCATGGC CATTACTTTG CGCCTGCTCC GTTTCATCAT GCTTATCCTG GCGGGAACCC TTGGCTTTTA CGGTATCATG CTCGGCCTGC TGGCCATCCT TGTACACCTC AATACCTTGC GCTCCTTCGG CGTACCCTAC CTGGCGCCGG TGGCACCCTG GAATTTTAGT GACTTTAAAG ATGTCGTGGT CCGGGTGCCC CGCTGGGCCA TGAACTCCCG GCCCAGCCAA ATCGGCTACC GCGACCCCGT CCGCCAGGGG GCAGGTCTGA AGCCTACGCC GCCGGCCAGT TCTAATCCTG CGCCTGAAAG GGGCGAGTCT TAA
|
Protein sequence | MPGPYRRPLR AALFRRRQKP TQQGKQDNNK DKQGQPGSGN KHTAPYPPLE EYRLTAELKL NLEQLQNIFA KCSDVVRRDF YLGQAVKVAL FYIDGLADKK LIEDTLVRTL QQAPAELARS LAERTRDLTA VMEMVMPMTD VAVVDNMGSL VKHVLNGDTV LILDGFNKGL ALSTRAWEHR AVEEPITESV VRGPRESFIE SLRTNTSLIR RRLKTPSLKM ETVCLGRYTM TNIVITYIED LADPAVVREV RRRLERIQID GVLESAYIEE LIEDNPDSPF PQIDHTERPD KVAAALLEGR VGILIDGTPF ALIVPTVFVQ FLQASEDYYE RYYLSSFLRL ARFVALNIAL LLPAVYVAVT TFHQEMLPTN LLVKVASQRE GIPFPALVEA LLIELTFELL REAGVRLPRP VGQAVSIVGA LVIGEAAVSA SLVSPAMVIV VAMTGISSFA TPSYSMAITL RLLRFIMLIL AGTLGFYGIM LGLLAILVHL NTLRSFGVPY LAPVAPWNFS DFKDVVVRVP RWAMNSRPSQ IGYRDPVRQG AGLKPTPPAS SNPAPERGES
|
| |