Gene Moth_1326 details


Gene Information

Locus tag: Moth_1326
Symbol:
ID: 3831036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1370971
End bp: 1372263
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 59%
IMG OID: 637829262
Product: stage II sporulation protein P
Protein accession: YP_430182
Protein GI: 83590173
COG category:
COG ID:
TIGRFAM ID: [TIGR02867] stage II sporulation protein P
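
The coordinate and length fields in this record are consistent with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1370971, 1372263)
print(length, protein_length(length))  # 1293 430
```

Both results match the Gene Length and Protein Length fields of this record.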


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0674472
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAAAG CCAGCCGGTC TTGGCGAATC CGGGGTAGCC TGGGCCTGCT GGTCCTGGCC 
CTGGTACTGG TAGGAGTCGT GTATACCCGG AGCCAGCAGC AGGGGACGGC TGTCCGTGTC
TTCAGCCTGG CCGAGTTATT GCCGGGGAAC CACACCACCG GGCAATATTC AATCTTGGTT
GATGAACAGG GCCGGGTCCT GGATATGATG GCCCGCCGGA TATATATCCA CGATGAATTT
ATCTCAGCCG ATAACCGTCG TTACCGGGTA ATTCGCATTG AAGGTAATAA AGGCATCTGC
CGGGAAATAG GGGTGGAGCA GATTTCCCAG GAGGATACCG GGGTTCCCGC CCAGGCAGGC
CAAACCATGC CCGGCGATGG AACTAATCCG GTCCAGGCTG CCGGTAGCCA GATAATTGGC
GTATATCACT CCCATGATGA TGAATCCTAC GTTCCCTCCG ACGGTAGCCA GAGTATCCCC
GGGAACGGTG GCGTTTTAAG GGTAGGGAGC GCTTTTGCTG ATCGTTTACG CAGCCTGGGC
TTGACGGTCG TTCACGACAC GACCTCCCAT GCCCCCCATG ATGACGGCGC CTACCGGCGC
TCCCGCCGGA CGGCCATGTC CTTAATGCAG AGGGGAGCGG CAGCCTTATT TGATATCCAC
CGGGACGGTG TACCGGATCC AACCTTTTAC CGCCGGACCA TCAACGGCCA GGATGTAACT
ATGGTCCGCC TGGTGGTGGG ACGGGAAAAC CAAAACATGA GCGCCAACCT GGACTATGCC
AAAAGACTAA AGGCAGCAGC CGATGCCCGT TATCCCGGCT TAATCTGGGG GATATTCATC
GGCGCTGGCA GCTATAACCA GGACCTCTCG CCAAGGGCGA TATTGCTGGA AGCAGGTAGT
CATACCAATA CGCTGCAGGA AGCAGAACGG GGCGTCACCC TTTTTGCCGA CGTCGTGCCG
CCGGTCCTGG GGTTTGCCGC CCGGCCCGCT GCTGCCCGTA CGCCCAGTAC GGCCGCCGAC
TGGAGGGGAG TCCTCTACGT CCTCCTGGCC TTTGTAATCG GTGGCGGTGC CTTTCTCCTG
ATTTCCGCCG GTAGCTGGGA GAAGGCCGTT GCCCGGGTGA AGCAGTTTAC CTCTATAGAA
TGGGTTAACC TGCTGGGCTG GCGGCAGCTG CGCAAACCCG GAGTTGACCG GAACAAAATA
ACAGGCCGGG AAAGGGAGGC GGTCGAACTG GCGCCAGTCC CCCCTCGGGA ATTGGAAGCC
AATGACGAGC GGGCCGACTG GCAAAAGGAC TGA
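
The sequence is listed in blocks of ten bases, so the spacing must be stripped before any analysis. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field of this record. The helper names are hypothetical, and only the first line of the listing is used as a demonstration.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration;
# paste the full listing to compare against the reported GC content.
first_line = "ATGGGTAAAG CCAGCCGGTC TTGGCGAATC CGGGGTAGCC TGGGCCTGCT GGTCCTGGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```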
 
Protein sequence
MGKASRSWRI RGSLGLLVLA LVLVGVVYTR SQQQGTAVRV FSLAELLPGN HTTGQYSILV 
DEQGRVLDMM ARRIYIHDEF ISADNRRYRV IRIEGNKGIC REIGVEQISQ EDTGVPAQAG
QTMPGDGTNP VQAAGSQIIG VYHSHDDESY VPSDGSQSIP GNGGVLRVGS AFADRLRSLG
LTVVHDTTSH APHDDGAYRR SRRTAMSLMQ RGAAALFDIH RDGVPDPTFY RRTINGQDVT
MVRLVVGREN QNMSANLDYA KRLKAAADAR YPGLIWGIFI GAGSYNQDLS PRAILLEAGS
HTNTLQEAER GVTLFADVVP PVLGFAARPA AARTPSTAAD WRGVLYVLLA FVIGGGAFLL
ISAGSWEKAV ARVKQFTSIE WVNLLGWRQL RKPGVDRNKI TGREREAVEL APVPPRELEA
NDERADWQKD
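
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal standard-library illustration, not the database's own method. It assumes the sequence is read in frame from its first base and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base T, C, A, G; then second; then third.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Alternative start codons (allowed by table 11) are not special-cased;
    this gene starts with ATG, so that does not matter here.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGGTAAAG CCAGCCGGTC TTGGCGAATC"))  # MGKASRSWRI
```

The ten residues printed match the first block of the protein sequence listed above.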