Gene Moth_2374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2374 
Symbol 
ID3832013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2499278 
End bp2500258 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content59% 
IMG OID637830293 
Productsporulation protein and related proteins 
Protein accessionYP_431199 
Protein GI83591190 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG2385] Sporulation protein and related proteins 
TIGRFAM ID[TIGR02669] SpoIID/LytB domain
[TIGR02870] stage II sporulation protein D 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000760464 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCAAGC TCATGGGGAT TTTCATTATC CTGGTATTTG CCGCGGTCAT AATTACGCCG 
GTTGTAATTA TCGAAGGCAT CCGCCTGTTT CAGCCGCCCG TCCAGGTCCA GACCGGCAAA
CAACTGGTAA GGGTCTACTT TCACCAGGCA GGTATCACTA AAATCATGCC CCTGGAGGAA
TATATAGCCG GGGTGGTCGC CGGGGAGATG CCGGCCAACT TCGAGCCTGA GGCCCTGAAG
GCCCAGGCCA TTGCCGCCCG CACCTACACC TTGAAAAAAA TCGAAGAAGC AAAGATCAAG
CCCGATGCCA GCCATCCCAA CGCCGACATC TGTACCGACC CGGCCCACTG CCAGGCCTGG
GCCGGGGATG ATGTCCTGCG CCAGCGTTGG GGCCTGATAG GCTTCTGGCG TTACAAAAAC
AAAATCCAGT CCGCAGTCCA GGCCACCAGC GGTATGGTCC TGACCTACCA GGGACAGCTC
ATTGACCCCG TCTATCATGC CAACGGCGGT GGTCGGACCG AAAGCGCGGC TGCCGTCTGG
GGCCGGGACG TACCCTACCT CCAGAGCGTG CCGTCACCCT GGGATAAAAC GTCACCCCGT
TATAGCGACA GCCGGACCTT CAGCCTCCGG TATCTGGATA GCAAACTGGG CGTCAACCTG
GAGGCCGTAC CGGCGGCAGC CCTGGCCGCG CCCGGGGGCA CAGCTATCAG GGTCCTGGAG
AAAACCCCCA CCGGTCGAGT CAAAACCATC AAAATCGGCG GCAAAACCTT TGCCGCCACC
GATTTACGAA AACTACTGGG ATTATCCTCG ACGGATTTCA CCTGGGAGGT CCAGGGGGAC
CGGATAACCT TTCATACCAT CGGCTACGGC CACGGCGTCG GCATGAGCCA GTACGGAGCC
AACGGTATGG CCCGGGAGGG CAAAAACTTC GCCGAGATTC TGGCTTACTA CTATCGCGGT
ACGAAGATTG AGAACAGATA G
 
Protein sequence
MRKLMGIFII LVFAAVIITP VVIIEGIRLF QPPVQVQTGK QLVRVYFHQA GITKIMPLEE 
YIAGVVAGEM PANFEPEALK AQAIAARTYT LKKIEEAKIK PDASHPNADI CTDPAHCQAW
AGDDVLRQRW GLIGFWRYKN KIQSAVQATS GMVLTYQGQL IDPVYHANGG GRTESAAAVW
GRDVPYLQSV PSPWDKTSPR YSDSRTFSLR YLDSKLGVNL EAVPAAALAA PGGTAIRVLE
KTPTGRVKTI KIGGKTFAAT DLRKLLGLSS TDFTWEVQGD RITFHTIGYG HGVGMSQYGA
NGMAREGKNF AEILAYYYRG TKIENR