Gene Moth_1696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1696 
Symbol 
ID3833296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1733727 
End bp1735142 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content61% 
IMG OID637829621 
Productsporulation protein and related proteins-like 
Protein accessionYP_430541 
Protein GI83590532 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG2385] Sporulation protein and related proteins 
TIGRFAM ID[TIGR02669] SpoIID/LytB domain 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.600188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGGAAAA CTTACCGCCG GGGGATGGCT GCCCTGCTCC TGGCCTTAAT ACTAATCCCT 
GCTGCCTCCG GGGAGGCGGC TTCCCGGCCC ATCCGCGTCC TGCTGGACAG TAGCCCCGGC
GAGGTGGAAT TCCAGGTGGA GCAGGGCGGT TACCAGCTGG TTGACGATCA CAGCGGCCAG
GAGATCGCCA CGGCCACTTC CGGGGTTAAG TGGACGGTCA GGCAGGACGG CAGCACCCTG
CAGCTTTTAA AAGATGGTGC CCCTGTAGGC AGCTTCAATG GCCCCATTCA GCTAAAACCT
GCCAGGGCAG GTCTCAACCT CTTCAGTTAC CGGGGCAACC GCTACCGGGG GAGCCTGAGT
ATCCTGCGGG GCGAGGGCGG TTTGCTGGTT ATCAACATCG TCGACCTGGA ACAATACCTT
TACGGCGTCG TTGGTAAAGA AATGCCGGCC AGCGCGGCCC TGGAAGCCCT TAAGGCCCAG
GCTGTAATCG CCCGTACCTA TGCTATCACC AGGATGCAAC CGTCCCAGCT CTACGACGTC
ACCGACGATA CCTCGACCCA GGTATACGGC GGTTATGAGG CCGAGGTCAA TTACGGCGCC
GCCAGGGATA AAGTTCTGCA GGCGGTAGAC AGCACCCGGG GAGAGGTGAT CTATTATGAC
GGCAAGGTCA TCCAGGCCTA CTTCCACGCC AACGCCGGCG GCTACACCGA GGATAGTGAG
AACGTCTGGA GCAATCCCCT GCCCTACCTG CGGGGCGTGC CCTCGCCCGA TGACGACTGG
GCCGTCAAGT ATCCCTACCA GACTCCTGGC GGTTACCCGG CCAATACATA TAACTGGACG
GTGACCCTGA CCAGGCAGCA GGTCCAGGAC CAGGTTAATA GCTGGCTTGC CGGTCAGGGT
AAAGGCGCGG TCGGGGAGGT GGTCGACCTG GTCCTTTCGC GGCTGGGGCG TGACGGCCAA
AAGGAGACGG TATCCGGCCG GGTAACCAGG ATGGATATCC GCACCACCAC CGGAACGGCC
CAGGCTTTCC GGGACGGCAT TCGCGCCGTC TTTGGCCTGA AAAGTACCCT CTTCACGGTG
CAGATGGACT CCACGGTGAA TGTCCTGGAC GGTTCCGGGC AGCAACGGGC GGTGAATTAC
GGCGCCGAAC TGGTAGCCCT GGGAGCCGGC GGCGTCCTCA ACGCCCCTAA CGGTGCCGCC
GGAGATTATA CGGTAGCCGG GCGCGACGGC ACACGCCAGG TACCCAAGCT CTTCACCCGG
GTAATCTTCC AGGGGAAGGG ATACGGCCAC GGCCTGGGCC TCAGCCAGTG GGGGGCCATG
GGCATGGCCG AAAAAGGGTA TACTTACCAG CAAATTATCG AACACTACTA CAACCAGGAT
CATTATGACG GCCACCTGAA GATTGCGACC TATTGA
 
Protein sequence
MRKTYRRGMA ALLLALILIP AASGEAASRP IRVLLDSSPG EVEFQVEQGG YQLVDDHSGQ 
EIATATSGVK WTVRQDGSTL QLLKDGAPVG SFNGPIQLKP ARAGLNLFSY RGNRYRGSLS
ILRGEGGLLV INIVDLEQYL YGVVGKEMPA SAALEALKAQ AVIARTYAIT RMQPSQLYDV
TDDTSTQVYG GYEAEVNYGA ARDKVLQAVD STRGEVIYYD GKVIQAYFHA NAGGYTEDSE
NVWSNPLPYL RGVPSPDDDW AVKYPYQTPG GYPANTYNWT VTLTRQQVQD QVNSWLAGQG
KGAVGEVVDL VLSRLGRDGQ KETVSGRVTR MDIRTTTGTA QAFRDGIRAV FGLKSTLFTV
QMDSTVNVLD GSGQQRAVNY GAELVALGAG GVLNAPNGAA GDYTVAGRDG TRQVPKLFTR
VIFQGKGYGH GLGLSQWGAM GMAEKGYTYQ QIIEHYYNQD HYDGHLKIAT Y