Gene Moth_0837 details

Gene Information

Locus tag: Moth_0837
Symbol:
ID: 3831534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 867387
End bp: 869522
Gene Length: 2136 bp
Protein Length: 711 aa
Translation table: 11
GC content: 56%
IMG OID: 637828767
Product: stage V sporulation protein D
Protein accession: YP_429697
Protein GI: 83589688
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0768] Cell division protein FtsI/penicillin-binding protein 2
TIGRFAM ID: [TIGR02214] stage V sporulation protein D
            [TIGR03423] penicillin-binding protein 2
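
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, not stated in the record). The function names are hypothetical.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based, inclusive.
START_BP = 867387
END_BP = 869522


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, "bp;", PROTEIN_LEN, "aa")
```

Under these assumptions the computed values agree with the Gene Length (2136 bp) and Protein Length (711 aa) fields.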


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.677302
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGGGA ATATCAATGT TCATAAACGC CTTGTCCTTG TTTTTTTGGT TTTGACATCA 
TGCCTGGCGA TAATCTTGTT GAGGCTGGCC TGGTTACAAT TCGTGCGGGG CGGCGAATTA
CAACAAAAAG CCCTCAATAA CCGGCTTAAT GAAATGCCGG TGGCAGCCCA GCGCGGGGTT
ATCTATGACC GCAACCGGCA CGAACTGGCC GTTAGCATCG AGAGTGACTA TATCGGCGCC
TTTCCGCCGG AGATCAAAGA TTCCGGTAAA GCCCAGGAAA TAGCCAACAC CCTGGGTTCC
ATCCTTGGCC TTCCCGCGCA AAAGATCCTG GAGAAAATAA CTAGCAACAC CGGCTTTGAG
TTTGTCGAAC GCCAGGTTGA CTATACAAAA GCCCAGCAAA TCAAGAATCT GGTTAAGGAA
AAGAAATTAC CGGGGATCGA AGTGGTACCG GAGAGTCGGC GGTATTACCC CAATGGCCCC
CTGGCTGCTC AGGTCCTGGG TTTTTCCGGT ATTGATAACC AGGGCCTGGA AGGCATTGAG
GCCATGTATG ATAAGGAACT GGCCGGGGAG CCGGGTAAAA TCGTCATGGA GTTTGATGCC
CTGGGACACG AGATCCCCCG GGCGACCCAC CGTTATATAC CACCGCAGAA CGGCCACAGC
CTGGTGCTGA CCATTGATCA AACCATCCAG TATATTGCTG AACGTGAACT GGATAAGCTC
ATGGCCAGCC CTACCAATCC CAAAAAAGCC AATATTATTG TCATGGACCC CAAAACAGGG
GAAATTCTGG CTATGGCCAG TCGTCCGGCC TTTGATCCGA ATAACTTTAA TAAATACCCC
AACAGCGTCT GGGGCAACCC CCTGGTGCGG GATGCCTACG AGCCCGGTTC GGCCTTTAAG
ATCATCACCG CGGCGGCGGC CCTGGAAGAA GGGGTAGTCA AGCCGGGAGA CCGTTTTTAT
GATCCCGGTT ACATTAAGGT TGGGCCGGAC ACCATTAACT GCTGGTTGCC CGGGGGGCAC
GGCAGCGAGA CCTTTGTGGA TGGGGTCAAG AATTCTTGTA ACCCGGTCTT TGTCCAAACG
GCCCTGCGGT TGGAAGAAAA GAAAGCCGGT CTTTTTTATA ATTACATCAG GGCCTTTGGT
TTCGGGCAGC CTACGGGTAT CGATTTAACA GGTGAAGGTA CCGGTTTACT GATTCCCGAG
CAGGAATTAA AGCCCATCAA CATCGCCACC ATTGGCATCG GGCAGGGTAT CGCGGTTACT
CCATTGCAAC TGGTAACTGC AGTTTCAGCG GTGGCCAACG GGGGCAAACT CATCCGGCCC
CACCTGGTGA AGGAAATCCT GGATGATAAA GGCAATGTCC TGAAAAAGAT TGAACCAGAG
GTGGAACGGC AGGTCATCTC TCCCGAGACG GCCAAGCAGC TGCGGGAGAT GTTAGAAACC
GTTGTCAGCG AAGGTACAGG GCGTAACGCC TATATCCCCG GCTACCGGGT AGGCGGGAAA
ACGGGTACGG CCCAGAAGGC CGGTCCCGGC GGGTACATGG AAGGAAAATA TGTTGCTTCA
TTTATCGGTA TGGCCCCGGT CAACGACCCC CGCCTGGTGG CCCTGGTAAC AATCGACGAG
CCAAAAGGAT ATCCTTATTA TGGTGGTACC CTGGCGGCTC CCATTTTCCA GCGGGTGGTA
GCCGATGCCC TCCACTACCT GAAGGTACCG CCCCAGTACG ACAGCAAGCA GGGAAGTGAG
GAAAAACCGG CGGCCCCGGT CACCGTACCG AATGTTACCG GAAAGGACCT GGCTGCGGCC
AGGGCGGAAC TGGAAAAAGC CGGCCTGGCA GCGCGGGTCG AGGGTGACAG CGGCAAAGTC
ATTGCCCAGG TGCCCAAGGC AGGCGCCGTA GTACCTGCGG GTACAAAGGT AATCCTCTAC
CTCCAGGGCG ATCCCAACCG GCCCCGGGTG CCGGACGTCA ATAGCCTGAG GGTTACCGAG
GCGGCAGAAG TACTGGCAGC CTATGGGCTG CAACTGGTTC CCGAGGGAAC GGGTCAGGCC
GGGGAACAAA ACCCCATTCC CGGGACCATA GTCACGCCGG GAAGCAGCGT GCGCGTCAAA
TTCCACGAAC CGGCCCAGGA GGTTCTGGGG CCTTAG
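
The gene sequence above is printed in space-separated blocks of 10 bases. As a rough sketch, the blocks can be joined and the GC percentage computed as shown here; the helper names are hypothetical, and only the first 20 bases of the listing are used as sample input. Running the same function on the full sequence gives a value that can be compared with the GC content field of the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, as printed in the listing.
sample = clean_sequence("ATGCAGGGGA ATATCAATGT \n")
print(sample, len(sample), gc_content(sample))
```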
 
Protein sequence
MQGNINVHKR LVLVFLVLTS CLAIILLRLA WLQFVRGGEL QQKALNNRLN EMPVAAQRGV 
IYDRNRHELA VSIESDYIGA FPPEIKDSGK AQEIANTLGS ILGLPAQKIL EKITSNTGFE
FVERQVDYTK AQQIKNLVKE KKLPGIEVVP ESRRYYPNGP LAAQVLGFSG IDNQGLEGIE
AMYDKELAGE PGKIVMEFDA LGHEIPRATH RYIPPQNGHS LVLTIDQTIQ YIAERELDKL
MASPTNPKKA NIIVMDPKTG EILAMASRPA FDPNNFNKYP NSVWGNPLVR DAYEPGSAFK
IITAAAALEE GVVKPGDRFY DPGYIKVGPD TINCWLPGGH GSETFVDGVK NSCNPVFVQT
ALRLEEKKAG LFYNYIRAFG FGQPTGIDLT GEGTGLLIPE QELKPINIAT IGIGQGIAVT
PLQLVTAVSA VANGGKLIRP HLVKEILDDK GNVLKKIEPE VERQVISPET AKQLREMLET
VVSEGTGRNA YIPGYRVGGK TGTAQKAGPG GYMEGKYVAS FIGMAPVNDP RLVALVTIDE
PKGYPYYGGT LAAPIFQRVV ADALHYLKVP PQYDSKQGSE EKPAAPVTVP NVTGKDLAAA
RAELEKAGLA ARVEGDSGKV IAQVPKAGAV VPAGTKVILY LQGDPNRPRV PDVNSLRVTE
AAEVLAAYGL QLVPEGTGQA GEQNPIPGTI VTPGSSVRVK FHEPAQEVLG P
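
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the amino-acid string NCBI lists for translation table 11 (the same assignments as the standard code; the two tables differ in their permitted start codons, which this sketch does not model). The `translate` helper is hypothetical, and the sample input is the first 30 bases of the gene sequence.

```python
from itertools import product

# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...),
# as listed by NCBI for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the listing.
print(translate("ATGCAGGGGA ATATCAATGT TCATAAACGC"))
```

The printed residues can be compared with the first block of the protein sequence.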