Gene Moth_0953 details


Gene Information

Locus tag: Moth_0953
Symbol:
ID: 3832838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 983596
End bp: 987159
Gene Length: 3564 bp
Protein Length: 1187 aa
Translation table: 11
GC content: 63%
IMG OID: 637828883
Product: condensin subunit Smc
Protein accession: YP_429812
Protein GI: 83589803
COG category: [D] Cell cycle control, cell division, chromosome partitioning
COG ID: [COG1196] Chromosome segregation ATPases
TIGRFAM ID: [TIGR02168] chromosome segregation protein SMC, common bacterial type
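
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. Both are assumptions, not statements taken from the record, and the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(983596, 987159)
    print(length, "bp")                 # Gene Length field
    print(protein_length(length), "aa")  # Protein Length field
```

With the Start bp and End bp values from this record, the two helpers reproduce the listed Gene Length and Protein Length.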


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCTTA AGGGTATTGA GATTCAGGGC TTTAAGACCT TCGTCGACCG GGTCAGGCTG 
GAGCTTGGCC CCGGGGTTAC CGGTATCGTC GGTCCCAACG GTAGTGGCAA GAGCAATATC
GTTGACGCCA TCCTCTGGGT CCTGGGGGAG CAGAGTGCCA AAAGTTTGCG CGGAACCAGG
ATGGACGATG TCATCTTTGC TGGCAGCGCC CGGCGCCGGC CGGTGGGAAT GGCGGAAGTA
ACCCTGTACC TGGATAACAG CGACGCAAGC CTCCCCCTGG ACTTTCAGGA GGTAGCCATC
ACCCGGCGTT TTTTTCGTTC CGGCGAAGGC GAGTACTATA TAAATAAAGT TCCCTGCCGC
CTGAAGGATA TCCAGGAACT CCTCCTGGAC ACCGGCTCCG GCCGGGGCGG CCTGGCTATT
GTAAGCCAGG GTCGCCTGGA AGAAATCCTC GCCGCACGTC CGGAGGAACG GCGTTCTGTC
CTCGAAGAAA CGGCCGGTAT TGCCCGCTAC CGCCTGCGAA AAAAAGAGGC CCGGCAACGC
CTGGAGGCAG TAGAGCAGGA TTTAACCCGC TTGCAGGATT TAATTGGCGA ACTAAAGGAT
CAACTTGGTC CTGCCGCCGT TGAGGCGGCC AGGGCGCGGC GACACCAGAA ACTGATGGCC
CTTTTGAACC TGGTGGAATT AATCCTCAAG TCCCGGGAAA TGGCGGAAAG CCAGAACCGC
CTCCGGCGCA TCCGGGAGCG CTGGCAGGCC CTGAAGAGCC AGGAGGAGGA ATTGGAGGCC
AGGGAAGAGG AACTCGCAGG GCGCTCTCGG GACATCCGGG AACGCCTGGC CCGGAACCAG
GAAGCCAGGG AGCGTAGCCG TGTCGAACTG CAAGGTCTCC GGGAGCAACT GGTTCAGGTC
CGGGGCCGGT TGAGCCTGGT GGATGAAAAA CTGGCGGCCC TGGCCCGGCA AAGAGTAGAG
GACCGGGAGC GGGAAGGGCT TCTGGCCAGG GAGGAGGAAA AGTTGCGGGC GGCGGCCGCC
GAGCTGGCCC GACAGGTGGA GACAGGTAGG GAAGAGATGG CCGCCCTGGA ACAGGACCTG
GCTGCCGGTC GGGAGACCCG GGAAAAACTC CGGGCCGAAA GGGATGAACT GGCGGCAAGA
TTGGCACGAT TAAAAGAAGA TCTCTTCCAG GTAGCCCACG AGCGCGCCGG GTGTCACAAT
GAACTGGTAC GCCTGGAGGA AAAACAGGCT GGCATGGAGC GGGTGCTGGA GCAGAAGCAA
CGCCAGCTTC AGGAGCTAAA TAATGAACGG GAGCGCCTGG AGGGCCTCCT CCGGGCCGGG
GAGGAGCAAC TGGGGGAGAT TGAGGCGAAC TTAAAAGCCC TGGAGGGGAA GAAAGCAAGC
CTGGAAACCG AACTACCCCT GCAGGAGGCC GACCTGGCTG CCCGTGAGAA GCACCTGGCC
GGTTTAAAGG AGCAGCAAAG GCTGCTTGTG GCCAGGCTGA AGGTTTTACG CCAGGCCCAG
GCCGACTATG AGGGCTTCGG CGAGGGCGTG CGGGCTATTC TCCAGGCCAG GAGCCGGGGA
GAAGCCGCCT GTGCCGGCGT CCTGGGGGTA GTTGTCGAAA AGATAGAGGT GCCCGGGGAA
CTTACCAGGG CCATAGAGGT CGCCCTGGGC GGTGCCGCCC AGCAGGTGCT GGTGAGGACG
GCCTCCGAAG CCGAGAGGGT TATCCAGTTT TTAAAGTCCC GGCGCCACGG CCGGGCGACC
ATTTTACCCC TGGCCTGGCT GGAGCCGCGT CGCTGGCCGA ACTGGGCCGG CTGGGTGCTG
AATGAGCCCG GGGTTGTGGG GGTGGCAGCC GCACTGGTCC GGAGCGAAGC CGAAATCCGC
CCTGCGGTTG ATTACCTCCT CGGCCAAATC CTGGTCGTGG CCGATCTCCG GCGGGCCTTG
GATCTGGGGG AACGCCTGCG GCCGCCGGTA CGCCTGGTGA CCCTGGAAGG GGAGGTCATC
CAGCCCCGGG GACCGGTCAC CGGTGGTAAT ACCAGGCAGA GAGCCGGCTT TCTCCAGCGC
CGCCTGGAGA TCCAGCAGGG TGAGACCGAA CTGGCCAATC TGGTCGCCAG GCTTAATGAC
GCTCGGCAGC AGGCCAGAAA ACTGGCCAGC ACCCTGGAAA CCGGCCGGCA GGAACTGCGC
CGGGTAACAG AGGCTTTGAT TGCCCGCCGG GGGGAACTTC ACAATTTATT ACAGCGCCTG
AGCGAGTATA AAGATCAGCT GGCGCGCCTG GCGGAAAAGA CGGCGGTACT GGGAGAAGAA
CTGGCCCGGA GTACCACCGA TTCTCGAGAA CTGGTAACTA GCCGGAGGGA GAGGGAGGAA
CTCCTGACCC GCCTGGAAGC CAGGGAAGGA GAGCTCCAGG GAGAATTAAC GGGCTGCCAG
GAGCAGCTAA ACGCCTGCCA GCAGGCCCTG GCTGCCGTGG AGCAGGAACT GGCAGTAAAC
GAAACCCGGC AGCAGGCCCT GGCGAAGGCC GGGGAGCAGC TGGCCGCGCG CGTAGAGGAA
TTGGCCCGGC AGAAGGAAAA CTGGCGGCGA CAACAGGCTG AACTGGCTGC CAGGATGACA
GCGGCCGCAA CCGCGACCAA TGAACTCCAG GAAAATAGGG AAAAGCTGGC CAGGGAAGAG
GAGTGGCTGG CAGGGGCTAT TCAACAGGCT GAAGAAGGGC TACAACGCCT GGATAATGAC
GGTTCGGCCT GCAGCCAGCA ACAGGAAGAG CTGGCCCGGG AACTGGAGGA ACTTCGGGCC
CGGAAAGGAA AAATCGCTGC CCACAGGCAG CAGGAAGAAC TCAATCTGGC CCGGCTGGAA
ACGACCCTGG AAGGTAGCCG GGCCGAACTG GAAGAGCGGT TTGGCCCCGG CTGGCAGGAA
GTGCTCCAAA AACCCCGGCG CCACCTGGAA AAAGAGGCTC CCCGCCTGCG CAGGGTTTTG
CAGGAGAAAC TGGCAGCCCT GGGGGAGGTC AATCCAGGGG CTCCCCGGGT TTACGAGGGC
CTAAGGAGGC GTTTTGAGGA ACTGGAGCAG CAGCGACAGG ACCTGGAAGA AGGGCGGGCG
GCCCTGGAAC AGGTAATCGC TGAAATGGAA AAGCTAATGG CCCGCCAGCT CCGGGCCACC
CTGACCGCCG TCCAGGAACA CTTTGCCGCC CTCTTCAGGG AACTCTTCGA GGGCGGCGAG
GCCAGCCTGG AGCTCACCGG GAGCGATAAC ATTTTAGAAG CAGGCCTGGA GATCATCGCC
CGGCCCCCGG GCAAGAAACC CCAGCACCTG GCCCTCCTCT CCGGCGGCGA GAAGGCCCTG
ACGGCGGTGG CCTTTATCTT TGCCCTCCTT AAGGTCAAGC CCAGCGCCTT CTGTATCTTT
GACGAGGTGG ATACGGCCCT GGATGAGGCC AATGTGGAAC GCTTCGCCCG GCTGTTGCGC
CAGTTTGCCA GCCGGACCCA GTTCATCGTC ATCTCCCACC GCCAGGGTAC CATGGCCGCC
GCCGACGTCC TTTACGGGGT GACCATGATG GAACAGGGTG TCTCCCGCCT GGTCTCGGTG
CGGCTGGAAC AACTGCCGGC TTAA
 
Protein sequence
MFLKGIEIQG FKTFVDRVRL ELGPGVTGIV GPNGSGKSNI VDAILWVLGE QSAKSLRGTR 
MDDVIFAGSA RRRPVGMAEV TLYLDNSDAS LPLDFQEVAI TRRFFRSGEG EYYINKVPCR
LKDIQELLLD TGSGRGGLAI VSQGRLEEIL AARPEERRSV LEETAGIARY RLRKKEARQR
LEAVEQDLTR LQDLIGELKD QLGPAAVEAA RARRHQKLMA LLNLVELILK SREMAESQNR
LRRIRERWQA LKSQEEELEA REEELAGRSR DIRERLARNQ EARERSRVEL QGLREQLVQV
RGRLSLVDEK LAALARQRVE DREREGLLAR EEEKLRAAAA ELARQVETGR EEMAALEQDL
AAGRETREKL RAERDELAAR LARLKEDLFQ VAHERAGCHN ELVRLEEKQA GMERVLEQKQ
RQLQELNNER ERLEGLLRAG EEQLGEIEAN LKALEGKKAS LETELPLQEA DLAAREKHLA
GLKEQQRLLV ARLKVLRQAQ ADYEGFGEGV RAILQARSRG EAACAGVLGV VVEKIEVPGE
LTRAIEVALG GAAQQVLVRT ASEAERVIQF LKSRRHGRAT ILPLAWLEPR RWPNWAGWVL
NEPGVVGVAA ALVRSEAEIR PAVDYLLGQI LVVADLRRAL DLGERLRPPV RLVTLEGEVI
QPRGPVTGGN TRQRAGFLQR RLEIQQGETE LANLVARLND ARQQARKLAS TLETGRQELR
RVTEALIARR GELHNLLQRL SEYKDQLARL AEKTAVLGEE LARSTTDSRE LVTSRREREE
LLTRLEAREG ELQGELTGCQ EQLNACQQAL AAVEQELAVN ETRQQALAKA GEQLAARVEE
LARQKENWRR QQAELAARMT AAATATNELQ ENREKLAREE EWLAGAIQQA EEGLQRLDND
GSACSQQQEE LARELEELRA RKGKIAAHRQ QEELNLARLE TTLEGSRAEL EERFGPGWQE
VLQKPRRHLE KEAPRLRRVL QEKLAALGEV NPGAPRVYEG LRRRFEELEQ QRQDLEEGRA
ALEQVIAEME KLMARQLRAT LTAVQEHFAA LFRELFEGGE ASLELTGSDN ILEAGLEIIA
RPPGKKPQHL ALLSGGEKAL TAVAFIFALL KVKPSAFCIF DEVDTALDEA NVERFARLLR
QFASRTQFIV ISHRQGTMAA ADVLYGVTMM EQGVSRLVSV RLEQLPA
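
The protein sequence above follows from the gene sequence by codon-by-codon translation. The minimal sketch below shows that relationship for sequences written in the space-separated 10-base blocks used on this page. It is an illustration, not the tool that produced the record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGTTCCTTA AGGGTATTGA GATTCAGGGC"))  # MFLKGIEIQG
    # Last 24 bases, ending in the TAA stop codon.
    print(translate("CGGCTGGAAC AACTGCCGGC TTAA"))         # RLEQLPA
```

The two printed fragments match the first ten and last seven residues of the protein sequence listed above.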