Gene Moth_0251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0251 
Symbol 
ID3833214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp252738 
End bp254729 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content62% 
IMG OID637828187 
Productexcinuclease ABC subunit B 
Protein accessionYP_429129 
Protein GI83589120 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCT TTATTTTAAA ATCCGACTAC CAGCCCCGGG GCGACCAGCC CCGGGCCATA 
GCCGCCCTGG TGGAGGGGCT AAAAAAAGGT TACCGGCACC AGACCTTACT CGGAGCCACC
GGTACCGGCA AGACCTATAC CATGGCCCAG GTCATTCAGG CCGTACAGCG GCCGACCCTG
GTCCTGGCTC CCAACAAGAC CCTGGCGGCC CAGCTCTGCG GCGAATTTAA GGAGTTTTTC
CCCGACAACG CCGTGGAGTA CTTCGTAAGC TACTACGACT ACTACCAACC GGAGGCCTAT
GTGCCCCAGA CAGATACCTA TATCGAAAAG GACAGCTCCA TCAACGACGA GATCGACAAG
CTGCGCCACT CGGCCACTGC CGCCCTTTTT GAACGGCGGG ATGTGATCAT CGTGGCCAGC
GTCTCCTGTA TCTACGGCCT GGGCTCGCCG GAGGACTACA GCACCCTGAT GCTCTCCCTG
CGGGAGGGCC AGGAGTATGA CCGGGACGCC ATTTTACGCA AACTGGTGGA CATCCAGTAC
AGCCGCAATG ACTACGACTT CAAGCGGGGC ACCTTCCGCG TCCGCGGCGA CGTTATCGAG
ATCTTCCCGG CCTCCTTTAC AGAGAAGGCT ATCCGAGTGG AGATGTTCGG TGACGAGATC
GAGCGCCTCC TGGAGATCGA CACCCTCACC GGCGAGATCC TCGGCCGGCG CAGCCATGTA
GCCGTCTTCC CGGCCAGCCA TTATGTGGTG GAAGAGGCCA AGATGGAAAG GGCCCTGGAG
AGCATCCAGG CCGAACTGGA GGAGCGCTTG CGCGAGCTGC GGGCCCAGGG CAAACTCCTG
GAGGCCCAGC GCCTGGAGCA GCGGACCAAT TTCGACCTGG AGATGATGCG GGAGGTCGGC
TTCTGCAAGG GAATCGAGAA TTACTCCCGT CACCTGACGG GCCGGGCGCC AGGGGAGCCC
CCCTACACCC TGCTGGATTA TTTTCCCGAT GACTTCCTTA TGATGATCGA TGAGTCCCAT
ATCACCGTGC CCCAGATAGG GGGCATGTAC GAGGGCGACC GTTCCCGGAA AGAGACCCTG
GTGGAATACG GTTTTCGCCT GCCTTCGGCC CTGGACAATC GGCCCCTGAC CTTTGAGGAG
TTCTGCCGGC ATATCAACCA GGTAATTTAC GTCTCGGCCA CGCCGGGCCC CTATGAGCTG
GAGCACTCCC AGCAGGTGGT GGAGCAGATC ATCCGGCCCA CCGGGCTGGT GGACCCGGAG
GTCCTGGTGC GGCCGGTAAA GGGTCAGATT GACGACCTCC TGGCGGAAAT CCAGAAGCGG
GTGGCTAAAA ACCAGCGCGT CCTGGTGACC ACCCTGACCA AGCGCATGGC GGAGGACCTG
ACGGACTACC TGCGGGAAGC CGGCCTCAAG GTGCGCTATA TGCACTCCGA CATCGGCGCC
ATCGAGAGGA TGGAGATCAT CCGCGACCTG CGGCTGGGAG TCTTTGATGT CCTGGTAGGT
ATTAACCTGT TGCGCGAGGG CCTGGACCTG CCGGAGGTCT CCCTGGTGGC CATCCTGGAC
GCCGATAAAG AGGGCTTCCT GCGCTCGGAG CGCTCCCTCA TCCAGACTAT CGGCCGGGCG
GCCCGCAACG CCGAGGGCCA GGTTATCATG TATGCCGATA CCATCACCGA CTCCATGCGG
CGGGCTATAG ATGAGACCAA CCGCCGCCGT CAGATCCAGA TGGAGTACAA CCGCCGGCAC
GGCATCACCC CCCGTACCGT CAGCAAGCCG GTACGGGAGG TCATCGAGGC CACCCGGGCG
GCCGAGGAGC CGGCCCGATA CGAGACCGCG GGCGAAGGCG CCAGGAAGAA GACCAAATTG
AGCAAACGGG AACTGAAGGC CCTTATCAGT CAGCTGGAGA AAGAGATGCG GGCGGCGGCC
AAAAGGCTGG AATTTGAAAG GGCCGCCGAG CTGCGGGACG CCATCCTGGA GCTTCGGCTA
CAAGCAGGGT AA
 
Protein sequence
MPPFILKSDY QPRGDQPRAI AALVEGLKKG YRHQTLLGAT GTGKTYTMAQ VIQAVQRPTL 
VLAPNKTLAA QLCGEFKEFF PDNAVEYFVS YYDYYQPEAY VPQTDTYIEK DSSINDEIDK
LRHSATAALF ERRDVIIVAS VSCIYGLGSP EDYSTLMLSL REGQEYDRDA ILRKLVDIQY
SRNDYDFKRG TFRVRGDVIE IFPASFTEKA IRVEMFGDEI ERLLEIDTLT GEILGRRSHV
AVFPASHYVV EEAKMERALE SIQAELEERL RELRAQGKLL EAQRLEQRTN FDLEMMREVG
FCKGIENYSR HLTGRAPGEP PYTLLDYFPD DFLMMIDESH ITVPQIGGMY EGDRSRKETL
VEYGFRLPSA LDNRPLTFEE FCRHINQVIY VSATPGPYEL EHSQQVVEQI IRPTGLVDPE
VLVRPVKGQI DDLLAEIQKR VAKNQRVLVT TLTKRMAEDL TDYLREAGLK VRYMHSDIGA
IERMEIIRDL RLGVFDVLVG INLLREGLDL PEVSLVAILD ADKEGFLRSE RSLIQTIGRA
ARNAEGQVIM YADTITDSMR RAIDETNRRR QIQMEYNRRH GITPRTVSKP VREVIEATRA
AEEPARYETA GEGARKKTKL SKRELKALIS QLEKEMRAAA KRLEFERAAE LRDAILELRL
QAG