Gene Information |
Locus tag | Moth_0251 |
Symbol | |
ID | 3833214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 252738 |
End bp | 254729 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828187 |
Product | excinuclease ABC subunit B |
Protein accession | YP_429129 |
Protein GI | 83589120 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0556] Helicase subunit of the DNA excision repair complex |
TIGRFAM ID | [TIGR00631] excinuclease ABC, B subunit |
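
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon. The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(252738, 254729)
print(length_bp)                           # 1992
print(expected_protein_length(length_bp))  # 663
```

With the values in this record, the result agrees with the listed Gene Length (1992 bp) and Protein Length (663 aa).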
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCCT TTATTTTAAA ATCCGACTAC CAGCCCCGGG GCGACCAGCC CCGGGCCATA GCCGCCCTGG TGGAGGGGCT AAAAAAAGGT TACCGGCACC AGACCTTACT CGGAGCCACC GGTACCGGCA AGACCTATAC CATGGCCCAG GTCATTCAGG CCGTACAGCG GCCGACCCTG GTCCTGGCTC CCAACAAGAC CCTGGCGGCC CAGCTCTGCG GCGAATTTAA GGAGTTTTTC CCCGACAACG CCGTGGAGTA CTTCGTAAGC TACTACGACT ACTACCAACC GGAGGCCTAT GTGCCCCAGA CAGATACCTA TATCGAAAAG GACAGCTCCA TCAACGACGA GATCGACAAG CTGCGCCACT CGGCCACTGC CGCCCTTTTT GAACGGCGGG ATGTGATCAT CGTGGCCAGC GTCTCCTGTA TCTACGGCCT GGGCTCGCCG GAGGACTACA GCACCCTGAT GCTCTCCCTG CGGGAGGGCC AGGAGTATGA CCGGGACGCC ATTTTACGCA AACTGGTGGA CATCCAGTAC AGCCGCAATG ACTACGACTT CAAGCGGGGC ACCTTCCGCG TCCGCGGCGA CGTTATCGAG ATCTTCCCGG CCTCCTTTAC AGAGAAGGCT ATCCGAGTGG AGATGTTCGG TGACGAGATC GAGCGCCTCC TGGAGATCGA CACCCTCACC GGCGAGATCC TCGGCCGGCG CAGCCATGTA GCCGTCTTCC CGGCCAGCCA TTATGTGGTG GAAGAGGCCA AGATGGAAAG GGCCCTGGAG AGCATCCAGG CCGAACTGGA GGAGCGCTTG CGCGAGCTGC GGGCCCAGGG CAAACTCCTG GAGGCCCAGC GCCTGGAGCA GCGGACCAAT TTCGACCTGG AGATGATGCG GGAGGTCGGC TTCTGCAAGG GAATCGAGAA TTACTCCCGT CACCTGACGG GCCGGGCGCC AGGGGAGCCC CCCTACACCC TGCTGGATTA TTTTCCCGAT GACTTCCTTA TGATGATCGA TGAGTCCCAT ATCACCGTGC CCCAGATAGG GGGCATGTAC GAGGGCGACC GTTCCCGGAA AGAGACCCTG GTGGAATACG GTTTTCGCCT GCCTTCGGCC CTGGACAATC GGCCCCTGAC CTTTGAGGAG TTCTGCCGGC ATATCAACCA GGTAATTTAC GTCTCGGCCA CGCCGGGCCC CTATGAGCTG GAGCACTCCC AGCAGGTGGT GGAGCAGATC ATCCGGCCCA CCGGGCTGGT GGACCCGGAG GTCCTGGTGC GGCCGGTAAA GGGTCAGATT GACGACCTCC TGGCGGAAAT CCAGAAGCGG GTGGCTAAAA ACCAGCGCGT CCTGGTGACC ACCCTGACCA AGCGCATGGC GGAGGACCTG ACGGACTACC TGCGGGAAGC CGGCCTCAAG GTGCGCTATA TGCACTCCGA CATCGGCGCC ATCGAGAGGA TGGAGATCAT CCGCGACCTG CGGCTGGGAG TCTTTGATGT CCTGGTAGGT ATTAACCTGT TGCGCGAGGG CCTGGACCTG CCGGAGGTCT CCCTGGTGGC CATCCTGGAC GCCGATAAAG AGGGCTTCCT GCGCTCGGAG CGCTCCCTCA TCCAGACTAT CGGCCGGGCG GCCCGCAACG CCGAGGGCCA GGTTATCATG TATGCCGATA CCATCACCGA CTCCATGCGG CGGGCTATAG ATGAGACCAA CCGCCGCCGT CAGATCCAGA TGGAGTACAA CCGCCGGCAC GGCATCACCC CCCGTACCGT CAGCAAGCCG GTACGGGAGG TCATCGAGGC CACCCGGGCG 
GCCGAGGAGC CGGCCCGATA CGAGACCGCG GGCGAAGGCG CCAGGAAGAA GACCAAATTG AGCAAACGGG AACTGAAGGC CCTTATCAGT CAGCTGGAGA AAGAGATGCG GGCGGCGGCC AAAAGGCTGG AATTTGAAAG GGCCGCCGAG CTGCGGGACG CCATCCTGGA GCTTCGGCTA CAAGCAGGGT AA
|
Protein sequence | MPPFILKSDY QPRGDQPRAI AALVEGLKKG YRHQTLLGAT GTGKTYTMAQ VIQAVQRPTL VLAPNKTLAA QLCGEFKEFF PDNAVEYFVS YYDYYQPEAY VPQTDTYIEK DSSINDEIDK LRHSATAALF ERRDVIIVAS VSCIYGLGSP EDYSTLMLSL REGQEYDRDA ILRKLVDIQY SRNDYDFKRG TFRVRGDVIE IFPASFTEKA IRVEMFGDEI ERLLEIDTLT GEILGRRSHV AVFPASHYVV EEAKMERALE SIQAELEERL RELRAQGKLL EAQRLEQRTN FDLEMMREVG FCKGIENYSR HLTGRAPGEP PYTLLDYFPD DFLMMIDESH ITVPQIGGMY EGDRSRKETL VEYGFRLPSA LDNRPLTFEE FCRHINQVIY VSATPGPYEL EHSQQVVEQI IRPTGLVDPE VLVRPVKGQI DDLLAEIQKR VAKNQRVLVT TLTKRMAEDL TDYLREAGLK VRYMHSDIGA IERMEIIRDL RLGVFDVLVG INLLREGLDL PEVSLVAILD ADKEGFLRSE RSLIQTIGRA ARNAEGQVIM YADTITDSMR RAIDETNRRR QIQMEYNRRH GITPRTVSKP VREVIEATRA AEEPARYETA GEGARKKTKL SKRELKALIS QLEKEMRAAA KRLEFERAAE LRDAILELRL QAG
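
The relationship between the gene sequence and the protein sequence above can be sketched as a codon-by-codon translation. This is an illustrative sketch only, not the pipeline that produced the record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it strips the spaces that separate the 10-base blocks. The name `translate` is a hypothetical helper.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order.
# Assumption: the same assignments apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCCGCCCT TTATTTTAAA ATCCGACTAC"))  # MPPFILKSDY
```

Applied to the first 30 bases of the gene sequence, the sketch reproduces the first ten residues of the listed protein sequence.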
|
| |