Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1438 |
Symbol | |
ID | 7408096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1520551 |
End bp | 1522536 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643715801 |
Product | excinuclease ABC subunit B |
Protein accession | YP_002573309 |
Protein GI | 222529427 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0556] Helicase subunit of the DNA excision repair complex |
TIGRFAM ID | [TIGR00631] excinuclease ABC, B subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TTAAACTTGT TTCAGACTTT AAACCAACAG GTGATCAACC AAAAGCAATT GAGATGTTAA CAGAAGGAAT TTTAAAGGGA GAAAAGTTTC AGACACTTTT AGGTGTTACT GGGTCTGGCA AGACATTTAC AATGGCAAAG GTTATAGAAA ATGTTCAAAG ACCCACGCTT GTGCTGGCAC ATAACAAGAC TTTAGCAGCA CAGCTTTGTA GTGAGTTTAG AGAATTTTTC CCAGAAAATG CAGTGGAATT CTTTGTAAGT TACTATGACT ATTATCAGCC TGAAGCTTAT ATTCCAGAAA CTGACACGTA TATTGAAAAG GATTCATCTA TAAATGAAGA GATTGACAAA CTCAGACACT CAGCTACATC TGCCTTATTT GAAAGAAGAG ATGTTATAAT TGTTGCAAGT GTATCATGTA TTTACAGTTT GGGCAGTCCT GAAGATTATT TAAATCTTAC TATTTCGTTG CGCCCCGGCA TGACAAAAGA CAGAGATGAG GTTATAAGAG ACCTTATAAG AATGCAATAT GAAAGAAATG ACATTGATTT TAGAAGAGGA AGATTTAGAG TAAGAGGAGA CGTACTCGAA GTTTTCCCTG CATCTAATAC AGACAGGGCG ATAAGAATAG AATTTTTTGG GGATGAGATA GAAAGAATCA CAGAGTTTGA TGTTGTAACA GGTGAAGTAA TTGGTCGAAG AAACCATGTT GCAATTTTTC CAGCATCCCA CTATGTGACA ACTGCTGAAA AATTGAAAAG AGCCATAAAA AGTATAGAAG AAGAACTTGA ACAAAGGCTA AAAGAGCTAA GAAGTATGGG TAAGCTTGTT GAGGCTCAGA GGCTTGAGCA AAGAACGCGC TATGACATAG AGATGCTTCA GGAGATGGGT TTTTGTAAAG GGATAGAGAA CTATTCAAGG CATTTAACTG GCAGACCACC TGGAAGTCCA CCGTATACTT TGCTTGATTA TTTTCCAAAT GATTTCATAA TGTTCATTGA TGAGTCGCAT GTTACAATAC CTCAAGTAAG AGCTATGTAC AATGGGGACA AAGCAAGAAA AGATACCCTT GTAGAATATG GTTTTAGACT TCCATCTGCT TATGATAACA GACCATTGAC ATTTGAAGAG TTCGAAGAAA AGCTCAACCA AGTAATTTTC GTAAGTGCAA CACCCGGACC GTATGAACTC AAAAAATCTT CACGCATTGT TGAACAAATT ATAAGACCGA CAGGACTTGT TGACCCTGAA ATTGAGGTTC ATCCTGTACA AGGTCAGATT GACCATCTGA TTGGCGAGAT ACGAAAACGA GTGGAAAAGA ACCAGAGAGT TCTTGTCACT ACTCTTACCA AAAAGATGGC TGAAAGCCTT ACTGAGTATT TAAAAGATGT GGGAATCAGG GTCAGATATA TGCATTCAGA CATAGACACA ATTGAGCGTA TGCAGATTAT CAGAGATTTA CGGCTTGGAA AATTTGATGT TTTAGTAGGG ATAAATTTGC TCAGAGAAGG TCTTGACCTT CCTGAGGTGT CACTTGTTGC CATTTTAGAT GCTGACAAGG AAGGTTTTTT GAGGTCAGAA ACTTCGCTTA TCCAGACAAT TGGACGTGCT GCGAGAAATG TTGATGGAAA GGTTATAATG TATGCAGATA GAATCACAAA CGCTATGCAA AAAGCTATTG ATGAGACAAA CAGACGTAGG AAAATCCAAA TAGAATATAA TCAAAAACAC GGCATTGTAC CTCAAACTGT AAGAAAAGGT ATAAGGCAGA TAATTGAAGC GACAGTGTCT GTGGCTGAAG AGGAAGAGAA ATACGAAGTT GTGGAGAAAG AGATTGTAGA AAATATGACA AAGGAAGAGA TAGAAGAATA TATCAAGGAA CTTGAACAGG AGATGAAAAA GCTTGCTATA GAACTTGAGT TTGAAAAGGC TGCAAAAGTA AGGGACAAAA TATTTGAACT CAAAAAACTT CTTTAA
|
Protein sequence | MKKFKLVSDF KPTGDQPKAI EMLTEGILKG EKFQTLLGVT GSGKTFTMAK VIENVQRPTL VLAHNKTLAA QLCSEFREFF PENAVEFFVS YYDYYQPEAY IPETDTYIEK DSSINEEIDK LRHSATSALF ERRDVIIVAS VSCIYSLGSP EDYLNLTISL RPGMTKDRDE VIRDLIRMQY ERNDIDFRRG RFRVRGDVLE VFPASNTDRA IRIEFFGDEI ERITEFDVVT GEVIGRRNHV AIFPASHYVT TAEKLKRAIK SIEEELEQRL KELRSMGKLV EAQRLEQRTR YDIEMLQEMG FCKGIENYSR HLTGRPPGSP PYTLLDYFPN DFIMFIDESH VTIPQVRAMY NGDKARKDTL VEYGFRLPSA YDNRPLTFEE FEEKLNQVIF VSATPGPYEL KKSSRIVEQI IRPTGLVDPE IEVHPVQGQI DHLIGEIRKR VEKNQRVLVT TLTKKMAESL TEYLKDVGIR VRYMHSDIDT IERMQIIRDL RLGKFDVLVG INLLREGLDL PEVSLVAILD ADKEGFLRSE TSLIQTIGRA ARNVDGKVIM YADRITNAMQ KAIDETNRRR KIQIEYNQKH GIVPQTVRKG IRQIIEATVS VAEEEEKYEV VEKEIVENMT KEEIEEYIKE LEQEMKKLAI ELEFEKAAKV RDKIFELKKL L
|
| |