Gene Athe_1438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1438 
Symbol 
ID7408096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1520551 
End bp1522536 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content37% 
IMG OID643715801 
Productexcinuclease ABC subunit B 
Protein accessionYP_002573309 
Protein GI222529427 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TTAAACTTGT TTCAGACTTT AAACCAACAG GTGATCAACC AAAAGCAATT 
GAGATGTTAA CAGAAGGAAT TTTAAAGGGA GAAAAGTTTC AGACACTTTT AGGTGTTACT
GGGTCTGGCA AGACATTTAC AATGGCAAAG GTTATAGAAA ATGTTCAAAG ACCCACGCTT
GTGCTGGCAC ATAACAAGAC TTTAGCAGCA CAGCTTTGTA GTGAGTTTAG AGAATTTTTC
CCAGAAAATG CAGTGGAATT CTTTGTAAGT TACTATGACT ATTATCAGCC TGAAGCTTAT
ATTCCAGAAA CTGACACGTA TATTGAAAAG GATTCATCTA TAAATGAAGA GATTGACAAA
CTCAGACACT CAGCTACATC TGCCTTATTT GAAAGAAGAG ATGTTATAAT TGTTGCAAGT
GTATCATGTA TTTACAGTTT GGGCAGTCCT GAAGATTATT TAAATCTTAC TATTTCGTTG
CGCCCCGGCA TGACAAAAGA CAGAGATGAG GTTATAAGAG ACCTTATAAG AATGCAATAT
GAAAGAAATG ACATTGATTT TAGAAGAGGA AGATTTAGAG TAAGAGGAGA CGTACTCGAA
GTTTTCCCTG CATCTAATAC AGACAGGGCG ATAAGAATAG AATTTTTTGG GGATGAGATA
GAAAGAATCA CAGAGTTTGA TGTTGTAACA GGTGAAGTAA TTGGTCGAAG AAACCATGTT
GCAATTTTTC CAGCATCCCA CTATGTGACA ACTGCTGAAA AATTGAAAAG AGCCATAAAA
AGTATAGAAG AAGAACTTGA ACAAAGGCTA AAAGAGCTAA GAAGTATGGG TAAGCTTGTT
GAGGCTCAGA GGCTTGAGCA AAGAACGCGC TATGACATAG AGATGCTTCA GGAGATGGGT
TTTTGTAAAG GGATAGAGAA CTATTCAAGG CATTTAACTG GCAGACCACC TGGAAGTCCA
CCGTATACTT TGCTTGATTA TTTTCCAAAT GATTTCATAA TGTTCATTGA TGAGTCGCAT
GTTACAATAC CTCAAGTAAG AGCTATGTAC AATGGGGACA AAGCAAGAAA AGATACCCTT
GTAGAATATG GTTTTAGACT TCCATCTGCT TATGATAACA GACCATTGAC ATTTGAAGAG
TTCGAAGAAA AGCTCAACCA AGTAATTTTC GTAAGTGCAA CACCCGGACC GTATGAACTC
AAAAAATCTT CACGCATTGT TGAACAAATT ATAAGACCGA CAGGACTTGT TGACCCTGAA
ATTGAGGTTC ATCCTGTACA AGGTCAGATT GACCATCTGA TTGGCGAGAT ACGAAAACGA
GTGGAAAAGA ACCAGAGAGT TCTTGTCACT ACTCTTACCA AAAAGATGGC TGAAAGCCTT
ACTGAGTATT TAAAAGATGT GGGAATCAGG GTCAGATATA TGCATTCAGA CATAGACACA
ATTGAGCGTA TGCAGATTAT CAGAGATTTA CGGCTTGGAA AATTTGATGT TTTAGTAGGG
ATAAATTTGC TCAGAGAAGG TCTTGACCTT CCTGAGGTGT CACTTGTTGC CATTTTAGAT
GCTGACAAGG AAGGTTTTTT GAGGTCAGAA ACTTCGCTTA TCCAGACAAT TGGACGTGCT
GCGAGAAATG TTGATGGAAA GGTTATAATG TATGCAGATA GAATCACAAA CGCTATGCAA
AAAGCTATTG ATGAGACAAA CAGACGTAGG AAAATCCAAA TAGAATATAA TCAAAAACAC
GGCATTGTAC CTCAAACTGT AAGAAAAGGT ATAAGGCAGA TAATTGAAGC GACAGTGTCT
GTGGCTGAAG AGGAAGAGAA ATACGAAGTT GTGGAGAAAG AGATTGTAGA AAATATGACA
AAGGAAGAGA TAGAAGAATA TATCAAGGAA CTTGAACAGG AGATGAAAAA GCTTGCTATA
GAACTTGAGT TTGAAAAGGC TGCAAAAGTA AGGGACAAAA TATTTGAACT CAAAAAACTT
CTTTAA
 
Protein sequence
MKKFKLVSDF KPTGDQPKAI EMLTEGILKG EKFQTLLGVT GSGKTFTMAK VIENVQRPTL 
VLAHNKTLAA QLCSEFREFF PENAVEFFVS YYDYYQPEAY IPETDTYIEK DSSINEEIDK
LRHSATSALF ERRDVIIVAS VSCIYSLGSP EDYLNLTISL RPGMTKDRDE VIRDLIRMQY
ERNDIDFRRG RFRVRGDVLE VFPASNTDRA IRIEFFGDEI ERITEFDVVT GEVIGRRNHV
AIFPASHYVT TAEKLKRAIK SIEEELEQRL KELRSMGKLV EAQRLEQRTR YDIEMLQEMG
FCKGIENYSR HLTGRPPGSP PYTLLDYFPN DFIMFIDESH VTIPQVRAMY NGDKARKDTL
VEYGFRLPSA YDNRPLTFEE FEEKLNQVIF VSATPGPYEL KKSSRIVEQI IRPTGLVDPE
IEVHPVQGQI DHLIGEIRKR VEKNQRVLVT TLTKKMAESL TEYLKDVGIR VRYMHSDIDT
IERMQIIRDL RLGKFDVLVG INLLREGLDL PEVSLVAILD ADKEGFLRSE TSLIQTIGRA
ARNVDGKVIM YADRITNAMQ KAIDETNRRR KIQIEYNQKH GIVPQTVRKG IRQIIEATVS
VAEEEEKYEV VEKEIVENMT KEEIEEYIKE LEQEMKKLAI ELEFEKAAKV RDKIFELKKL
L