Gene Athe_1437 details


Gene Information

Locus tag: Athe_1437
Symbol:
ID: 7408095
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1517692
End bp: 1520520
Gene Length: 2829 bp
Protein Length: 942 aa
Translation table: 11
GC content: 38%
IMG OID: 643715800
Product: excinuclease ABC, A subunit
Protein accession: YP_002573308
Protein GI: 222529426
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
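
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1517692
end_bp = 1520520
gene_length_bp = 2829
protein_length_aa = 942

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, so it encodes (codons - 1) residues.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 2829 0 942
```

Under these assumptions the span equals the listed gene length, divides evenly into codons, and gives the listed protein length.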


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAAAG AGTATATAGT TATAAAAGGT GCAAAAGAAC ACAACCTCAA AAATATTGAT 
TTGGTACTTC CACGAGACAA ACTTATTGTT TTTACTGGTC TTTCTGGTTC AGGAAAGTCA
TCTTTGGCGT TTGATACAAT CTATGCTGAG GGACAGAGAA GATATATAGA ATCTCTCTCT
TCTTATGCAA GGCAATTTTT AGGAATGATG GAAAAACCAG ATGTAGAATA CATTGAAGGA
CTTTCTCCGG CAATTTCAAT TGACCAAAAG ACAACCTCTA AAAATCCACG TTCGACTGTT
GGAACAATTA CTGAAATTTA CGATTATTTG AGGCTTTTGT TTGCAAGAGT TGGAAAACCT
CATTGCTATA TATGTGGAAA ACCTATCTCC CAGCAAACAG TTGACCAGAT GGTAGACGAG
GTATTGAAAC TTAAAGAGGG GACAAAGATC CAAATACTTG CGGCAGTTGT AAGAGGAAGA
AAAGGTGAGT ATCAGAAACT GTTTGAAGAC CTGAGAAGGA GCGGATTTGC AAGAGTTAGA
GTAGATGGTA TTGTATATGA ACTTGAAGAA GAGATAAAAC TTGATAAGAA CAAAAAACAT
AGTATTGACG TCATTGTGGA TAGGCTCATT GTAAAAGAGG GAATAGAATC AAGGCTTGCG
GGTTCAATAG AAACAGCGCT CCAGCTCGCA GGGGGGATTG TAACTGTATC TATTGTTGAT
GGCGATGAAA TTGTGTTTTC CCAAAACTTT GCATGTGTAG ACTGTGGAGT TTCGTATGAA
GAGATAACTC CACGTCTTTT TTCTTTCAAC ACACCATATG GTGCATGCCC AACGTGTATG
GGTCTTGGTT ATTTGCAAAA GGTAGACCCT GACCTATTAA TTCCAGATAA ATCTATTCCA
ATAGGTCAGG TTGAAATAAA TGGGTGGAAC TTTACTGAAA CAAATTCATA TTCAAGAATG
ATTTTGGAAT CACTTGCAAA AGAGTATAAT TTTAGTTTAA ATACTCCTGT TGAAAAACTG
GACAAGAAAA TATTAGATAT CTTTTTATAT GGAACAGGCG AGGAGAAGAT AAAAATTTAT
ACTCCACGTG GTATATACTT TGCAAAGTAT GAAGGGCTTA TAAATAATCT TGAAAGAAGA
TATAAAGAGA CCCAGTCAGA ATATGTCAAG CAAGAGATTG AAGAGTATAT GAGTACGTTT
ACATGTCCTG ACTGTCAAGG GAAAAGACTT AAAAAAGAGG CTTTGGCGGT TTTGATAGAG
GGTAAGTCTA TAGCAGATGT TGCTGACATG ACCGTTTTAC AAGCAAAAGA GTTTCTTAAG
AAGTTAAACC TTCAAGGAAA AGATAAAGTA ATTGCACAGC CGATAATAAA AGAGATTCTG
GCAAGGCTGG ACTTTTTGAT AGATGTAGGA CTTGATTACT TAACCCTTTC ACGGTCAGCT
GGCACACTTT CTGGTGGTGA AGCGCAGAGA ATAAGGCTTG CTACCCAGAT AGGATCTGGA
CTTGTTGGAG TTTTGTATAT TCTTGATGAG CCAAGTATAG GATTGCATCA GCGTGACAAT
CACAGACTAA TAAAAACCCT TAAAAAACTC AGAGATCTTG GCAACACATT GATTGTTGTG
GAGCATGATG AGGATACAAT AAGGTCAGCT GATTTTATTG TAGACATTGG ACCAGGAGCA
GGCGAGCATG GTGGAAGAGT GGTTGCTGCA GGGACATTGG ATGATATAAT TTCATGTGAA
GAGTCTATCA CAGGACAGTA TCTTTCCGGA AAGAAAAAGA TTGAGATACC TGATAAAAGA
AGAGAACCTG ATGGTAGATG GCTTACTATC AAGGGTGCAT CAGAAAATAA CCTTAAAAAT
ATTGATGTTA GCTTTCCTGT TGGACTTTTT ACTTGTGTAA CTGGTGTTTC AGGCTCAGGC
AAAAGTACCC TTGTAAACGA AATACTTTAT AAAGCAGCAA GTGCAATTTT GAACAAGTCC
AAAGAAAAAC CAGGTAAATT TCAAGAGATA ATAGGCCTTG AACATTTTGA TAAGGTTATA
AATATAGATC AATCACCTAT AGGAAGAACT CCACGTTCGA ACCCTGCAAC TTACACAGGT
GTTTTTGATT ATATACGAGA AGTTTTTGCC CAAACGCCCG AGGCAAAGCT CAGGGGTTAT
AAGGCAGGAA GATTTAGTTT CAATTTGAAA GGTGGGAGAT GTGAAGCTTG TTCGGGAGAT
GGTATTATAA AGATAGAAAT GCATTTTTTA CCTGATGTAT ACGTACCGTG CGATGTGTGC
AAAGGAAAAA GGTATAACAG AGAGACGTTA GAGGTAAAGT ACAAGGATAA GACCATTGCT
GATGTGCTTG AAATGACAGT GGAAGAGGCG TTGGAATTTT TCAAGAACAT TCCGAGGATA
AAATCCAAGC TTCAAACACT TTATGATGTA GGGCTTGGTT ATATAAAACT GGGTCAGCCT
TCCACCACTT TATCTGGTGG AGAAGCGCAG AGAGTAAAAC TCGCAACAGA ACTTTCTAAA
AAAGCAACTG GAAGAACCCT GTATATCTTG GACGAGCCTA CAACAGGTCT TCACATGGAT
GACGTCAATA AGTTAATTGC TGTCCTTCAG CGCCTTGTGG ATATGGGCAA CACAGTAATT
GTAATTGAAC ACAATCTTGA TGTTATAAAA GTTGCAGATT ATATAATTGA TTTAGGACCA
GAGGGTGGAG ATAAAGGCGG CGAGGTAGTT GTGTGTGGCA GCCCAGAAGA GGTTGCTATG
TGCGAAAGGT CATATACAGG AATGTTTTTA AAGGAAATAT TGAAAGATAG AATTTATGCC
AAAAAATAG
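
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any calculation such as GC content. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCAAAAG AGTATATAGT TATAAAAGGT GCAAAAGAAC ACAACCTCAA AAATATTGAT"

print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 3))
```

Passing the full gene sequence to `gc_fraction` in the same way should give a value comparable to the GC content field listed under Gene Information.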
 
Protein sequence
MSKEYIVIKG AKEHNLKNID LVLPRDKLIV FTGLSGSGKS SLAFDTIYAE GQRRYIESLS 
SYARQFLGMM EKPDVEYIEG LSPAISIDQK TTSKNPRSTV GTITEIYDYL RLLFARVGKP
HCYICGKPIS QQTVDQMVDE VLKLKEGTKI QILAAVVRGR KGEYQKLFED LRRSGFARVR
VDGIVYELEE EIKLDKNKKH SIDVIVDRLI VKEGIESRLA GSIETALQLA GGIVTVSIVD
GDEIVFSQNF ACVDCGVSYE EITPRLFSFN TPYGACPTCM GLGYLQKVDP DLLIPDKSIP
IGQVEINGWN FTETNSYSRM ILESLAKEYN FSLNTPVEKL DKKILDIFLY GTGEEKIKIY
TPRGIYFAKY EGLINNLERR YKETQSEYVK QEIEEYMSTF TCPDCQGKRL KKEALAVLIE
GKSIADVADM TVLQAKEFLK KLNLQGKDKV IAQPIIKEIL ARLDFLIDVG LDYLTLSRSA
GTLSGGEAQR IRLATQIGSG LVGVLYILDE PSIGLHQRDN HRLIKTLKKL RDLGNTLIVV
EHDEDTIRSA DFIVDIGPGA GEHGGRVVAA GTLDDIISCE ESITGQYLSG KKKIEIPDKR
REPDGRWLTI KGASENNLKN IDVSFPVGLF TCVTGVSGSG KSTLVNEILY KAASAILNKS
KEKPGKFQEI IGLEHFDKVI NIDQSPIGRT PRSNPATYTG VFDYIREVFA QTPEAKLRGY
KAGRFSFNLK GGRCEACSGD GIIKIEMHFL PDVYVPCDVC KGKRYNRETL EVKYKDKTIA
DVLEMTVEEA LEFFKNIPRI KSKLQTLYDV GLGYIKLGQP STTLSGGEAQ RVKLATELSK
KATGRTLYIL DEPTTGLHMD DVNKLIAVLQ RLVDMGNTVI VIEHNLDVIK VADYIIDLGP
EGGDKGGEVV VCGSPEEVAM CERSYTGMFL KEILKDRIYA KK
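
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a rough sketch rather than a reference implementation: it builds the standard codon table, which uses the same codon-to-amino-acid assignments as translation table 11 (table 11 differs in which start codons are permitted). The function name `translate` and the sample inputs are illustrative; the inputs are the first 21 bases and the last 9 bases of the gene sequence in this record.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCAAAAG AGTATATAGT T"))  # MSKEYIV
print(translate("AAAAAATAG"))                # KK
```

The two outputs match the first seven residues (MSKEYIV) and the last two residues (KK) of the protein sequence listed above.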