Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1437 |
Symbol | |
ID | 7408095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1517692 |
End bp | 1520520 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643715800 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_002573308 |
Protein GI | 222529426 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAG AGTATATAGT TATAAAAGGT GCAAAAGAAC ACAACCTCAA AAATATTGAT TTGGTACTTC CACGAGACAA ACTTATTGTT TTTACTGGTC TTTCTGGTTC AGGAAAGTCA TCTTTGGCGT TTGATACAAT CTATGCTGAG GGACAGAGAA GATATATAGA ATCTCTCTCT TCTTATGCAA GGCAATTTTT AGGAATGATG GAAAAACCAG ATGTAGAATA CATTGAAGGA CTTTCTCCGG CAATTTCAAT TGACCAAAAG ACAACCTCTA AAAATCCACG TTCGACTGTT GGAACAATTA CTGAAATTTA CGATTATTTG AGGCTTTTGT TTGCAAGAGT TGGAAAACCT CATTGCTATA TATGTGGAAA ACCTATCTCC CAGCAAACAG TTGACCAGAT GGTAGACGAG GTATTGAAAC TTAAAGAGGG GACAAAGATC CAAATACTTG CGGCAGTTGT AAGAGGAAGA AAAGGTGAGT ATCAGAAACT GTTTGAAGAC CTGAGAAGGA GCGGATTTGC AAGAGTTAGA GTAGATGGTA TTGTATATGA ACTTGAAGAA GAGATAAAAC TTGATAAGAA CAAAAAACAT AGTATTGACG TCATTGTGGA TAGGCTCATT GTAAAAGAGG GAATAGAATC AAGGCTTGCG GGTTCAATAG AAACAGCGCT CCAGCTCGCA GGGGGGATTG TAACTGTATC TATTGTTGAT GGCGATGAAA TTGTGTTTTC CCAAAACTTT GCATGTGTAG ACTGTGGAGT TTCGTATGAA GAGATAACTC CACGTCTTTT TTCTTTCAAC ACACCATATG GTGCATGCCC AACGTGTATG GGTCTTGGTT ATTTGCAAAA GGTAGACCCT GACCTATTAA TTCCAGATAA ATCTATTCCA ATAGGTCAGG TTGAAATAAA TGGGTGGAAC TTTACTGAAA CAAATTCATA TTCAAGAATG ATTTTGGAAT CACTTGCAAA AGAGTATAAT TTTAGTTTAA ATACTCCTGT TGAAAAACTG GACAAGAAAA TATTAGATAT CTTTTTATAT GGAACAGGCG AGGAGAAGAT AAAAATTTAT ACTCCACGTG GTATATACTT TGCAAAGTAT GAAGGGCTTA TAAATAATCT TGAAAGAAGA TATAAAGAGA CCCAGTCAGA ATATGTCAAG CAAGAGATTG AAGAGTATAT GAGTACGTTT ACATGTCCTG ACTGTCAAGG GAAAAGACTT AAAAAAGAGG CTTTGGCGGT TTTGATAGAG GGTAAGTCTA TAGCAGATGT TGCTGACATG ACCGTTTTAC AAGCAAAAGA GTTTCTTAAG AAGTTAAACC TTCAAGGAAA AGATAAAGTA ATTGCACAGC CGATAATAAA AGAGATTCTG GCAAGGCTGG ACTTTTTGAT AGATGTAGGA CTTGATTACT TAACCCTTTC ACGGTCAGCT GGCACACTTT CTGGTGGTGA AGCGCAGAGA ATAAGGCTTG CTACCCAGAT AGGATCTGGA CTTGTTGGAG TTTTGTATAT TCTTGATGAG CCAAGTATAG GATTGCATCA GCGTGACAAT CACAGACTAA TAAAAACCCT TAAAAAACTC AGAGATCTTG GCAACACATT GATTGTTGTG GAGCATGATG AGGATACAAT AAGGTCAGCT GATTTTATTG TAGACATTGG ACCAGGAGCA GGCGAGCATG GTGGAAGAGT GGTTGCTGCA GGGACATTGG ATGATATAAT TTCATGTGAA GAGTCTATCA CAGGACAGTA TCTTTCCGGA AAGAAAAAGA TTGAGATACC TGATAAAAGA AGAGAACCTG ATGGTAGATG GCTTACTATC AAGGGTGCAT CAGAAAATAA CCTTAAAAAT ATTGATGTTA GCTTTCCTGT TGGACTTTTT ACTTGTGTAA CTGGTGTTTC AGGCTCAGGC AAAAGTACCC TTGTAAACGA AATACTTTAT AAAGCAGCAA GTGCAATTTT GAACAAGTCC AAAGAAAAAC CAGGTAAATT TCAAGAGATA ATAGGCCTTG AACATTTTGA TAAGGTTATA AATATAGATC AATCACCTAT AGGAAGAACT CCACGTTCGA ACCCTGCAAC TTACACAGGT GTTTTTGATT ATATACGAGA AGTTTTTGCC CAAACGCCCG AGGCAAAGCT CAGGGGTTAT AAGGCAGGAA GATTTAGTTT CAATTTGAAA GGTGGGAGAT GTGAAGCTTG TTCGGGAGAT GGTATTATAA AGATAGAAAT GCATTTTTTA CCTGATGTAT ACGTACCGTG CGATGTGTGC AAAGGAAAAA GGTATAACAG AGAGACGTTA GAGGTAAAGT ACAAGGATAA GACCATTGCT GATGTGCTTG AAATGACAGT GGAAGAGGCG TTGGAATTTT TCAAGAACAT TCCGAGGATA AAATCCAAGC TTCAAACACT TTATGATGTA GGGCTTGGTT ATATAAAACT GGGTCAGCCT TCCACCACTT TATCTGGTGG AGAAGCGCAG AGAGTAAAAC TCGCAACAGA ACTTTCTAAA AAAGCAACTG GAAGAACCCT GTATATCTTG GACGAGCCTA CAACAGGTCT TCACATGGAT GACGTCAATA AGTTAATTGC TGTCCTTCAG CGCCTTGTGG ATATGGGCAA CACAGTAATT GTAATTGAAC ACAATCTTGA TGTTATAAAA GTTGCAGATT ATATAATTGA TTTAGGACCA GAGGGTGGAG ATAAAGGCGG CGAGGTAGTT GTGTGTGGCA GCCCAGAAGA GGTTGCTATG TGCGAAAGGT CATATACAGG AATGTTTTTA AAGGAAATAT TGAAAGATAG AATTTATGCC AAAAAATAG
|
Protein sequence | MSKEYIVIKG AKEHNLKNID LVLPRDKLIV FTGLSGSGKS SLAFDTIYAE GQRRYIESLS SYARQFLGMM EKPDVEYIEG LSPAISIDQK TTSKNPRSTV GTITEIYDYL RLLFARVGKP HCYICGKPIS QQTVDQMVDE VLKLKEGTKI QILAAVVRGR KGEYQKLFED LRRSGFARVR VDGIVYELEE EIKLDKNKKH SIDVIVDRLI VKEGIESRLA GSIETALQLA GGIVTVSIVD GDEIVFSQNF ACVDCGVSYE EITPRLFSFN TPYGACPTCM GLGYLQKVDP DLLIPDKSIP IGQVEINGWN FTETNSYSRM ILESLAKEYN FSLNTPVEKL DKKILDIFLY GTGEEKIKIY TPRGIYFAKY EGLINNLERR YKETQSEYVK QEIEEYMSTF TCPDCQGKRL KKEALAVLIE GKSIADVADM TVLQAKEFLK KLNLQGKDKV IAQPIIKEIL ARLDFLIDVG LDYLTLSRSA GTLSGGEAQR IRLATQIGSG LVGVLYILDE PSIGLHQRDN HRLIKTLKKL RDLGNTLIVV EHDEDTIRSA DFIVDIGPGA GEHGGRVVAA GTLDDIISCE ESITGQYLSG KKKIEIPDKR REPDGRWLTI KGASENNLKN IDVSFPVGLF TCVTGVSGSG KSTLVNEILY KAASAILNKS KEKPGKFQEI IGLEHFDKVI NIDQSPIGRT PRSNPATYTG VFDYIREVFA QTPEAKLRGY KAGRFSFNLK GGRCEACSGD GIIKIEMHFL PDVYVPCDVC KGKRYNRETL EVKYKDKTIA DVLEMTVEEA LEFFKNIPRI KSKLQTLYDV GLGYIKLGQP STTLSGGEAQ RVKLATELSK KATGRTLYIL DEPTTGLHMD DVNKLIAVLQ RLVDMGNTVI VIEHNLDVIK VADYIIDLGP EGGDKGGEVV VCGSPEEVAM CERSYTGMFL KEILKDRIYA KK
|
| |