Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1002 |
Symbol | |
ID | 7407903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1099476 |
End bp | 1100840 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643715367 |
Product | exodeoxyribonuclease VII, large subunit |
Protein accession | YP_002572876 |
Protein GI | 222528994 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1570] Exonuclease VII, large subunit |
TIGRFAM ID | [TIGR00237] exodeoxyribonuclease VII, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000181465 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGATAA GCAATATAGT TTCTAAGAAA GAATGGAGCG TTTATGAGCT TACAAGCTAT TTAAAAAAGA AAGTTGAAAT GGATGTGCTT TTGAAAAACA TATACCTAAA AGGTGAAGTT ATAAGACCTT CAGTTTCAGG TGACCATCTT TATTTTGAAC TGAAGGATTT AGAATATGAT GCAAAAATAA AGTGTGTATT TTTTTGGTTT GACAAAAATG TTGAGATAAA GCATGGCTCA AAAGTGCTTG TAAAGGGCAA TGTCATTTTT TATGAAAAAG AAGGGATAAT TGAGCTAAAG GTAAGCGAAA TCACTGATAT AGGACTTGGA GAGCTGTTTG TAAAATTAAA ACAACTTGAA GAAAAGCTAA GACAGGAGGG ACTTTTTGAT TCAAAATACA AAAAAGAAAT TCCACGTTAT CCTAAAAAAG TGGGAATAGT TACTTCAAAA AATGGCGCAG CAATCAGGGA CATTCTTAAT ACAATTTATA CCCGATTTGA AAACATTCAG GTATATATTT ACAGCTGCTC AGTTCAGGGA CAAAACGCTC CATATGAGAT TTGCGAAGGA ATAGAATATT TTAACACCGA GGAGCCTGTT GAGGTTATTA TTGTTGGACG TGGTGGCGGC GCATTTGAAG ATTTAATGGC GTTCAACAAT GAGATGGTAG TAAGGAAGAT ATTTGAATCT AAAATTCCTA TTATATCAGC GGTAGGGCAT GAGAGGGACT ATGTTTTAAG TGACTTTGTT GCGGACATGA GGGCTATAAC TCCCACCAAT GCCGGCGAAA TGGTGGTGAG TTTTCAGAAA CAGGCACTGG AGAAACTGAT TGAATATCAA AAGAAAATGA AAAGTGCAAT TGAAAAGAAA TTTAATAATG TCAAAGAGAA AGTAGGAATA CTTCAATACA AATTATACCA AAATTCGCCA GCAAATACCG TTGCAAAACG TGCACAGGAT ATTGATTTGT ATTGCCACAA GCTTTCTTTT GCAATTAGTA GAAAGCTCCA TGAAGCTCAT AGAAATTTGA AGAATTTTGA AAAAAGATTG GCTGATTTGA ATCCTGAATC AAGGCTTTCG ATTGCGAGAA CAAATTTTGA TATATGTAGC AAAAGGCTTG GGGAAGCTTT CAAGAAGATT TTTCAGCAAA AAGAGTTTGC ATATAAGGTA AATCTGGAAA AATTAATTGC TCTAAATCCT CTTAATGTTT TAAAGAGGGG GTATTCTATA ACATTACATA ATTCTAAGAT TTTGACTTCC ATTTCGCAGG TAAGTAATGG AGATGAGATA GTTACTCAAC TTTCAGATGG TATAATAAAA TCTAAGGTGT TTTTCAAGCA AAAAGGAGCT GAAAGTGATG TGTGA
|
Protein sequence | MMISNIVSKK EWSVYELTSY LKKKVEMDVL LKNIYLKGEV IRPSVSGDHL YFELKDLEYD AKIKCVFFWF DKNVEIKHGS KVLVKGNVIF YEKEGIIELK VSEITDIGLG ELFVKLKQLE EKLRQEGLFD SKYKKEIPRY PKKVGIVTSK NGAAIRDILN TIYTRFENIQ VYIYSCSVQG QNAPYEICEG IEYFNTEEPV EVIIVGRGGG AFEDLMAFNN EMVVRKIFES KIPIISAVGH ERDYVLSDFV ADMRAITPTN AGEMVVSFQK QALEKLIEYQ KKMKSAIEKK FNNVKEKVGI LQYKLYQNSP ANTVAKRAQD IDLYCHKLSF AISRKLHEAH RNLKNFEKRL ADLNPESRLS IARTNFDICS KRLGEAFKKI FQQKEFAYKV NLEKLIALNP LNVLKRGYSI TLHNSKILTS ISQVSNGDEI VTQLSDGIIK SKVFFKQKGA ESDV
|
| |