Gene Information |
Locus tag | Athe_1600 |
Symbol | |
ID | 7409430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1695063 |
End bp | 1697957 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643715969 |
Product | protein of unknown function DUF1156 |
Protein accession | YP_002573467 |
Protein GI | 222529585 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
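The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table for locus tag Athe_1600.
start_bp = 1695063
end_bp = 1697957
gene_length_bp = 2895
protein_length_aa = 964


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both computed values should agree with the reported fields.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the reported Gene Length (2895 bp) and Protein Length (964 aa) are consistent with the Start bp and End bp values.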
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000970911 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAAAT CTTTAATAGA GGTTCAATTT CCAGTTTCAA AACTTTCAAA GGAGGCATTT AAAGAGCGAA AAGCAGGTGC AGGACAGACA TTAACAGGAC TTGGCAAATG GTGGGGCAGA AAACCTCTTG TGCTTGTGCG TGCACTACTT TTGGGAGTTC TTTTGCCTGC AACAGATGAC CCCAAAAAAG ATATGAAAAT TTTTCTTAAG CTTATGACTA TGGATGAAGA GGGCTTGAAG CTGAGAAGAA AAAGTAGCAT ATCAGCAAAG GATGCTTATG AGTTTGCAAC AGAAGAAGAA AAGGCAAAAT ATTTTGATGT TGCAGATGAG GGGAAAATAA GCTACAAAAA AGATTTAAAA AAAGCTGAGA GGGAAGGATT TCAAGAAAGA ATATTTAAAA GGATGCCATA TGAACAGAAG CTAAGATATT GCAAGCGTCC CGAAGAGATA GAAAATCTGC CTAAGAGCGC TTGGGATGAG ATAAATGAGC ATCTTGGAAC AAATGCATAT TCATATCAAG CCCTTGTTGA AGAGCTTGGC AAAAAAAGAT TTGGACAACT TCCAACAGTT GGAGATTGTT TTTGCGGTGG AGGGAGTATT CCATTTGAAG CAGCAAGGCT TGGGTTTGGT GTTTTTGCGT CAGACCTAAA TCCAATTGCA ATGCTTCTTA CGTGGGCAGC TTTGAATCTT TTGAGCCTGC CCGAAGATGC GATTGAAAAG CTAAAAGATT TTCAAAAAAG AATATTTGAG CAGGCAGACA AGATTGTAAC TCAGTGGCAG ATTGAGCACA ACTCAAAAGG GCATAGAGCA AATGCATATT TATACTGCGT TGAGACAATT TGTCCTGAAT GTGGGTTCAA AGTTCCTCTA CTACCTTCTT TGGTAATTGG CAAAAATTCT AAAACTATTG CTGTGTTGCA CGAAAACCCA GCTAAAAAAG GGTTTGATAT AGAAATCAAA ACGAAAGTCA GCCAATCAGA GCTCGAACAA GCAGCTAAAA ACGGAACTGT AAAGGATGGT TATTTAATTT GTCCACATTG TAAAATGGAA ACAAGTATTT CGTCAATTAG GGGAGATAAG GTTGATGAGA GTAGCAAAAC AATTTGGGGG CTAAGACGCT GGGAAAAACA TGAATTTGTG CCAAGAGAAG ATGATGTTTT TCAAGAAAGA TTGTATTGCA TAAGATATGA GGATGAAAAA GGGCAAAGAT ACTACAAAGC TCCGGATGAT GAGGATTTTG AAAGGGAAAA GAAAGTTATA GAACTTTTGA AAGAAAGATT TGAGGAGTGG CAGCAAAAAG GATATATTCC AAGTGATATG ATTGAAGAGG GTGAAGAAAC AAGCAGATTG TACAGAGAAA GAGGTTGGGC ATATTGGCAT CAGCTTTTCA ATCTACGACA GTTACTTTTA CATGGGCTTT TGATGGAACT GATTGACAAA AAGGCAAAGA CGAAAGAGGA GAAGATTGTG GGGCTACTGG GGGTTAATAG GTGTGTTACG TGGAATTCTA AGTTATGTCT ATGGGACAAT ACGCGTGAGG ATAATGGGAA GAACACATTT TACAATCAGG CATTGAATAC GTTGTTCAAT TTTAATGTAA GAGGATTGAC TGTTTTACCA TGGTTTTTAG ACAGTTTAAA GCCTTATTTG TTGTCAAATA ATCACAAAAT AGTTTATCCT ACCGATGCCC GCGATGTAAA TCAAGCCTGC CACATTTGGA TAACAGACCC GCCTTATGCT GATGCAATAA ATTACCATGA GCTCTCTGAG TTCTTCTTAG CATGGGATAA GAAGTTTTTG 
AAAGAAGTTT TTCCAGATTG GTATACAGAC AGCAAAAGGG CATTGGCAGT TCGGGGTGAC CGCGAACTGT TTAAAACTGC TTTTACGGAG ATTCTTAAAA ACATAGTTTC TAATATGCCT GAAAATGGCT ATTTTGTCCT CATGTTTACA CACCAGGACT CACAGGTTTT TGCAGACCTA ACAGAAATTT TGCTTAACTC AGGGCTTTTG TCTGTCAATG CATGGAGCAT TGCAACAGAG ACAGAGGACA ATATGTCAGA GGGCAATTTT GTCCAGTCAA CTGTGTGTGT TGTTTTAAAG AAGATTGATA GAACTCAACT TGAGCCTGTA TTTATTGAAG AGCTATATCC TTTTGGCAAA GAAGAGGTAG AAAGACAGAT AAAGCTCATG TATGAGCTTG ACAAAGATGA AGCAGAGCCT AATTTTAGTC CAACAGATTT GGAGCTGTCA GGCTACTATG CAGCATTGCG TGTTTTGACA TCCTGCAATC TTAAAGCAAC AAACCAGAAG ATTAAAGAGT TTTTGGATTC CATGCGCGAA TATGCAAGCA GCTACATAGT ACCAGAAGGT TTGAAATACC TTGGGTTTGA CCAAGATACC ATTTACGAAA TTTGGCGCAA GATGGAAAGC TATGAGAAGT TTTATATAAG AGGCATCGAA TTTGAAACAA GGGGTGAAAA AAGAATAGGT GCATATCAAG ATGCTGCAAG GAGTCTTGGT GTTGCTGATT ATGATGAACT TTTTGCAATT AAAAAATCAA ATTCGGCAAG ACTGAAGACA GCAAGCGAGC TTGGGCAGGG ACTTCTTGAT ACCAAACATG CGTTTAGTAC AACAATCTTG CGCTTGTGTC TTTTAGCGAT AAATAGCGCA ATAAAAAAAG ACCAGGAAAT AAATGACACT GCCGAGGCGG TTGCACTTTC ACATGAGATG TTAAAGACCA AACTTGGAAC AAAATACTGG AACAACAAAA CCAAGATAGA GATAATATTC AGATACCTTG CAAGGCTTGA GAAAATTGAT GGCATGGAAC ACTGGCAAAA CGACTCAAAA ATAGCTTCAT ATCTTGCTGA GCGTGTGGCA AACGATAGAC TGTAA
|
Protein sequence | MEKSLIEVQF PVSKLSKEAF KERKAGAGQT LTGLGKWWGR KPLVLVRALL LGVLLPATDD PKKDMKIFLK LMTMDEEGLK LRRKSSISAK DAYEFATEEE KAKYFDVADE GKISYKKDLK KAEREGFQER IFKRMPYEQK LRYCKRPEEI ENLPKSAWDE INEHLGTNAY SYQALVEELG KKRFGQLPTV GDCFCGGGSI PFEAARLGFG VFASDLNPIA MLLTWAALNL LSLPEDAIEK LKDFQKRIFE QADKIVTQWQ IEHNSKGHRA NAYLYCVETI CPECGFKVPL LPSLVIGKNS KTIAVLHENP AKKGFDIEIK TKVSQSELEQ AAKNGTVKDG YLICPHCKME TSISSIRGDK VDESSKTIWG LRRWEKHEFV PREDDVFQER LYCIRYEDEK GQRYYKAPDD EDFEREKKVI ELLKERFEEW QQKGYIPSDM IEEGEETSRL YRERGWAYWH QLFNLRQLLL HGLLMELIDK KAKTKEEKIV GLLGVNRCVT WNSKLCLWDN TREDNGKNTF YNQALNTLFN FNVRGLTVLP WFLDSLKPYL LSNNHKIVYP TDARDVNQAC HIWITDPPYA DAINYHELSE FFLAWDKKFL KEVFPDWYTD SKRALAVRGD RELFKTAFTE ILKNIVSNMP ENGYFVLMFT HQDSQVFADL TEILLNSGLL SVNAWSIATE TEDNMSEGNF VQSTVCVVLK KIDRTQLEPV FIEELYPFGK EEVERQIKLM YELDKDEAEP NFSPTDLELS GYYAALRVLT SCNLKATNQK IKEFLDSMRE YASSYIVPEG LKYLGFDQDT IYEIWRKMES YEKFYIRGIE FETRGEKRIG AYQDAARSLG VADYDELFAI KKSNSARLKT ASELGQGLLD TKHAFSTTIL RLCLLAINSA IKKDQEINDT AEAVALSHEM LKTKLGTKYW NNKTKIEIIF RYLARLEKID GMEHWQNDSK IASYLAERVA NDRL
|
| |