Gene Information |
Locus tag | Athe_2114 |
Symbol | |
ID | 7408823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2241500 |
End bp | 2243590 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643716479 |
Product | KWG repeat protein |
Protein accession | YP_002573962 |
Protein GI | 222530080 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTAA AAAGAATTCA AGATAATATT TTGTACAAAT CAAGATTGAT TTTAATTATA GCAATTGTAA TTTTGTTGTT AACTGAATTT AAAGTTGCTT TTGCAAACAT TGAAATTGAA AGATTTCCAG TTAAGAAATT TGTTTATATT CACCCTCAAT TTACCGAAGT TAAATATTTA GATTGGGTAC ATGGTGATTG GTCAAGAGAC TATGAAATAA GGGTTTTGGT TAAAAAAGAT GGTAAATGGG GTATTTTTTT AAAAAAAGCA AGAGTACTTG TGAAACCTCA ATTCGATGAA ATAGAACAGC TAAGCACCGG GTTTAAGGTA AAGAAAAACG AAAAATGGGG ATTTATTGAT AATACAGTGA AGGTATTAGT TGAACCTGTA TTTGATGATG TCTATGATAT ATACAACGGT CTTTTAAAAA TAAAAGTTGG CAATAAGTTT GGTTTTGTAA ATGAAAATGG TAAGATTGAA ATTGAGCCCA AGTTTGAAGA CGCTAAATAT TTTATTGGAA ACATGGCTCC TGTAAAACAA AACGGGAAAT GGGGCATAAT TGATAAAACT GGAAGGTTTA TTGCTGAACC TATGTATGAT GAATGTGTTA TACCAAATTT ACCTCCATAT GACAAAATAA TTATAATTTC AAAAGATGAA AAATATGGTT ATGTTTCAGC TGCTGGGACT ACAGTTGTTC AGCCACAGTT TGAAGAAGTA GAACTACTAA ATGAGAACTT GGTTGCAATA AAAAAAGAAG GAAAAATCGG GTTTGCAGAT ATAAGTGGAA AAATTCTCAT TAATCCAGAG TATGATAAAT ACTATTCTAT TGTTGGAGGA TCTGATAATA TACGAATAAT AGCAGTATCT AAAAATAATC ACATTGGTGC TGTTTCTATG ACTGGACATG TTATGTTTGA ACCTGCTTAT GAAGATATCT CAGTTGTTTC AAAGAATATA TTGATCGCAA AGAAAAACGG AAAATGGGGA TTTATAACTT TCGACGGTAA GGTGAAGGTT GATTTTAAGT ATGATGAGTT TGAACGGTTG ATAGATAAAA ACTTTATATT AATTAAAAAA GGAAAAAAGG TTGGGGTTGC AAATTTGAAT GGTGAAATTA TTGCTGAGCC TCAATATGAT TATGTCGGTG ACCCCTTTTT GGGTCCGACT AAAAAAGATG CTTTGATGAC AGGCTCGAAA GGAAAAAGAG GAATTATTTA CAACAAAATT GTTGTTCCTC CTCAATTTGA TGTCATTAAA TTTTGTTCAA CCTCTAAAAA TGCTACTATA TTAACTGCTG TAAAAAAAGA TGGTAAGTGG ACTTATATAA ATAAATATGG GAAACTTATT ACTCAACCTC AGTTTGACAG TGTAGACGAA TATTTTTATT CTGGTGTAGC AAAGATTATA GAAAACAACA AAATTGGATT TATAAATGAA AATGGTAAAA TTATAACCAA ACCACAGTTT GATGGTGTTA CACCATTTGA TGACTGTGGG TTTGCAGGAG TTAATCAAAA AGGTAAGTGG GGCTTCATTG ATAAAAGCGG AAAGCTCATT ATAAAACCTC AGTTTGAAGA AATATCTAAT TTTACAGCGG ATGGTTTGGC AAGAATAAAG CTAAAAGGTA AATGGGGGTA CATTGAAAAA GGTGGGAAAG TTATAATCAA ACCTAAGTTT AATCAATTGG GTATTTTCAA AGAAGGGTTA GCTCCTGCAA AACTTGGTGG GAAATGTGGT TATATAGACA GGAAAGGTAA TTTTGCAATT AAACCGCAAT ATGAAGATGC ATTGTATTTT 
GTTGGTGATA CTGCTGCTGT CAAACTGAAT GGCAAATGGG GTTTTATTGA TAAAAAAGGC AGGTTTAAAA TAAAACCTCA ATATGATGAA GTGATAAACG TTTATATTAT GGGATATAAG GATTTAAGAG TTATAATTAA AAACAACAGA ACTGGACTAA TTGATTCGAA AGGAAATATT CTAATTGATC CAAATTTCGA ATCAATAGAA GGTGGTAATT TACAGTTTTT AGGTTATGTA CTTTTAAAAT CAACAGACAA TAAGTACGGT TTTTTGCTTG ATGAAGAATA A
|
Protein sequence | MFLKRIQDNI LYKSRLILII AIVILLLTEF KVAFANIEIE RFPVKKFVYI HPQFTEVKYL DWVHGDWSRD YEIRVLVKKD GKWGIFLKKA RVLVKPQFDE IEQLSTGFKV KKNEKWGFID NTVKVLVEPV FDDVYDIYNG LLKIKVGNKF GFVNENGKIE IEPKFEDAKY FIGNMAPVKQ NGKWGIIDKT GRFIAEPMYD ECVIPNLPPY DKIIIISKDE KYGYVSAAGT TVVQPQFEEV ELLNENLVAI KKEGKIGFAD ISGKILINPE YDKYYSIVGG SDNIRIIAVS KNNHIGAVSM TGHVMFEPAY EDISVVSKNI LIAKKNGKWG FITFDGKVKV DFKYDEFERL IDKNFILIKK GKKVGVANLN GEIIAEPQYD YVGDPFLGPT KKDALMTGSK GKRGIIYNKI VVPPQFDVIK FCSTSKNATI LTAVKKDGKW TYINKYGKLI TQPQFDSVDE YFYSGVAKII ENNKIGFINE NGKIITKPQF DGVTPFDDCG FAGVNQKGKW GFIDKSGKLI IKPQFEEISN FTADGLARIK LKGKWGYIEK GGKVIIKPKF NQLGIFKEGL APAKLGGKCG YIDRKGNFAI KPQYEDALYF VGDTAAVKLN GKWGFIDKKG RFKIKPQYDE VINVYIMGYK DLRVIIKNNR TGLIDSKGNI LIDPNFESIE GGNLQFLGYV LLKSTDNKYG FLLDEE
|
| |
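The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the listed gene sequence ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above (Athe_2114).
length_bp = gene_length(2241500, 2243590)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (2091 bp) and Protein Length (696 aa) fields. The strand does not affect the length calculation, only the orientation in which the sequence is read.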