Gene Information |
Locus tag | Athe_1862 |
Symbol | |
ID | 7408975 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1954763 |
End bp | 1956301 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 643716234 |
Product | KWG repeat protein |
Protein accession | YP_002573723 |
Protein GI | 222529841 |
COG category | |
COG ID | |
TIGRFAM ID | |
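The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length = gene_length_bp(1954763, 1956301)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1539 bp) and Protein Length (512 aa) fields.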
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.528379 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGTA AAAAGAAGAA CAAAAGAAAA AAAAACAAGC AAAAAAAACA GCAAACGAAA GTTCAATTCA AAGAATTTTT GAGGTTGCCT ATTGTAAAAA GTTTTATAAT CGTATTAATT TTGACCTTAT TGATAACGAC AATATATAGC ACAGCAACAA TTGTAATTAG AAAAAATATT GCCAAACATT TTTTCACAAT CTCAGATGAA AATTATCAAA ATGTAGCATA TCTTAACAAT AAATTGGTTA TGTTTGAAAA GAATAGTAAA TGGGGGATAA TGGATATACA AGGAAAAATA GTAGTAAAGC CTACATATGA AAGGATACTA TTAGAAAGTG AAGGAATGAT TCCAGTTAAG CTTAAAAATA AATGGGGCTT CATTGATATC TATGGAAAAG TTAAGATAAA ACCCCAATTT GACAATGTAA CTGCATTCAC AGACCAGCTT GCAGCTGTGT GTGTGAACAA CAGGTGGGGA ATTATAGATA AGTCAGGTAA ATATACTATA AAACCTCAGT ATCAAGACAT AATTATACAT CCAAACAAAA TGATTCAAAC TAAAAAGTAT AATAAATGGG GAATTATTGA TGAGAAAGGA AATCAAATTA TACCATATAA ATACAAAGAA ATACAAATTT TGAATTATAA GGGAGTCATT GTTGCTAAAG AAAAAGATAG TTACAAAATA ATATCGATCA AAAATAAAAA GGAAAGTAAG GAAAACTATA GCAACTTTTC ATTAAACATT GGGAAAATGT TACCTGTTAT GAGAAACAAT AAGTGGAGCA TTTTAAATCT TGAAACGTTA TCAGAAGTTT TTCCGTTGAT TTATGATGAA ATATGGGTCC ACAATGAAGG CTGGATAGAG CTTAAAAAAG ATAAAAAGTT GTACATTATG TTTTCTGACG GTAAAATATT AGATAAAACT TTTAAAAGGG ATTCAGTAGT TGTTTCCTCA AACAAAATGA TTTCAATAAA TCAAAATAAT GGAGTCATAA CTTTAGTAAA TCTTGATTCA CGAAAGACAG TAGATATAAA AGCACATGAT ATAACAGTAT TTAACAATGG TTTTGCGGCA GTCAAGGTTA GGGACAAGTG GGGGTTGATT GCAGAAAATG GGAAGTTTGT TATTAAGCCA AAGTATGATT CTATTTGGAT AGCAGACAAA GACATAATTG TAGTTTATCT CAATGGGAAG TGGGGGTTAG CGAAAATAAA CGGAGAAACT TTAACACCAC TGAACTATGA TTTAATAGGC GAAGTAAAAG ATGGGTACGT AGCATTTCTG AAAAATGGCA AATGGGGTGT GATGCTAAAG ACAGGCAAGA TACTGTTGAA ACCGAAATTT GATCAAATTA CACTTCATAC AAAGAACTAT ATATTTGCTA GACAAAATGA TACATGGTTT CTCATAGTAA TTAAGAATAA TAAAAAATAT TTCTATAGAT TCAAAACGTT CTCTGTGATA AGAGTTAATG ACAATATTTG GGCATATACT ACAGAGAAAG GTATGAAAGT TATTATTCTA AAAAAATAG
|
Protein sequence | MNSKKKNKRK KNKQKKQQTK VQFKEFLRLP IVKSFIIVLI LTLLITTIYS TATIVIRKNI AKHFFTISDE NYQNVAYLNN KLVMFEKNSK WGIMDIQGKI VVKPTYERIL LESEGMIPVK LKNKWGFIDI YGKVKIKPQF DNVTAFTDQL AAVCVNNRWG IIDKSGKYTI KPQYQDIIIH PNKMIQTKKY NKWGIIDEKG NQIIPYKYKE IQILNYKGVI VAKEKDSYKI ISIKNKKESK ENYSNFSLNI GKMLPVMRNN KWSILNLETL SEVFPLIYDE IWVHNEGWIE LKKDKKLYIM FSDGKILDKT FKRDSVVVSS NKMISINQNN GVITLVNLDS RKTVDIKAHD ITVFNNGFAA VKVRDKWGLI AENGKFVIKP KYDSIWIADK DIIVVYLNGK WGLAKINGET LTPLNYDLIG EVKDGYVAFL KNGKWGVMLK TGKILLKPKF DQITLHTKNY IFARQNDTWF LIVIKNNKKY FYRFKTFSVI RVNDNIWAYT TEKGMKVIIL KK
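The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a sketch, not the database's own method. It assumes the gene sequence is shown on the coding strand (it begins with ATG and ends with TAG) and uses the standard codon-to-amino-acid mapping, which translation table 11 shares for elongation codons. The helper names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11
# uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the record's sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon.

    Caveat: alternative start codons such as GTG or TTG, which table 11
    allows, are not converted to M here. This record starts with ATG.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGAATAGTA AAAAGAAGAA CAAAAGAAAA"))
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields.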
|
| |