Gene Information |
Locus tag | Athe_0718 |
Symbol | |
ID | 7407142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 808914 |
End bp | 810263 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 643715090 |
Product | KWG repeat protein |
Protein accession | YP_002572606 |
Protein GI | 222528724 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGAATA AAGTAAAAAT TTTGAACAGG GAAAATAATG AGTCTATCAA ATTGATTGTA GCAATTATGG TGTTAATTTT AGTACTTAAT ACAATTGGAG TTTATAGTAA GCAAAATGTG AATGTTAAAG ACAATCTAAT TATAATAGAA CCTAAGTTTA AATATATATA TCCATTTTCA GAAGGGTTAG CTGTTTTTAT AAGCGAAGAT AATAAACATG GATACTTGGA TATGAAAGGC AATATTATAA TAAAACCTAA TTTCGAAGAA GCCCATAGTT TTTCTGAAGG TATAGCAGTA GTTAAGGAAA AAGGAAAATG GAAAGTTATA GACAAAAAAG GTAATATCAT TTTAAGTTTG GATTTTGACT ATGTTGGAAG CTTCAGCGAG GGTTTAGCAC CAGTAGAAAA AGATAAAAAA TGGGGGTATA TAAACAAAAA AGGTGAAATA GTGATTGAGC CTCAATTCGA AAGTGCGGGT AATTTTTTTG AGGACAGAGC TGTAATACAG CTAAATGGAA AATTTGGCTA TATAGATAAG AGTGGAAAAA TAATAATATC ACCCAAATAT TATGTAGCTT ATGAATTTTC AAAAGGTGTT GCTGCTGTTG CAATTTTAGA TGCACAAAAA AATACAAAGT ATGGCTTTAT AGATAGAAAT GGCAAATATG TTATAGAACC AAAATATGAT TTTTTAGCAG GGTTTATTGG TAACATTTAC TATGATAAAA TTTTCAAGAA TGGGTTAGCA AGAATAAGAG TGAATAACAA GTTTGGATAT ATAAATGAAA AAGGTGGAGT AGTTATTCAA CCAAAATTTG AATATGCTCT TGAGTTTGAT AAAAATATTG CCTTAGTTTG CTACAATGGA AAATATGGAT ATATAAATAA GAAAGGAGAT TTTATAATCA ATCCAGTTTA TGAAGATATG TTATTATTTT CAGAAGGTTT AGCAGCAGCT AAGTTAAATG GTAAATGGGG TTTTATTAAT TATAAAGGTG ATTTTACTAT AAAACCTCAG TTTGAAAAAG CTTATAGTTT TTCAGAGGGA GTAGCTGCTG TAAAATTGAA AGGTAAATGG GGTTTTATTG ACAAAAAAGG TAATTTTATA ATTAAGCCTC AATTTGATGA ACCTATAATA TATTGCAGTG CTATATATAC TTTTACAAAT GGTTTAGCTG CAGTTTGTAA AAATAGAAAA TATGGATATA TTGATAAAAA TGGGAAATGG ATTGTTGAAC CTAATTATGA CGTAGCAAGT GGGTTTATGA ATGGCTTTGC TCATATTGAT AAAAATAACC TTGTAGGATT TATTGCATTA AAGAACGTGG TATTGAAAGA GAAAAACTAA
|
Protein sequence | MWNKVKILNR ENNESIKLIV AIMVLILVLN TIGVYSKQNV NVKDNLIIIE PKFKYIYPFS EGLAVFISED NKHGYLDMKG NIIIKPNFEE AHSFSEGIAV VKEKGKWKVI DKKGNIILSL DFDYVGSFSE GLAPVEKDKK WGYINKKGEI VIEPQFESAG NFFEDRAVIQ LNGKFGYIDK SGKIIISPKY YVAYEFSKGV AAVAILDAQK NTKYGFIDRN GKYVIEPKYD FLAGFIGNIY YDKIFKNGLA RIRVNNKFGY INEKGGVVIQ PKFEYALEFD KNIALVCYNG KYGYINKKGD FIINPVYEDM LLFSEGLAAA KLNGKWGFIN YKGDFTIKPQ FEKAYSFSEG VAAVKLKGKW GFIDKKGNFI IKPQFDEPII YCSAIYTFTN GLAAVCKNRK YGYIDKNGKW IVEPNYDVAS GFMNGFAHID KNNLVGFIAL KNVVLKEKN
|
| |
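The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch of that arithmetic, not part of the original record. It assumes 1-based inclusive coordinates and that the listed gene length includes the stop codon. All function names are hypothetical helpers introduced for illustration.

```python
# Illustrative consistency checks for the record fields above.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the gene length counts the terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp from 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def join_blocks(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    bases = join_blocks(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    length = gene_length(808914, 810263)
    print(length)                  # 1350, matches "Gene Length"
    print(protein_length(length))  # 449, matches "Protein Length"
```

Passing the full "Gene sequence" field to `gc_fraction` gives a value that can be compared with the listed GC content.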