Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0032 |
Symbol | |
ID | 7407267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 40550 |
End bp | 42073 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643714442 |
Product | KWG repeat protein |
Protein accession | YP_002571967 |
Protein GI | 222528085 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGGA TTAAAAGTTT TATATGTCTT GTATTGGTAT TTATCTGGCT CACTTACAGT CTGACTTTTG CGCATGCTCA GCAAGCCTCT TCTCAAAAGT TTGTGTATAT AAAACCGCAG TTTGAAAATG TAATGTTCTG GGAAAACGGC TGGATTTCTT ATTTTCAAGG TGGCAAATGG GGAATTCTTA GCTCATCTGG CAATGTTCTT TTAAAGCCTC AGTTTGATAA GATAGAACCT GTAGTTTACG ACAGTCCTTC AATAATGGGT ATTGAGGATA AATTTTACAA AGATATCTTT GTTATCTGGC AAAATGGCAG AGCAGGTTTT GTGGATACAA GCGCCAAAAT TCTTGTAAAA CCAGAGTTAG ATTCTCTTGA GGTTTTAAAT CCTTGGCTAA ATTTATATTT TTGCAAAAAA GAGGGCAAAT ACGGTTTTTT GAATCTGGAT AAAAAAATCT ATGTCAGTCC TCAGTTTGAC AAAATGTACT TTGTGGTTGT ACTCCCATAT AACCCAAATG GAAAATATCC AAATCCCCAT GCCAAGTGTA CTTTGTCAGA TGAAAACAAA AAAGAGTTCT GTCTTTATGC TCTTGATTTG AAAGATGGAA TATGGCCAAA TTCTTATTTA GAATACATTC TTGTATCAAA AGATGAAAAG TGCGGAGCTG TTAATATAGA TGGAAATGTG TTTGTAGATT TTAAATATAA TTCTTTTGAA GAAGCTTTAT CAGACAGCAA GTTTACAGAA GCTGTGAAGA ATATGTTAGC AAATGAATCA AAACCTGCAA CATCTTTGAG TAAAAATGAG TCAAACAAAA CATCACCTGA TTATATTTCC AGCGAAATTA TTGGCAATAA ATACTTCCTT GTGTTTCAAA AAGCAACCAG TAAAGGTTAT ACACAAACTA AAAGCAAAGA ATACTATGAT AATGTCAAAA ATATTGGGTT TACCAATTTC ATTGCAGTTT GTAAAAATAA AAAATGGGGA ATTGTTGATA TAAACGGAAA ATACGTAGTT AAGCCTCAAC TTGATGATAT AAAAGAACTC AGCGAAGGAA AAATAGCTTT TAAACAAAAT GGAAAATGGG GATTTATGGA CAAAAACTTT AAAGTAGTTA TAAAACCTCA GTTTGATAAA GCAGAAAACT TTTCTGAAGG ATACGCTGCT GTAATGAAGT CAAATTTATG GGGATATATT AATCCTTCTG GAAAGTTTGT TATAAAACCT CAATATACTC AAGCAGGTCC ATTTTTTGCT CAGATGGCTG CTGTTGCTAC AAAGGATTAT GTAGGGCTTA TAGATACAAA AGGCAGTTTT GTAGTAAAGT TTTCAGCTAA GAATTCTCAA TATTCTTTTG TGGACAGTGA AACCTACAGG TTTAGATATG CTCCAGATTC CATAATAAAC TCCAGAGCTT ATGAGCTTTA TAAATATACA CTTAAATTCC CCAAATTTGG CTATGTTGTG ATTGATAAAA AATCAAAAAA GGTTGGACTT GTTTTAAGAG GGCAAGGAAA ATAA
|
Protein sequence | MRRIKSFICL VLVFIWLTYS LTFAHAQQAS SQKFVYIKPQ FENVMFWENG WISYFQGGKW GILSSSGNVL LKPQFDKIEP VVYDSPSIMG IEDKFYKDIF VIWQNGRAGF VDTSAKILVK PELDSLEVLN PWLNLYFCKK EGKYGFLNLD KKIYVSPQFD KMYFVVVLPY NPNGKYPNPH AKCTLSDENK KEFCLYALDL KDGIWPNSYL EYILVSKDEK CGAVNIDGNV FVDFKYNSFE EALSDSKFTE AVKNMLANES KPATSLSKNE SNKTSPDYIS SEIIGNKYFL VFQKATSKGY TQTKSKEYYD NVKNIGFTNF IAVCKNKKWG IVDINGKYVV KPQLDDIKEL SEGKIAFKQN GKWGFMDKNF KVVIKPQFDK AENFSEGYAA VMKSNLWGYI NPSGKFVIKP QYTQAGPFFA QMAAVATKDY VGLIDTKGSF VVKFSAKNSQ YSFVDSETYR FRYAPDSIIN SRAYELYKYT LKFPKFGYVV IDKKSKKVGL VLRGQGK
|
| |