Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1569 |
Symbol | |
ID | 7409078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1658677 |
End bp | 1660008 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643715942 |
Product | glycoside hydrolase family 28 |
Protein accession | YP_002573440 |
Protein GI | 222529558 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAATAA TTGTAACTGA CTTTGGAGCA AAAGGAGATG GAGTGAGTTT TTGTACAGAG GTAATACAAA AGGCTATTGA CACCTGTTTT GAAAATGGTG GTGGAGTTGT AGTTATACCT GCTGGAATTT ATTTGAGTCG CCCTATAAGG TTAAAATCTA ATGTAACCCT ATATTTGGAA GAAGGTGCTG TAATCAAAGC AACCAACAAT ATTGAAGATT ACTACCAAAT TGGTTATTAC CACAACGAAT GGGGAGAGGT TACTTCATTT TTATATGCGA TGAACGAAAA GAACATAGCA GTTGATGGAA AAGGTACAAT TGACCTTTCC GGAAGCAGTT TTATGGATTT TTCAAGAGCA TTTAATCAGT TCGAAGAGCT ATCTCAACTT GACAAAGACC AGTTTGAAGA GACAGAATGT AAGCCCATTT ACCGACCAAA TCAACCCATA TTCTTCTATA ACTGTGAAAA TATTAGTCTG AGCGGTATCT CAATTATTGA CTCACCTTGC TGGACAGTTT GCATCCACTC ATCAAAATAC ATCAAGGTGC ACAACATAAG GATTATGAAC AATCTCAGAG TTCCAAACAG CGACGGTATA CATCTTTGCT CATGCGAAAA TGTAATAATA TCAGATAGCT TCTTTACTTG CGGTGATGAC TGTGTTGCTA TATCTGGCAT TACAAACTGG GACAAACCAT GTGAAAATAT AATTGTGTCA AACTGTATTA TGCAAACACG CTCAGCAGCG TTGCGGATGG GACACCTCGA TAGCAAAGTA AAGAATGTGG TTGCTTCTAA TCTTATCATC CTTAATTCAA ACAGAGGAAT TGCAATATTC GCAAACGGGA AAAGCGGGTA TGTCAAATCA GTTACAATCT CAAATGTCAT TATGACAACA AAAATATTTG CTGGTACATG GTGGGGAAAA GGTGAACCAA TTGTAATTGC CTCTCCTGAA GAAGGTAATT TTATTGAAGA TATCTCTATT TCCAATATAA AAGCTTATTC GGAAAATGGG ATTTTAATCT ATGGCAAAGA TAATAATATC CGCAATATTT CACTTAAAAG TATTGATATA TACCTTAGAT TTGGCAAAAA CAGACCACTT TTTGGCAAAA GAATTGACAT TCTCCCAAGC AAATGTCCAC CTTTCCCTGA TTACATGAAT AAAATTCCAT GGATTTTTGC AAAAGATGTT TTGAATCTTA AACTTCAAGA TATAAACTAT GGATACAATT TACAAAACTT AAAAACTGAT TTTGACATAG AAGGAATTTT TCAAAATGTA AATAACTCTT TCTTTTCTGG TATTAGAAAA GTAGCTACTT AA
|
Protein sequence | MRIIVTDFGA KGDGVSFCTE VIQKAIDTCF ENGGGVVVIP AGIYLSRPIR LKSNVTLYLE EGAVIKATNN IEDYYQIGYY HNEWGEVTSF LYAMNEKNIA VDGKGTIDLS GSSFMDFSRA FNQFEELSQL DKDQFEETEC KPIYRPNQPI FFYNCENISL SGISIIDSPC WTVCIHSSKY IKVHNIRIMN NLRVPNSDGI HLCSCENVII SDSFFTCGDD CVAISGITNW DKPCENIIVS NCIMQTRSAA LRMGHLDSKV KNVVASNLII LNSNRGIAIF ANGKSGYVKS VTISNVIMTT KIFAGTWWGK GEPIVIASPE EGNFIEDISI SNIKAYSENG ILIYGKDNNI RNISLKSIDI YLRFGKNRPL FGKRIDILPS KCPPFPDYMN KIPWIFAKDV LNLKLQDINY GYNLQNLKTD FDIEGIFQNV NNSFFSGIRK VAT
|
| |