Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2555 |
Symbol | |
ID | 7409506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2674773 |
End bp | 2675759 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643716919 |
Product | glycosidase PH1107-related |
Protein accession | YP_002574396 |
Protein GI | 222530514 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000154698 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATCA AAATAATAGG GCAATCACTT CCAAACATGC CATGGGAAGA GAGGCCAAAA GACTGCAAAG ACATTGTATG GAGGTCTAAA CACAACCCAA TTATCAAGAG AAATCAGGCA AAAGATGCAA ACAGCATCTT CAACAGCGCA GTTGTTCCAT TCAAAGATGG TTTTGCAGGA GTTTTTAGAG TAGATGATAG GGCAAGAAGA ATGAACATCA GACGTGGGTT TAGCAAGGAT GGTTACAACT GGGAGATTGA CGATGAACCA ATCAACTTTA TCCAGCAGAC AAGAGACCCG CTTGTAAGTG AGTATAAATA CGACCCAAGA GTAACTTTCA TAGAAGATAG ATACTATATC ACATGGTGCA ATGGCTATCA TGGTCCGACA ATTGGTGTTG GCTATACATT CGACTTTGAA AAATTTTATC AGATTGAAAA TGCGTTTTTG CCTTACAACA GAAACGGTGT ACTTTTCCCG AGGAAAATAA ACGGCAAATA CGCTATGTTA TCCCGCCCAT CAGATACGGG GCACACACCA TTTGGTGATA TATTCTACAG CGAAAGCCCT GATATGATTC ACTGGGGTTG CCACAGACAT GTAATGTCAG CAGGCTATAC TCCATGGCAG TCGCTCAAAA TAGGGGCAGG GCCTACACCA ATTGAAACAA GCGAAGGATG GCTGCTAATT TATCACGGTG TACTTCTTTC ATGCAATGGT TATGTATACA GCTTTGGTGC AGCGCTTTTA GATTTGGAAA AACCATGGAT TGTGAAAGCA AGATCAAAAT CTTATCTTCT TTCACCACAA GAGTATTATG AATGTGTTGG CGATGTTCCA AACGTAGTGT TTCCATGCGC AACACTTTGC GACGCTAGCA CAGGAAGGCT TGCAATATAT TATGGCGGTG CTGATACTGT TGTAAACCTT GCATTTGCTT ATGTTCAGGA TATAATTGAA CTTCTCAAAA GAGAGAGCCA GGAATAA
|
Protein sequence | MDIKIIGQSL PNMPWEERPK DCKDIVWRSK HNPIIKRNQA KDANSIFNSA VVPFKDGFAG VFRVDDRARR MNIRRGFSKD GYNWEIDDEP INFIQQTRDP LVSEYKYDPR VTFIEDRYYI TWCNGYHGPT IGVGYTFDFE KFYQIENAFL PYNRNGVLFP RKINGKYAML SRPSDTGHTP FGDIFYSESP DMIHWGCHRH VMSAGYTPWQ SLKIGAGPTP IETSEGWLLI YHGVLLSCNG YVYSFGAALL DLEKPWIVKA RSKSYLLSPQ EYYECVGDVP NVVFPCATLC DASTGRLAIY YGGADTVVNL AFAYVQDIIE LLKRESQE
|
| |