Gene Information |
Locus tag | Athe_0232 |
Symbol | |
ID | 7407223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 280418 |
End bp | 281611 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714632 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002572155 |
Protein GI | 222528273 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
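The coordinate and length fields above agree with each other, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive (consistent with the stated 1194 bp) and that the final codon is a stop codon that is not counted in the protein length. The function name `gene_length` is illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of the record.
start_bp, end_bp = 280418, 281611

length_bp = gene_length(start_bp, end_bp)  # nucleotides in the CDS
codons = length_bp // 3                    # codons, including the stop codon
protein_aa = codons - 1                    # residues, excluding the stop codon

print(length_bp, codons, protein_aa)  # 1194 398 397
```

The results match the Gene Length (1194 bp) and Protein Length (397 aa) fields.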
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000273015 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGGATAC TGCAGCTTAC TTGGGAATAT CCACCAAGGA TTGTTGGCGG CATCTCAAGG GTGGTCAGAA GCATTTCACA GAAGCTATCT GAAACAGACA TAGTTTGTGT TGTTACCATT TCAGAAGATT ACGAAAGAAT AGAAGATCAT GGGAGTCTTA AAATATTTAG AGTTCCAGTG TATCCACTAA ATTCCCTTAA CTTTATCGAC TGGGTTATGA TGATGAACAT GGCACTTGCT GAAAAAGCTA TATATATTGC ACAAAAGGAA GGAAGATTTG ATATAATCCA CGCGCATGAC TGGCTTGTGG CATTTGCTGC GCGCATCGTT AAGTACGCTC TTCGGATTCC CTTGGTTGCT ACAATCCATG CAACAGAACA CGGACGAAAC GGTGGTATAT ACACAGATAT GCAGAGATTC ATCCACAATG TTGAGTGGTG GCTGACATTT GAGGCATGGA AAGTAATTGT GAACTCTGAG TTTATGAAAC ACGAATGTGA AAGAATTTTC AGTTTAACAC CAGACAAATG CATTACTATT CCAAATGGTA TAGATTATGG TGAGTTTGCA AATGTAGAGT TTGATTTAGA ATTTAGAAGA AAGTATGCAA TGGACAGTGA AAAGATAGTC TTTTTCATTG GAAGACATGT TTATGAAAAA GGAGTTCACA TCTTGATAGA AGCTTTCAGA AAGGTGCTTG ATAATTTTTA TGATGCAAAG CTAATAATTG CGGGCAATGG TCCAATGACA GGTGAACTTT ACTCAAAAGC TCACTTTTTA GGGCTTTCAC ATAAAGTGAT GTTTACAGGT TTTATTTCTG ATGAAGAAAG GAAAAAACTA TTTAAAGTTT CTGATATTGC TGTGTTCCCA AGTCTTTATG AACCTTTTGG AATAGTTGCT TTAGAAGCAA TGGCATCAGG ATGTTTGCCA GTTGTATCTG ACACAGGAGG TTTTTCTGAG ATTGTAAAAC ACCTTCACAA CGGACTTACT TTTTTCTGCG GGAATTCAAA TTCACTTGCT GATATGATTT TGCTTGCTTT GAAAGATAGT ACACTTCGAC AAAAATTGTC AAAACAAGCC CAGTCTGATG CAAAAGAAAT TTATTCATGG GATGAGATAG TGAAAAGACT GAAAAATGTG TACCAGATGA TTGTCACAGA AGCCAAAAAG ATGGAATGGT TTTCAGTACG TTAG
|
Protein sequence | MRILQLTWEY PPRIVGGISR VVRSISQKLS ETDIVCVVTI SEDYERIEDH GSLKIFRVPV YPLNSLNFID WVMMMNMALA EKAIYIAQKE GRFDIIHAHD WLVAFAARIV KYALRIPLVA TIHATEHGRN GGIYTDMQRF IHNVEWWLTF EAWKVIVNSE FMKHECERIF SLTPDKCITI PNGIDYGEFA NVEFDLEFRR KYAMDSEKIV FFIGRHVYEK GVHILIEAFR KVLDNFYDAK LIIAGNGPMT GELYSKAHFL GLSHKVMFTG FISDEERKKL FKVSDIAVFP SLYEPFGIVA LEAMASGCLP VVSDTGGFSE IVKHLHNGLT FFCGNSNSLA DMILLALKDS TLRQKLSKQA QSDAKEIYSW DEIVKRLKNV YQMIVTEAKK MEWFSVR
|
| |
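The protein sequence can be reproduced from the gene sequence with translation table 11, the value in the Translation table field. The following is a minimal Python sketch, not the pipeline that produced this record. It assumes that table 11 uses the standard amino acid assignments and that an alternative start codon such as TTG is read as methionine when it is the first codon. The helper name `translate_table11` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI TCAG order; table 11 shares them
# and differs from table 1 only in which codons may initiate translation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_table11("TTGCGGATAC TGCAGCTTAC TTGGGAATAT"))  # MRILQLTWEY
```

The output matches the first ten residues of the protein sequence. Passing the full Gene sequence field to the same function allows the complete protein to be checked in the same way.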