Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0452 |
Symbol | |
ID | 7407530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 514611 |
End bp | 516005 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643714840 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002572357 |
Protein GI | 222528475 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACATA TAATAAACTG GTTTTCATGG TTTGTATCTT ATTATGTTTT AGTTTTGAAT ACTGTTTATG CCATTTTAAT TTTAATATCT CTTTTTGGTA TTGTTAGTTA CTGGAGAAAT AAGATAAAAG GAAGGATTGT GGAGGTTGTC TCATCAGACT TTGCACTCCC TGTATCACTG CTGGTGCCGG CATACAATGA AGAAGAAACA ATAGCAAAAT CTGTAAAATC TTTTTTGCAG ATTGAATATC CAGAATATGA AGTAGTAGTA ATTAATGATG GGTCAAAAGA TGGGACATTG GATGTATTAA AAAACGAATT TGACCTTTAC ATTGTAGATA GGAAATTTAG AAAGATTTTG TCAACAAAAG AGATAAAGGT CATATACTAT TCCAAAAAGT ATTCAAATTT AATCGTAGTG GATAAAGAAA ATGGGGGAAA AGCTGATGCT CTCAATGCAG GGATAAACGT ATGTACATAC CCATATGTTT GTTCACTTGA TGCCGATTCA ATTTTAGAAA GAGATTCAAT AGCCAAGGTT ATGCAACCAT TTTTTGATAA CCCTTATGAA GTTGTAGCCA CAACGGGTAT TGTTAGGATT GTAAATGGAA CCGAATTGGA TTCCTTTGGC AATATAAAAA AATTAAAGCT ACCAAGTTCA AGTCTTGCAA GATTTCAGAT AATAGAATAC TTGAGAGCTT TTTTGGGTGC GAGAAAAGGT CTTTCTATGA TAGGAAGTCT TGTTATTGCA TCTGGAGCAT TTGCAGCATT TAATAAAAAT GCTATTATAA AGGTTGGAGG GTTTTCTGAT AGAACTGTTG GCGAGGATAT GGAGATTGTT GTAAAGCTCA GAAAAAACTC TTATAAAGAA GGCGCACTGG GAAGAGTTGA ATTTGTACCA GACCCGATTG TATGGACTCA ATGTCCAGAG ACTTTAAAAG ACCTTTCAAA ACAAAGAAGA AGATGGCAAA GAGGTCTTTG CCAAGTTATT TTCATGCACA AAGATATTCT ATTTAATCCT AAGTATGGCA TATTAGGTCT TTTTGCTATG CCATATCAGC TTATGTTTGA ACTATTGGGG CCGTTTGTGG AGATGTTGGG TTATATTTTT ATACCTATAT CGTATTTTGC TCACATAATC AATTTAGAAG TGGCCTTATT TTTCTTTGCA GTTGAGATAA TGTACGGAAT AGTTATTTCG ATTTTAGCAG TTCTTCTTGG AGAATTTTCT GATAGAAAAT ATGAAGGGTG GAGAGAGTTC GGTATACTTG TATTGTTTGC GATATTAGAA AATTTTGGCT ACAGACAGAT GACAATGTTA TTCAGAATTG TTGGTACATT TGAAGCTATA CTTAGAAAGA AAGGTTGGGC AAAGCCTGAG AGAAAGAAGT TATAA
|
Protein sequence | MRHIINWFSW FVSYYVLVLN TVYAILILIS LFGIVSYWRN KIKGRIVEVV SSDFALPVSL LVPAYNEEET IAKSVKSFLQ IEYPEYEVVV INDGSKDGTL DVLKNEFDLY IVDRKFRKIL STKEIKVIYY SKKYSNLIVV DKENGGKADA LNAGINVCTY PYVCSLDADS ILERDSIAKV MQPFFDNPYE VVATTGIVRI VNGTELDSFG NIKKLKLPSS SLARFQIIEY LRAFLGARKG LSMIGSLVIA SGAFAAFNKN AIIKVGGFSD RTVGEDMEIV VKLRKNSYKE GALGRVEFVP DPIVWTQCPE TLKDLSKQRR RWQRGLCQVI FMHKDILFNP KYGILGLFAM PYQLMFELLG PFVEMLGYIF IPISYFAHII NLEVALFFFA VEIMYGIVIS ILAVLLGEFS DRKYEGWREF GILVLFAILE NFGYRQMTML FRIVGTFEAI LRKKGWAKPE RKKL
|
| |