Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0167 |
Symbol | |
ID | 7407158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 206185 |
End bp | 207282 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643714569 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002572092 |
Protein GI | 222528210 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00277804 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATTTT TTATTATCTT CATCCTGGGC ATAGCTTCGG GATTTCTCCT TTTCTCAAAA ATATTTCTGG CAGACACTAA GGACGATTCT CTTGAACTAA ATCAGAAGAT TTCAGTAATA ATACCTGCTC GGAATGAAGA GAAAAATCTG CCTTACCTGC TCAAAAGTCT TTTTAGTCAA ACTACTGTTC CCGATGAAAT AATAGTTGTA GATGATTTTT CTGAAGATAA CACTTCTAAG ATTGCCAGAG AATTTGGCGT TAAATTAATT AAAAATCCAC CTTTGCCTCC AGGCTGGACA GGTAAAAATT GGGCTCTTTG GAATGGGTAT TTAAATTCAA TAGGTGATAT ACTGATATTT TTAGATGCTG ATGTGAGACT ATCTGAAAAT GGTATAGAAA GAATTATAAA GACACTCTTT TCAACAAATG GTGCAATTTC AGTTATACCA TATCATACAA CGCAGCAGCT TTATGAAAAA TTGTGTCTAA TTGTAAATAT CCTTGGTGTA TTTGCGTTTA TGTCACCTTA TGAAAGAAAG AGCAAGAACA AAGGTATGTA TGGTTCATGT ATAGCAGTTT TTAGAAAAGA CTACGAAAAG GTTGGCGGGC ACAAACGTAT ATGTAACAGA GTAACAGACG ATTTGAGCCT TGGCAAGCTT TTTTGCGAAA ATGGTATTAG AGTTGAAAAT TTTTTAGGAT ACGGTGCTGT TACATTTAGA ATGTACCCAA ATGGAATGAA AAGCCAGCTT GAAGGAATTG CAAAAAGTGC AGCTTTAAGC ATGCAGCTTT TAAATACAAA AACAGTCATT TTAATTGCTC TGTGGACTTT TGGGCTTGTC TTAACAGGTT TCTTAACACC GATTTTGCTG TACATTCATC ATCCTTTAGC AACTAAATTT TTAATAGGCT ATATTCTTTA TGTCATTCAG ATATTATATC TCCAAATATA TATAGGTGAT TTTGGTTTTC TACTTCCTAT ACTGTACTTT ATTCCTACTG CATATTTTTT ACTAATGATT TTGTATTCTT TTTATCAAGT AAAGTTTATT AGAAGTGTCT ACTGGAAAGG AAGACAAATT AAAGTAGGGG GTAAATAA
|
Protein sequence | MIFFIIFILG IASGFLLFSK IFLADTKDDS LELNQKISVI IPARNEEKNL PYLLKSLFSQ TTVPDEIIVV DDFSEDNTSK IAREFGVKLI KNPPLPPGWT GKNWALWNGY LNSIGDILIF LDADVRLSEN GIERIIKTLF STNGAISVIP YHTTQQLYEK LCLIVNILGV FAFMSPYERK SKNKGMYGSC IAVFRKDYEK VGGHKRICNR VTDDLSLGKL FCENGIRVEN FLGYGAVTFR MYPNGMKSQL EGIAKSAALS MQLLNTKTVI LIALWTFGLV LTGFLTPILL YIHHPLATKF LIGYILYVIQ ILYLQIYIGD FGFLLPILYF IPTAYFLLMI LYSFYQVKFI RSVYWKGRQI KVGGK
|
| |