Gene Athe_0167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0167 
Symbol 
ID7407158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp206185 
End bp207282 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content33% 
IMG OID643714569 
Productglycosyl transferase family 2 
Protein accessionYP_002572092 
Protein GI222528210 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00277804 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATTTT TTATTATCTT CATCCTGGGC ATAGCTTCGG GATTTCTCCT TTTCTCAAAA 
ATATTTCTGG CAGACACTAA GGACGATTCT CTTGAACTAA ATCAGAAGAT TTCAGTAATA
ATACCTGCTC GGAATGAAGA GAAAAATCTG CCTTACCTGC TCAAAAGTCT TTTTAGTCAA
ACTACTGTTC CCGATGAAAT AATAGTTGTA GATGATTTTT CTGAAGATAA CACTTCTAAG
ATTGCCAGAG AATTTGGCGT TAAATTAATT AAAAATCCAC CTTTGCCTCC AGGCTGGACA
GGTAAAAATT GGGCTCTTTG GAATGGGTAT TTAAATTCAA TAGGTGATAT ACTGATATTT
TTAGATGCTG ATGTGAGACT ATCTGAAAAT GGTATAGAAA GAATTATAAA GACACTCTTT
TCAACAAATG GTGCAATTTC AGTTATACCA TATCATACAA CGCAGCAGCT TTATGAAAAA
TTGTGTCTAA TTGTAAATAT CCTTGGTGTA TTTGCGTTTA TGTCACCTTA TGAAAGAAAG
AGCAAGAACA AAGGTATGTA TGGTTCATGT ATAGCAGTTT TTAGAAAAGA CTACGAAAAG
GTTGGCGGGC ACAAACGTAT ATGTAACAGA GTAACAGACG ATTTGAGCCT TGGCAAGCTT
TTTTGCGAAA ATGGTATTAG AGTTGAAAAT TTTTTAGGAT ACGGTGCTGT TACATTTAGA
ATGTACCCAA ATGGAATGAA AAGCCAGCTT GAAGGAATTG CAAAAAGTGC AGCTTTAAGC
ATGCAGCTTT TAAATACAAA AACAGTCATT TTAATTGCTC TGTGGACTTT TGGGCTTGTC
TTAACAGGTT TCTTAACACC GATTTTGCTG TACATTCATC ATCCTTTAGC AACTAAATTT
TTAATAGGCT ATATTCTTTA TGTCATTCAG ATATTATATC TCCAAATATA TATAGGTGAT
TTTGGTTTTC TACTTCCTAT ACTGTACTTT ATTCCTACTG CATATTTTTT ACTAATGATT
TTGTATTCTT TTTATCAAGT AAAGTTTATT AGAAGTGTCT ACTGGAAAGG AAGACAAATT
AAAGTAGGGG GTAAATAA
 
Protein sequence
MIFFIIFILG IASGFLLFSK IFLADTKDDS LELNQKISVI IPARNEEKNL PYLLKSLFSQ 
TTVPDEIIVV DDFSEDNTSK IAREFGVKLI KNPPLPPGWT GKNWALWNGY LNSIGDILIF
LDADVRLSEN GIERIIKTLF STNGAISVIP YHTTQQLYEK LCLIVNILGV FAFMSPYERK
SKNKGMYGSC IAVFRKDYEK VGGHKRICNR VTDDLSLGKL FCENGIRVEN FLGYGAVTFR
MYPNGMKSQL EGIAKSAALS MQLLNTKTVI LIALWTFGLV LTGFLTPILL YIHHPLATKF
LIGYILYVIQ ILYLQIYIGD FGFLLPILYF IPTAYFLLMI LYSFYQVKFI RSVYWKGRQI
KVGGK