Gene Athe_0461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0461 
Symbol 
ID7407539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp526957 
End bp528180 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content35% 
IMG OID643714849 
Productglycosyl transferase group 1 
Protein accessionYP_002572366 
Protein GI222528484 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGAAGG TTTGGATATT AAATCACTAT GCAATCCCGC CAAAAGTTGG GGGAATTACA 
AGGCATTTTG ATTTTGCTAA GCAGCTTGCA GAAAGAGGCT ATAGCGTTAC CATCTTTGCT
TCAAGTTTTG ACCACAAACA GAGGGTTGAG ATGTTAGAAA AAGGCAAGAA ATTTAAAATA
GAGGAATATG AGAAAGTAAA ATTCGTGTGG ATAAAGACGT TTCCTTATAA AAAGAATGAT
ATAAAAAGAC TTTTTAACAT ATTTTCATAT GCCAAAAACC TTTATTTCAT TGCCAGAAAG
TTTGAAAGAC CTGATGTAAT ATTGGCATCT TCATTTCATC CTCTTGCCTG GATTGTGGGG
TATTTGCTGT CAAAAAAATT TAAATGTAAA TTCATTGCAG AGGTCCGAGA CCTTTGGCCG
CAAAGTGGTA TTGACCTTGG TGCTTTGAAG GAAGGAAGTG TAATTGTAAA GCTTTTAAGA
AGTCTTGAAA AGTTTATTTA CACAAAAGCA GACTATGTTG TAACGGTATT GCCAAAGGCA
GACCAGTACA TAGAAAGTTT GGGTATTGAT AAAAAGAAGA TTGTTCATAT TCCTAACGGA
TGTGATATAG AAAGGTTTAA TAGTCTTAAA AATATTCTCT CAGAGGAAAC GAAAAGGATT
TTAGATGAAC ATAAAGGATA TTTCAAAGCT TGTTATCTTG GTGCGCTTGG ACAGGCAAAT
GCAATGGAGA CCATAATAGA GGCAGCAAAA ATTGTCCAGG AAAATGTAGG TGATAGAATT
CATTTTTTGA TAATTGGTGA CGGTCCAGAA AAAGAAAAGC TTGAGAATAT GGCAAAGGAG
CTTGGACTTA AAAATGTATT TTTTTATTCT CCTATCTCAA AGCTTTCTGT GCCAAGCTTA
CTTGAATTCA TTGACATAAC ACTTGTTTCT ATGCACAATC TAAAAGTTTA CAGGTTTGGA
ATATCACTTA ATAAGCTTTT TGACTATTTG TGTGCTGCAA AGCCGATTGT TTTTGCGGGA
AATGTAGCAA ACGATATTGT CAAAGAGTCA GGTGCAGGAA TTTCCTGCAA AAGTTATGAC
AGCAAAGCAT TTGCTGAGGC AATATTGAGC TTGTATGCTA TGTCCAAAGA AGAAAGAGAG
AGAATTGGGC AAAAGGGAAG AGAATATGTT CAAAAGTACC ATGATATAAA GGTGCTTGCA
GACAGATTAG AAAAAATATT ATAA
 
Protein sequence
MKKVWILNHY AIPPKVGGIT RHFDFAKQLA ERGYSVTIFA SSFDHKQRVE MLEKGKKFKI 
EEYEKVKFVW IKTFPYKKND IKRLFNIFSY AKNLYFIARK FERPDVILAS SFHPLAWIVG
YLLSKKFKCK FIAEVRDLWP QSGIDLGALK EGSVIVKLLR SLEKFIYTKA DYVVTVLPKA
DQYIESLGID KKKIVHIPNG CDIERFNSLK NILSEETKRI LDEHKGYFKA CYLGALGQAN
AMETIIEAAK IVQENVGDRI HFLIIGDGPE KEKLENMAKE LGLKNVFFYS PISKLSVPSL
LEFIDITLVS MHNLKVYRFG ISLNKLFDYL CAAKPIVFAG NVANDIVKES GAGISCKSYD
SKAFAEAILS LYAMSKEERE RIGQKGREYV QKYHDIKVLA DRLEKIL