Gene Athe_0058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0058 
Symbol 
ID7407295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp76104 
End bp77345 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content28% 
IMG OID643714470 
Productglycosyl transferase group 1 
Protein accessionYP_002571993 
Protein GI222528111 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAA TTGTTTATTT AGTCCATTCA ATACCCAAAT ATGAAAAGAG TGGAACCCCA 
ATAGCTGCTT GGAGAGTGGC AAAGGGTGTA AAAGAAAAAT ATAATCAAAA TGTAGCATTT
ATAATTCCTT CACCTGATGG AGAAGAGGGA AAAGAAAAGG TTGACGATAT TTTGGTTTAC
AAAGTAAAAA GAATAGATTG GCATGAAAAT TTTTTTCATG ATTTTGATAT AGATAGAGAA
ACGTATATTA GAAAAATAAA GAATATTTTA AAAGAAGTTA ATTGTAATAT TTTGCATATT
TATAATTTAG TATTTAGCTC TTATCAGGTA ATGAAACTTA GGAAAGAAGG AATCAGAATT
GTTCGTACGA TAACACATAC TGAGGATATT TGTTTTAATG TTGATCCTTT TGTTAAAGTT
GGTGATAAAA TTGAGATATG TAGTGGGCCT GATCCGATAG CAAAATGCGC TTGTCATTAT
AAACAGATGT ATGGTGGTAA TAATTTGATG GAGTTTATAA TTAAAAAGAT ATCTAAGCAT
TTTACGAGCG TAGAATTATT ATATTCTAAT TTTTGTGATA TAATAACCTT TACTAATGAA
GAATTTGCTA AATATTTTAC CAACTACGTA AATATTCCAA GAGATGTAAT CCGAATAATA
CCACATGGAG TAGAAAATAA ATTAGAAAAA TATATATTGC CGAATATGCC AAAAAACGAA
GGATTTAGAT TTTTATATCT TGGAGGAGAT AATTTTAGAA AAGGATTTGT TATATTAGAT
AATGCGTTAA ATTCTTTAAA TGGGGAGTTA TTCAACAAGA TTAAGGAAAT AACTATAGTG
GGTAAAACAA CAAAGGAATT TAGAGAAAGG TTTAATAATG ATAAATATAT GTTTAAGGGG
GTATTGCCAG AAGAAGAATT ATATAACGAA ATAAGTAATG CTGATCTTGT GATATTGCCC
ACATTTTTCG AAACATATAA CATATCTTTA AGAGAAGCTA TCAAGTTAGG AAAACCTGTT
ATAACTACTA AAACTTTTGG TTCGAATATT GTAGTGGATG GATATAATGG GTTTAGATTT
GATATTGGTG ATAGCTTACA ATTAAAAAAC ATAATAGAAT TAATATTAAG CAATCCCCAA
ATACTGGTAG ATATGAGCAA AAATTGTTTA AATACTCATA TAACTGATAT TGAAGAAGAA
ATAAAGTTAT TTATGAAAGT TTACAATGAA TTAAAAGATT AG
 
Protein sequence
MSTIVYLVHS IPKYEKSGTP IAAWRVAKGV KEKYNQNVAF IIPSPDGEEG KEKVDDILVY 
KVKRIDWHEN FFHDFDIDRE TYIRKIKNIL KEVNCNILHI YNLVFSSYQV MKLRKEGIRI
VRTITHTEDI CFNVDPFVKV GDKIEICSGP DPIAKCACHY KQMYGGNNLM EFIIKKISKH
FTSVELLYSN FCDIITFTNE EFAKYFTNYV NIPRDVIRII PHGVENKLEK YILPNMPKNE
GFRFLYLGGD NFRKGFVILD NALNSLNGEL FNKIKEITIV GKTTKEFRER FNNDKYMFKG
VLPEEELYNE ISNADLVILP TFFETYNISL REAIKLGKPV ITTKTFGSNI VVDGYNGFRF
DIGDSLQLKN IIELILSNPQ ILVDMSKNCL NTHITDIEEE IKLFMKVYNE LKD