Gene Athe_0059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0059 
Symbol 
ID7407296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp77375 
End bp78382 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content28% 
IMG OID643714471 
Productglycosyl transferase family 2 
Protein accessionYP_002571994 
Protein GI222528112 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAATT TAACAGTTAT AATACCTACA TATAATCAAA AAGATTTATT AGAAAGAGCA 
ATAAATTCGT TAATATCTAA GTGCAATGAC GAAATAAATA TATATGTATT AGTCAACAAT
ACGGAAAGTA ATTACATCGT AAGAAATAAA AATTATGCAA ATCTACATGT TGAATATTTG
AACACAAACT GTGGTTTTTG TAAAGCAGTC AATTATGGAT TACGATTAAT AAAAAAATCA
AGATTTATTT TTCTTTTAAA CGATGATACT GAAGTAATAA ATCAGATAGA TATAGATAAT
ATTATTCATG AATTGATTGA AAAAGGCAAC ATTTTTTCTA TATCGTTGAA AATGCTGAAG
GGTAATTATC CAAATCTTTT AGACGACGCA GGTGATATGT ACACTATCTT AGGTTGGCAG
TTTAAAAGAG GCAATGGTCT TCCAAAAGAA CTTTATGATA GACCGTGTGA AATTATTTCC
GCATGTGGTG GTGCTGCAAT CTATAACAAA AAAATTCTTG ATGAGATAGG TTACTTCGAT
GAAGATTTTT TTGCATATCT TGAGGATGTA GATTTAGGTT TGAGAGCGCT CATGAGGGGA
TATAAAAATT TATATTATCC TTACATAAGC GTATTGCATG TTGGAAGTGC GACAACAGGA
GGAAAATATA ACGATATTAC TATCAGACTT ACAGCAAGAA ACTCGATATA TGTTATATAC
AAAAATCTTC CCTTACCCCT TTTAATAATT AATTTTCCCT TTATTTTATT GGGATACTTA
ATCAAATTTA TATTCTTTGC TAAAAAAGGA AAAGGAAAAA TTTACATAAG TGGAGTTCTT
GAAGGACTAA AAAATTTGCC CAAATTTAAA GAAAAAAGAA GAGAAAATAT GAGAAAAAAG
AAAATTTCTA ACATAAAGCT TGAATGGATT CTTATTAAAG CTACATTTGA ATATTTTCAC
CAATATATAA AAAGAGCATT TTACACTTTA AGAGGTGCAA AGAAATGA
 
Protein sequence
MRNLTVIIPT YNQKDLLERA INSLISKCND EINIYVLVNN TESNYIVRNK NYANLHVEYL 
NTNCGFCKAV NYGLRLIKKS RFIFLLNDDT EVINQIDIDN IIHELIEKGN IFSISLKMLK
GNYPNLLDDA GDMYTILGWQ FKRGNGLPKE LYDRPCEIIS ACGGAAIYNK KILDEIGYFD
EDFFAYLEDV DLGLRALMRG YKNLYYPYIS VLHVGSATTG GKYNDITIRL TARNSIYVIY
KNLPLPLLII NFPFILLGYL IKFIFFAKKG KGKIYISGVL EGLKNLPKFK EKRRENMRKK
KISNIKLEWI LIKATFEYFH QYIKRAFYTL RGAKK