Gene Acel_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1803 
Symbol 
ID4485702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2044319 
End bp2045368 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content65% 
IMG OID639730593 
Productribokinase-like domain-containing protein 
Protein accessionYP_873561 
Protein GI117929010 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.453248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCTCA GCATGAGCGC GATGATCAGC GACACCTCCA ATCGGGAGCC TTTCGACATT 
CTCACGATGG GGCGTGTCGG AGTTGACGTT TATCCACTGC AGCCGGGCGT GTCGCTCCGG
CATGTCACCG CGTTCGGCAA ATACCTCGGC GGGTCTGCGA CCAACGTTGC GGTCGCCGCG
GCTCGTCTTG GCCGGCGCAG CGCCGTCATT ACCAAGACCG GTCGGGACCC ATTCGGCGAG
TTCATCCACG ATGCGCTGCG CGCCTTTGGC GTCGACGATC GGTGGGTAGG TTCCGTCTCC
CACCTGCCCA CGCCGGTCAC CTTCTGCGAA ATATTTCCGC CCGATGACTT TCCCCTCTAT
TTCTATCGGC ACCCGAAAGC GCCCGACCTC GAGTTGACGA TCGACGACGT GGACGTCGCC
GCCGTCAAGG ACGCGCGGGT GTTCTGGGTG ACCGTGACCG GCCTGAGTCA AGAACCGAGC
CGGACGGCGA CGCTCCACGC ATTGCGACAG CGGGGCCGTC GAGATATCAC CGTGCTCGAC
CTGGATTACC GACCCATGTT CTGGCCGTCG CGGGAATACG CCCGTCGCTG GGTCGAGGAG
GCGCTCAGCA TGGCGACCGT CGCGGTCGGG AATCTCGATG AATGCCTGAC TGCTGTTGGG
ACAGCAGAAC CGTTGGCGGC AGCGGAAGCC CTTCGGCGGG CGGGGGTGCG CCTCGCCGTC
GTCAAGCAAG GCCCGAAGGG GGTGCTCGGG CTCGCCGACG ACGGACCAGT CGTCGTACCA
CCGATAGACA TCGATGTCGT CAATGGCATT GGTGCCGGCG ATGCCTTTGG CGGCGCCCTC
TGCCATGGCC TGCTTGCTGG TTGGCCGCTG GATCGGGTGC TCCGATTCGC CAACGCCGCA
GGTGCGATCG TCGCCGGCCG GCTCGCGTGC GCCGATGCGA TGCCGACCCA GGACGAGATC
GAGCAGCTGC TCACAGCGCG ATCCGACGGT CAATCCTCGC GGCGATCAGG GGGCGAGGCT
TCGGTTGAAC AGGAGAAAGC TGATGCGTGA
 
Protein sequence
MGLSMSAMIS DTSNREPFDI LTMGRVGVDV YPLQPGVSLR HVTAFGKYLG GSATNVAVAA 
ARLGRRSAVI TKTGRDPFGE FIHDALRAFG VDDRWVGSVS HLPTPVTFCE IFPPDDFPLY
FYRHPKAPDL ELTIDDVDVA AVKDARVFWV TVTGLSQEPS RTATLHALRQ RGRRDITVLD
LDYRPMFWPS REYARRWVEE ALSMATVAVG NLDECLTAVG TAEPLAAAEA LRRAGVRLAV
VKQGPKGVLG LADDGPVVVP PIDIDVVNGI GAGDAFGGAL CHGLLAGWPL DRVLRFANAA
GAIVAGRLAC ADAMPTQDEI EQLLTARSDG QSSRRSGGEA SVEQEKADA