Gene Acel_0575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0575 
Symbol 
ID4485239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp604831 
End bp606093 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content62% 
IMG OID639729342 
ProductROK family protein 
Protein accessionYP_872334 
Protein GI117927783 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.193602 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCTGC ACGACGCGCT GTCTGAAGAT CATGTCGAGG AGCTGGCGCA ACTGCTCGCT 
CTGATTCGGC GCGGGTCGGC TGTAACGCGG TCCGAGCTAG TCCAGAGCTC CGGGTTCGGT
CGTACTGCCA TCGCTCGTCG ACTTGATCAC CTGATGGCCC TCGGTCTTAT AGCGGAGAGT
GAGCCCCTTC CTTCAACCGG GGGACGAATG CCGCGCGGCG TTTATTTCCG CGCCGACGCC
GGCCGCTTGC TCGTCGCTGA ACTCGGCGCA ACGAGTATCG CCGCTGGCAT CTGTGATCTT
GACTGCCGTG TCTTGTCGTC TCGGGAGGAG GCATGGGACA TTGCTGCCGG CCCGGAGTCA
ACTCTCGCTC GCTTGGAGGC GCTGCTCAGC GAACTTCTTC GGGATCACCA AGCAAAGGTG
TGGGGAATCG GCGTTGGCCT GCCGGGACCC ATTGAATTTG CGACAGGACG TCCCGTATCG
CCGCCCATCA TGCCCGGTTG GGACCGCTAC CCGGTCCGGG AGCGACTAGC GAAACGCTTC
TCCGCGCCCG TCTGGGTAGA CAACGACGTC AACGTTCTCG CGCTAGGCGA ACTGCGGGCA
GGCGTCGGAC GAGCGCATAG TGATCTCATC TATATCAAGA TTGGAACAGG CATCGGCGCC
GGCATCGTCA GCGGTGGGCG GCTCCACCGC GGAGCGCAAG GATGTGCAGG GGACATTGGT
CATGTCGCGG TTACAGATGA TCAGGCGGTG GTATGCCGCT GCGGCAATAT TGGCTGTCTA
GAAGCGGTCG CCGGGGGAGC TGCCTTGGCA CGACGGGCCA CAGCATGGGC GCGCGAGGGT
CGTAGCGGCT ACCTGCAGCA ACGACTGACG GAAGGTGGTG ACCTCACGGC GGCGACGATT
GCGGCCGGTG CCGCAGCAGG TGACACCGGA TGCGTCGAGT TGCTCGCGGC CGCCGCGCGG
CAAATCGGGG ACTCATTAGC AACTTTCGTA AACTTCTTCA ATCCCTCAAT TGTCATCATC
GGCGGAGGCG TTGCCCAGGC GGGAAACGCG TTTCTCGCTT CGATACGCCA GCGGGTCTAC
TCAAGATCCC TTCCGTTAGC CACTCGAGAT TTGCAGATCG TCTTGTCCTC ACTGGGCGAT
ATCGGCGGAC TGATCGGTGC GGCGCACATG GTGGTAGACG AAATCCTGTC GCCGAAGGTG
CTATCCTTGT GGATCAAGGA AGGTACGCCC GAAGCACTTG CAGGAGCACA CGCGGGAAAT
TGA
 
Protein sequence
MVLHDALSED HVEELAQLLA LIRRGSAVTR SELVQSSGFG RTAIARRLDH LMALGLIAES 
EPLPSTGGRM PRGVYFRADA GRLLVAELGA TSIAAGICDL DCRVLSSREE AWDIAAGPES
TLARLEALLS ELLRDHQAKV WGIGVGLPGP IEFATGRPVS PPIMPGWDRY PVRERLAKRF
SAPVWVDNDV NVLALGELRA GVGRAHSDLI YIKIGTGIGA GIVSGGRLHR GAQGCAGDIG
HVAVTDDQAV VCRCGNIGCL EAVAGGAALA RRATAWAREG RSGYLQQRLT EGGDLTAATI
AAGAAAGDTG CVELLAAAAR QIGDSLATFV NFFNPSIVII GGGVAQAGNA FLASIRQRVY
SRSLPLATRD LQIVLSSLGD IGGLIGAAHM VVDEILSPKV LSLWIKEGTP EALAGAHAGN