Gene Acel_1377 details

Gene Information

Locus tag: Acel_1377
Symbol:
ID: 4485865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1535584
End bp: 1536624
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 70%
IMG OID: 639730161
Product: ROK family protein
Protein accession: YP_873135
Protein GI: 117928584
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
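
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1535584, 1536624)
print(length_bp)                  # 1041
print(protein_length(length_bp))  # 346
```

Both results agree with the Gene Length and Protein Length fields listed for Acel_1377.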


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.407084
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.525187
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCCC CACCGCAGCC AGCCGGCGAC TGGGTCGTCG TCGGCCTCGA CAACGGGGGC 
ACGGCCAACA ACGCAACTGT CCTCACGGGC GACGGCCGGT TCCTGGTGGA CGCCCTGGTG
GAGAGTCCAA GCCGGGTGAC GGAGGGACCG ACGGCTGCGC TGCAGGCCCT GCTGGCGGCG
TACCACGACA TTCTCGCCCG GACCGGCTGT TCGGAGGGGC AGGTCCGCGC AGTCGGCCTG
GACAGCCCAG GCCCGGCAAG CGCCGACGGC GTGATTTCCC GGGTCGGGGC AACGAATTTC
GGTCATCCGG ATTGGCGGGG ATTCGATTTT CGCGGCGAAC TCGAGAAGCT TCTCGGCGTC
CCAGTGATTT ATCACAATGA CGGCAATGCC GCGGCGCTAT ACGCGCACCG CATGTTCTTC
GGTGACGAGG CGCCGATCCG ATCGTCGGTC TCGGCCATCG TCGGCACCGG TCTGGGCGGC
GGGATCATCG TCTCCGGGGC GGTGATCCGT GGAGCCGCCG GGATGGCCGG CGAACTCGGC
CACGTCCACA TTCCGCTCGA CGGCATTCTG GCCGACGGCC AACCGGTACC CCGGTGCAAC
TGCGGCTTTC GTGCGGACGC CGAGAGCATC GCCAGCCTCA GCGCCATCGA GCGGAACCTG
CTGCCGTTCT GGCTCTCCCG GTACCCCGGC CACGCCCTGG CCGCCCTGCC GATCCGACAG
GCGGCCCGTG AAGTCCGCCG CCTTGCCGAG CAGGGGGATC CGTTGGCGCT CGATATCTTC
CGGCAGCAGG CGGCGGCGAT CGGCCGGCTC TTCACCATCC TGGCGAACGT CATCGACCCG
GACGCCTACT TTATCGGTGG CGGCGTCGTC CAGGCCACCG AACAGTTCCG TGAATGGTTC
CTCGCCCAGG TCAGGGCGGA GACCCGGCTC CGTCCCGAGC AGCAGGAGAC GGCCGCCTTC
GCGCTCACCC CCGACCTGGA CATGGCAGGG GCCCGCGGGG TCGCCATGGC GGCGCGGGAC
GCCGTCCTCG CCGGCCGCTG A
 
Protein sequence
MTPPPQPAGD WVVVGLDNGG TANNATVLTG DGRFLVDALV ESPSRVTEGP TAALQALLAA 
YHDILARTGC SEGQVRAVGL DSPGPASADG VISRVGATNF GHPDWRGFDF RGELEKLLGV
PVIYHNDGNA AALYAHRMFF GDEAPIRSSV SAIVGTGLGG GIIVSGAVIR GAAGMAGELG
HVHIPLDGIL ADGQPVPRCN CGFRADAESI ASLSAIERNL LPFWLSRYPG HALAALPIRQ
AAREVRRLAE QGDPLALDIF RQQAAAIGRL FTILANVIDP DAYFIGGGVV QATEQFREWF
LAQVRAETRL RPEQQETAAF ALTPDLDMAG ARGVAMAARD AVLAGR
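
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch, the code below builds a codon table from the standard NCBI ordering of bases (T, C, A, G) and translates a nucleotide string up to the first stop codon. It relies on table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted); the helper names `clean`, `translate`, and `gc_fraction` are illustrative only. It does not handle ambiguity codes or alternative start codons.

```python
from itertools import product

# Codons enumerated in NCBI order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGACCCCCC CACCGCAGCC AGCCGGCGAC TGGGTCGTCG TCGGCCTCGA CAACGGGGGC"
print(translate(first_line))  # MTPPPQPAGDWVVVGLDNGG
```

The printed residues match the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.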