Gene Acel_1798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1798 
Symbol 
ID4485697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2037482 
End bp2038903 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content69% 
IMG OID639730588 
ProductLacI family transcription regulator 
Protein accessionYP_873556 
Protein GI117929005 
COG category[F] Nucleotide transport and metabolism
[K] Transcription 
COG ID[COG1051] ADP-ribose pyrophosphatase
[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.651793 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAAA TCCCCGCGGT GGGCGGGATT GCCGTTGTCG ACGGCAAGCT CCTGTTGGTG 
CGGCGCGGAC GACCGCCGTC CGCCGGTTCG TGGTCGGTGC CCGGAGGTCG GGTCGAACCG
GGTGAAGACG ACCAGGCCGC CCTGGTCCGC GAATTCCGGG AGGAGACCGG CCTGCTGGTC
AGCGTGAAGG AACTGCTCGG GGAGGTGCGC CGGCCGGGGC CGGCCGGCAC GACATACCGG
ATCCGGGATT ACCGGGTCGA GCTGGTCACC CCTGCCACGG CCGTCGCCGG GGACGACGCC
GCAGACGTTG CGTGGGTCCC GCTGGACGCC GTCGCCCGGT ATCCACTCAG TCCCGGCTTG
CTCCGCGCCC TGCAGCGTTG GGGGATCGTG CCGCCGGCGG CGTACCGGCA TCCGAGCCAC
GGTTCGCCAA CGTTGGAGGA GGTCGCGGCC CACGCCGGGG TGTCGCGCGC GACAGTCTCC
CGCGTGGTCA ACGATTCACC GCGGGTCTCT CCAGCGGTGC GGGAGGCCGT TCTCCGCTCG
ATCGAGGAAC TTGGCTATGT ACCGAACCGC GCTGCGCGCA CCCTGGTCAC CCGACGCACG
GACACGATCG CGTTGGTGAT TTCCGAACCG GAATCGCGGT TGTTCTCCGA CCCGGTGCTG
GCCGGTTTCG TTCGCGGCAT CGCCGATGTC CTGGCCGGTA CCGACTACAT GTTCGTGCTC
CTCACCGCGC AACCGGACAC CGAACGGATC GCCCGCTACA TCCGCAACGG CCATGCGGAC
GGCGTCATCC TGATGTCCTT GCACGGCGAT GACCCGCTCG TCGGCATGCT GGAAGCCCGG
CGGATGCCGG CGGTTCTCTC CGGCCGGCCG CTCGGCCGGG GACACACGAT CCCGTACGTT
GACGCCGACA ACGTCGGTGG AGCGCGGCAA GCGACGGAGT ACCTGGTCCG CCAAGGACGT
CGCACCATCG TCTCCATCAC CGGGCCGATG GAAATGTGCG CGGCGATTGA CCGGCTTGCC
GGATTCCGCA GCGGACTGCC ACCGGAGCTG CGCCGCCGTT GGCGCAGCCT CATCGCCACC
GGAGCGTTCA CCGAGGAGAG CGGCGAACGG GCGATGGCTG AGCTGCTGGA ACGCGTTCCT
GACCTTGACG CCGTTTTCGC CGCCAACGAT TTGATGGCGG CTGGTGCACT CCGGGTGTTG
AAGGCAGCCG GACGACGCGT GCCGGACGAC GTCGCGCTCG TCGGTTTCGA CGATTCCAGC
GCCGCCCGCC ACACCGATCC GCAGTTGACG AGCGTCCGAC AGTCTGCCGA GGAATTGGGA
CAGAACATGG CCAAGCTACT GCTCGTCCAG TTGGCGGATC CCGATGCCCG GCCGGATCCC
GTGATCCTCC CGACCGAGCT CGTCATCCGC GAGTCGGCCT GA
 
Protein sequence
MPEIPAVGGI AVVDGKLLLV RRGRPPSAGS WSVPGGRVEP GEDDQAALVR EFREETGLLV 
SVKELLGEVR RPGPAGTTYR IRDYRVELVT PATAVAGDDA ADVAWVPLDA VARYPLSPGL
LRALQRWGIV PPAAYRHPSH GSPTLEEVAA HAGVSRATVS RVVNDSPRVS PAVREAVLRS
IEELGYVPNR AARTLVTRRT DTIALVISEP ESRLFSDPVL AGFVRGIADV LAGTDYMFVL
LTAQPDTERI ARYIRNGHAD GVILMSLHGD DPLVGMLEAR RMPAVLSGRP LGRGHTIPYV
DADNVGGARQ ATEYLVRQGR RTIVSITGPM EMCAAIDRLA GFRSGLPPEL RRRWRSLIAT
GAFTEESGER AMAELLERVP DLDAVFAAND LMAAGALRVL KAAGRRVPDD VALVGFDDSS
AARHTDPQLT SVRQSAEELG QNMAKLLLVQ LADPDARPDP VILPTELVIR ESA