Gene Acel_0719 details


Gene Information

Locus tag: Acel_0719
Symbol:
ID: 4485141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 786795
End bp: 787949
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 66%
IMG OID: 639729489
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_872478
Protein GI: 117927927
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
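The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 786795
END_BP = 787949


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1155
print(protein_length(length_bp))  # 384
```

Both results agree with the Gene Length (1155 bp) and Protein Length (384 aa) fields.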


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.356733
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGAA GATCCCGGCT CGTCGCGGTG AGCGCGGTGG CGGCCGCCGC CAGCCTGGTC 
CTCGCAGCCT GCAGCAGCTC CAAGAGCTCC TCGCCGTCGT CGAGCTCCGG CGCACCGGCC
GCGAACACGA GCGCGGCGTC CGCCAGCGCC GGCGCAGGGG GCGCGGGCGG CGGCAAGATC
CAGGCCGCAC TCATCCTCAA GGAGTTCACC AACCCGTACT GGATCTCGAT GGAGAATGCC
GCGAAGGCCG AGGCGGCGAA GCTCGGCGTG GATCTCCACG TTTCGGCCGG CAAGGCGGAC
GATGACGCCA CGTCCCAGAT TCAGCAGATC GACGCCGCGA TCTCCGCGGG TTACAAGGGC
ATCATCATTG CCATCAACAG CGACGCGGTG AACACCGCCT TGCAACGCGC GAAAGCCGCC
GGGCTCCTGG TCGTGGTCGT CGACACGCCG CCGATCCCGG CGAGCATCGC GGACGTCACC
TACGCGACCG ACAACCTGCA GGCCGGCCTG TTCATCGGCA AGTGGATGGC GCAGAAGCTG
AACGGCGCCA ACGCCGACAT CGCCATGCTG GACGACCTCG CCAATCAGGT GATCACAGTG
GACGTCGACC GGGACCATGG CTTCCTCCAA GGCATGGGCA TCCCGGTCGG CAATCCCAAC
GTGAACGGGC AAGAGCCCAA GTCCGGTCAT TACACCGGGG GGAAGGGCGG CAGCTATAAG
ATCGCATGTC AGCTACCCAC CAACGGTTCC GCGACTGGGG GTCTGTCGGC AATGGAGACC
TGTCTGTCGA AGGACCCGAA CATCAATGTC GTCTACACCA TCAACGAGCC GGCGGCCAAG
GGGGCGGCGC AAGCCCTGAA GAACGCCGGC AAGACCCCCG GCAAAGACGT GACGATCGTG
ACCATCGACG GAAGCTGCAA CTACCTGTCC CTCCTCACCA GTGGGGAGAT CGGAGCGGAC
TCGGGGCAGT TCCCGGGCAA GATGGCACAG CTCGGCGTCG ACGCCATTGC GCAGTTCGCG
AAGACCGGTG CGAAACCGAG CATGCCGGCG GGCAAGGACT TCATCAACAC CGGCGTCCAG
CTGATCACCG CTCAACCGCA ACCAGGGGTG GACAGTGTCA CCCCGGACCA AGCGAAGTCC
AGCTGCTGGG GATGA
 
Protein sequence
MKGRSRLVAV SAVAAAASLV LAACSSSKSS SPSSSSGAPA ANTSAASASA GAGGAGGGKI 
QAALILKEFT NPYWISMENA AKAEAAKLGV DLHVSAGKAD DDATSQIQQI DAAISAGYKG
IIIAINSDAV NTALQRAKAA GLLVVVVDTP PIPASIADVT YATDNLQAGL FIGKWMAQKL
NGANADIAML DDLANQVITV DVDRDHGFLQ GMGIPVGNPN VNGQEPKSGH YTGGKGGSYK
IACQLPTNGS ATGGLSAMET CLSKDPNINV VYTINEPAAK GAAQALKNAG KTPGKDVTIV
TIDGSCNYLS LLTSGEIGAD SGQFPGKMAQ LGVDAIAQFA KTGAKPSMPA GKDFINTGVQ
LITAQPQPGV DSVTPDQAKS SCWG
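The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline the database used: the helper names are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which start codons are permitted). Only the first and last blocks of the gene sequence are translated here; the whole sequence can be pasted in the same way.

```python
# Codon table in the conventional T, C, A, G ordering (assumed to match
# the amino-acid assignments of translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
first_block = "ATGAAAGGAA GATCCCGGCT CGTCGCGGTG"
last_block = "AGCTGCTGGG GATGA"

print(translate(first_block))  # MKGRSRLVAV
print(translate(last_block))   # SCWG
```

The two outputs match the start (MKGRSRLVAV) and end (SCWG) of the protein sequence listed above. Calling `gc_content` on the full gene sequence gives a value that can be compared with the GC content field.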