Gene Acel_2150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2150 
Symbol 
ID4485618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2432880 
End bp2434016 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content68% 
IMG OID639730952 
Productcell wall hydrolase/autolysin 
Protein accessionYP_873908 
Protein GI117929357 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGGCA CCGTACTGAT GCGCCTTGGG GATCGGAGCT TTGCGGTCGC GGAGATCCGG 
CACCGGCTGG CGCACCTCGG CCTGCTCAGC CGCACAAATG GGGATCACGA CTGTGTGCGC
GCCGCGTGCG ACGTCTTCGA CGAGACGGTC GATCATGCGA TCCGTGCGTT TCAGCAGCAG
CGCGGTCTCC GCACCGATGG AGTGGTGGAC GCCGAGACGT TCCGCGCTCT CGACGAGGCG
AAATGGCGGC TCGGCGATCG GGTGTTGAGC TACGTCCCCG GGCATCCGCT GGTCGGCGAC
GACGTCGCCG CCTTGCAGCG ACGACTCTGC GACATGGGCT TCGATTGCGG GCGGTGCGAC
GGCATCTTCG GTCCGCTCAC CGAGGCGGCG CTTCGGGAGT TCCAGCGCAA CGTGGGCCTG
CCCGCGGACG GCACGTGCGG AGCTGACACC CTGCGTGCGC TCCAGCGGCT GCGCCGGACC
GTGGTCGGCG GCCGCCCCTA TGAACTGCGT GAGACACTCC GGCTGCGTCA CCATCCGCCG
ACCGTCGCCG GCAAATGCGT CGTGCTTGAT CCAGGACACG GCGGACGCGA CACTGGTGCG
CGAACGGCCG ATCTCTGCGA GGCATCGCTG GTCGACGACA TCGCGAATCG CATTGAGGGC
CGACTTCTCG CTGTCGGAGC CCAGCCGTTC CGGACCAGAG CCGCGCAGCA CGTGCTCCAC
CCCGAGGATG TCCCGCCGTC CGATGGCGAC CGGGCCTCAT TCGCCAATGC GGCCGAAGCT
GATGTCGTCG TCTCGCTGCA CATCGACGGG CATCACGATC CGGCGTGCAA CGGCTTTGCG
GTCTACTACT ACGGCACTGC CCGCGAAAGG TCGGTCGTCG GCGAACGCCT CGCCGAACTG
GTCCGGGCGG AAGTCTTGGA GAGAACGGAC TTCTTGGACT GCCGGACCCA TCCCAAGACC
TGGGAGTTGC TCCGGCGGAC CCGCATGCCG GCGGTGCGTG TGGAGTGCGG ATACCTCACC
AACCCGGCGG ATGCCGAGCG GCTCGCAGAC CCCGGTGTCC GGGACCACAT TGCGGAGGGA
ATCGCCTCCG CACTACGCCG CCTTTACGAA GATCCCGGTG AACCGGCAGC CCTGTGA
 
Protein sequence
MQGTVLMRLG DRSFAVAEIR HRLAHLGLLS RTNGDHDCVR AACDVFDETV DHAIRAFQQQ 
RGLRTDGVVD AETFRALDEA KWRLGDRVLS YVPGHPLVGD DVAALQRRLC DMGFDCGRCD
GIFGPLTEAA LREFQRNVGL PADGTCGADT LRALQRLRRT VVGGRPYELR ETLRLRHHPP
TVAGKCVVLD PGHGGRDTGA RTADLCEASL VDDIANRIEG RLLAVGAQPF RTRAAQHVLH
PEDVPPSDGD RASFANAAEA DVVVSLHIDG HHDPACNGFA VYYYGTARER SVVGERLAEL
VRAEVLERTD FLDCRTHPKT WELLRRTRMP AVRVECGYLT NPADAERLAD PGVRDHIAEG
IASALRRLYE DPGEPAAL