Gene Acel_0883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0883 
Symbol 
ID4485715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp975092 
End bp976222 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content63% 
IMG OID639729658 
ProductNitrilase/cyanide hydratase and apolipoprotein N-acyltransferase 
Protein accessionYP_872642 
Protein GI117928091 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.583201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTGGG AGGTGCGGAT GGGTGATGAA TACCCGACCG TGCATGCCGC GGCTGTCCAG 
GCGGCCTCCG TCTTCCTCGA CCGGGAACGG TCGACGCAGA AGGCTTGTCG ACTGATCCGG
GAGGCGGGTC GCGGAGGCGC TGACATCATT GGTTTTCCGG AGGGATTCAT TCCGGCGCAT
CCCATCTGGT TTCACTTCCA CCCCGCAACG GGGTCGATCG CGACCGAGCT GAGCGTCGAA
TTATTCAAGA ACGCCGTTGA AATCCCGGGA CCGGAGGTAG TCGAGCTGCA GCGGGCAGCG
GCTGATGCCC GCGCCTACGT TGTCGTTGGC GTCTGCGAGA AACGCCCCAA TACGTTCGGC
ACGCTGTACA ACAGCCAACT GTTCATCGGA CCGGACGGTA CACTTCTTGG CTGCCGCCGT
AAGATCACGC CCACCGTGGG AGAGCGTCTC GTGCACACCG GCGGCAGCGG GGACGGTTTG
TCGGTGTTCC GGACGGATTT CGGGCCGGCG AGTGCACTCA TCTGCGGAGA GAACTCCAAT
CCGCTGGCCA TTTTTGCGCT GACCGCGCAA TACACCCAGG TGCACGTCAT GAGCTGGCCG
TGTCACTTTC CGACAACCGG CGCCCCGATG CGCGACCGGG TCTCGGTGGA TTCCCGGGCC
TTCGCTCAGA TGACCAAGGC ATACGTCATG AGCTGCTGTG GAACAGTCGA CGAGACCGCT
CTCGCGAAGC TTCGTCTCAG CCCGGACGAC GAGGAACTCA TCCGCCGCCC CGACTTTTGC
GGCGGATCCC TCATCGTCGC ACCGGATGGT CGGGTGATTG CCGGACCACT CGGCAACGAG
GAAGCCATCC TCTACGCGGA TTTGGATCTG GAACTCGGGA TTCGGATGAA ATTGCGTCAC
GATTTCGCCG GGCATTACAA CCGCCCGGAC ATTTTTGAGC TTCGGATCCG CACTGCGGAG
CCTCGACTGC TCACCGTCCG GGACACTGCC GAAAATCCGG TTCTCGAACA GGTCGAGGGC
CCTGCGCGGG CCGAACAAGT TTCTGCACCG GTGCGGTTCG CCGTCGAGCA GGGCGGCCTG
CCGAGCCTAA CCGGTGGTCT CGGGGTAGAC GTTGGCGGTG AGCAGCACTA G
 
Protein sequence
MNWEVRMGDE YPTVHAAAVQ AASVFLDRER STQKACRLIR EAGRGGADII GFPEGFIPAH 
PIWFHFHPAT GSIATELSVE LFKNAVEIPG PEVVELQRAA ADARAYVVVG VCEKRPNTFG
TLYNSQLFIG PDGTLLGCRR KITPTVGERL VHTGGSGDGL SVFRTDFGPA SALICGENSN
PLAIFALTAQ YTQVHVMSWP CHFPTTGAPM RDRVSVDSRA FAQMTKAYVM SCCGTVDETA
LAKLRLSPDD EELIRRPDFC GGSLIVAPDG RVIAGPLGNE EAILYADLDL ELGIRMKLRH
DFAGHYNRPD IFELRIRTAE PRLLTVRDTA ENPVLEQVEG PARAEQVSAP VRFAVEQGGL
PSLTGGLGVD VGGEQH