Gene Acel_2096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2096 
Symbol 
ID4485685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2371728 
End bp2372855 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content68% 
IMG OID639730896 
ProductNLP/P60 protein 
Protein accessionYP_873854 
Protein GI117929303 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.685486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.535161 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCACG CCGTAGCTGC TGCCTCGACG ACGTCGAGGG CGCGTCGCGC AGCGATCGGG 
ATGCTGGCTG CTCTCTGCGT GGCCGTGACC CTGGTTGCCG GGTGGCCGGC GGACGCCTTC
GCCGCCAAGC CGAGCACGAC GGCCGCTATT GCGAAGGCGC AGCAGCAGCT CGATCAGCTG
AATATCGCGT CGGAGAAGGC GGTCGAGGCA TACGACCAGG CGCAGAACGC GTTCGCGGCG
GCGTCCGCGC GGTTGGCGGC GCAACAGAAG GCGGTCGCGA ACGCGAAGCT TCAACTGGCT
GCCCTGCAGG AAAAGATCCG CGGTCTAGCC GTCCTGGAGT ATGAGAGCGG AGGCACGACC
CCGCTCATCG CCCTTCTCGT CTCAGGCGAT CCGCAGACGT CATTCCGGCG AGTGGATCTG
CTTGCGCAGG TCAACCGCTA CCACGAAGCA GACCTCGTGG CCGTAGCGGC TGCAGCGCAG
AAGCTGGAAC AGAGTGAGAA GGTCCTGGCG GAAACCGTTG CCGCTCAACG CAAGGCGCTT
GGCGAGGTCA GCGCGAAGAA GGCAGCGGTC GAGAAGTTGA TTGCCAAGCA GCAGGCGTTG
CTCGCGTCCC TGCACGTGAA AGCCCAACAG GAGGACGCAG CGGCGCGGGC CGCTGCTCAG
GCGGCGGCTC AGGCGGCGGC GCGTCAGTAT CTTGCCGCGC CGAGGGTCTC CCGGTCTGCG
CCGCGGGTCC TGGCCGATGC GTCGCAGCCG TCGGACGGAG GCGTGGCGCC TCCTGCGCCG
AGCGGCTCGG GTGCCGCGGC CGCACTTGCG TTCGCGTATG CGCAACTCGG CAAGCCGTAC
GTGTTCGGTG GAGCCGGACC GTACGGGTAC GACTGCTCGG GATTGACGAT GCGGGCCTGG
CAGGCCGCCG GGGTGCAGCT CAGCCACTCC GCGGAAGCCC AGCGGCACGA AGGACGGCCG
ATTCCGCTCT CCGCGGTCCA GCCTGGTGAC CTGATCTTCT GGGGGATTCC GGCCTGGCAC
GTTGCCATCT ACATCGGCGG CGGCCGCGTC ATCACCGCGC CGCATACCGG GACGGTTGTA
CAAATTCAAA GCATTTGGGG ATCCCCAAGC GGCGCGGTTC GTCCGTAG
 
Protein sequence
MPHAVAAAST TSRARRAAIG MLAALCVAVT LVAGWPADAF AAKPSTTAAI AKAQQQLDQL 
NIASEKAVEA YDQAQNAFAA ASARLAAQQK AVANAKLQLA ALQEKIRGLA VLEYESGGTT
PLIALLVSGD PQTSFRRVDL LAQVNRYHEA DLVAVAAAAQ KLEQSEKVLA ETVAAQRKAL
GEVSAKKAAV EKLIAKQQAL LASLHVKAQQ EDAAARAAAQ AAAQAAARQY LAAPRVSRSA
PRVLADASQP SDGGVAPPAP SGSGAAAALA FAYAQLGKPY VFGGAGPYGY DCSGLTMRAW
QAAGVQLSHS AEAQRHEGRP IPLSAVQPGD LIFWGIPAWH VAIYIGGGRV ITAPHTGTVV
QIQSIWGSPS GAVRP