Gene Acel_1523 details


Gene Information

Locus tag: Acel_1523
Symbol:
ID: 4486126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1719791
End bp: 1721032
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 67%
IMG OID: 639730307
Product: peptidase M50
Protein accession: YP_873281
Protein GI: 117928730
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0750] Predicted membrane-associated Zn-dependent proteases 1
TIGRFAM ID: [TIGR00054] RIP metalloprotease RseP
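
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1719791, 1721032)
print(length)                     # 1242
print(protein_length_aa(length))  # 413
```

Both values match the Gene Length and Protein Length fields of this record.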


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.36214
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGTGC TCGGCATCAT TGCCTTCGTC GTCGCTTTGC TGATTTCCGT CCTGCTGCAC 
GAGGCGGGGC ACTTCGCGTT CGCTCGGCTG TTCGGCATGA AAGCCACCCA ATTCTTCGTC
GGTTTCGGGC CGACCCTCTG GTCACGCAAG AAGGGCGAGA CCGAATACGG AATCAAAGCA
ATTCCGGCCG GGGGTTTCGT CAAAATCGTC GGCATGACGC CGCTGGAACA CATCGACCCG
GCGGACCGGC CGTGGGCCTT CATCAACCAA CCCGGGCCGC AGCGCCTGGT GGTGCTGGTC
GCCGGTTCGG CGGTGCATTT CGTGATCGGA CTCGTGCTGC TCTTCGTCTT CGCCCTTGCG
TGGCCGACGA AACCCACCGG GTACGCCCAG GTCGCAAAGG TGTACTCCTG CGCAATTCCC
AACGACGCCG GGCAATGCCC GCCGGGCGCC GCCCCGGCAC CTGCGGCGGG GAGACTGCAG
GTCGGTGACG TCATCCTCGC GGTCAACGGC CGCAGCGTCA AAGACACCCC GGCCGTCCTC
CGGAACCCGT CCAACCCGGC GTCCGCACAC CAGGTCACCG GCGGCGCCGA CGGCCTCGTC
GCACTGACCC GATCGACCCA CGGGCCCATC ACCTACACGG TCAAGCGCGG CGACCGTATC
CTCACGCTCA CCTTTCAGCC GGTCATCGGC AGTGACGGCT TGCCGCACAT CGGGTTCGTG
CCCGTCAACG ACTTCACCCG CCAGGGACCG GTCGGCGCGC TGACCTCCGC CGGCCGGATG
TTCGGCACTG CGGTGGTCGA TTCGTTCCGT GCCCTTGGGA CGGTGCCGCA TCAGCTGGCC
GTCCTGCTCA CGAATCCGAA CGCCCAGCGG AGCATCAACT CCGGAGGAGG CCAGGTCACC
AGCGTGGTCG GCGTCGCTCA ACTCACCGGC GAAGCCTTCG CCGCCGAGGG GGCGGGAAAC
GGCATTGCCG TCCTCCTCAC GGTGGTCGCG TCGGTGAACA TCTTTGTCGG CATCTTCAAC
CTGCTGCCGC TCCTGCCGCT GGACGGCGGT CACGTGGCGA TCCTCGGCTA CGAGAAGGCC
CGCGACGCGA TCCGCCGTCT GCGCGGGCGA CCCGCCGGCG GGCCGGTGGA CCTGACCAAG
CTCATGCCGA TCACATACAC CGCGTTAGCC TTGATCGTGG GTATGTCGCT GATCCTGCTG
TACGCGGATC TCGTCAACCC GGTGGCGAAC CCGTTCCAGT GA
 
Protein sequence
MMVLGIIAFV VALLISVLLH EAGHFAFARL FGMKATQFFV GFGPTLWSRK KGETEYGIKA 
IPAGGFVKIV GMTPLEHIDP ADRPWAFINQ PGPQRLVVLV AGSAVHFVIG LVLLFVFALA
WPTKPTGYAQ VAKVYSCAIP NDAGQCPPGA APAPAAGRLQ VGDVILAVNG RSVKDTPAVL
RNPSNPASAH QVTGGADGLV ALTRSTHGPI TYTVKRGDRI LTLTFQPVIG SDGLPHIGFV
PVNDFTRQGP VGALTSAGRM FGTAVVDSFR ALGTVPHQLA VLLTNPNAQR SINSGGGQVT
SVVGVAQLTG EAFAAEGAGN GIAVLLTVVA SVNIFVGIFN LLPLLPLDGG HVAILGYEKA
RDAIRRLRGR PAGGPVDLTK LMPITYTALA LIVGMSLILL YADLVNPVAN PFQ
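
The protein sequence is the conceptual translation of the gene sequence. A minimal sketch of that translation follows. It uses the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not taken from any library.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence in this record.
first_row = "ATGATGGTGC TCGGCATCAT TGCCTTCGTC GTCGCTTTGC TGATTTCCGT CCTGCTGCAC"
print(translate(first_row))  # MMVLGIIAFVVALLISVLLH
```

The output agrees with the first 20 residues of the protein sequence listed above.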