Gene Information |
Locus tag | Acel_1181 |
Symbol | |
ID | 4485069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1313976 |
End bp | 1315115 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639729957 |
Product | peptidase M50 |
Protein accession | YP_872939 |
Protein GI | 117928388 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
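The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1313976, 1315115)
    print(length, protein_length_aa(length))
```

With the values from this record, the functions give 1140 bp and 379 aa, matching the Gene Length and Protein Length rows.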
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.22871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATCCAG CGTCCCCGCC GGAAGCGAGC CGCCCGTCGT CCGGAAGCGG GCTGCGGCTC GGCTACGTCT TCGGCGCGCC GGTCATCGTC GGTCCGAGCT GGCTGCTGTT CTTGGCGTGG ATCACCTGGC AATTCGCGCC CGTCGTCGCT GACGCAGTAC CAGGAATCCG AGAGGGGCGC TACCTCGTCT CAGCCTGCTT CGGGGTGTTT CTCGGCCTCT CGGTGCTCGC GCACGAACTT GCCCACCTCG TCGCGGCCCG GTGGTTCGGG ATCCCGAGCG ACCGCATTGT CCTGACCGCA CTGGCCGGCC ACACTGCGCT GAGCCGCGAA CCCGACCGGC CACGGCGGAT GTTCGTCGTG GCCCTGAGCG GGCCGCTCGC CAACCTGCTC ATCGCGGCAG CCGCGGCGGC CGCCCACCGG GTGCTGCCGG CGCACACGGT CTCCGCGGTG CTCACGGCAG GTCTTGCCTG GACCAACGGC ATCGTCGGTG TCTACAACCT GCTGCCCGGG CTCCCGCTGG ACGGCGGCCA GATGCTGCGC GCACTGGTCT GGGCGATAAC CCGCCGGCCG CGGCTCGGCG TCCTTGTCGG CGCCTGGTCG GGCCGGCTGG TCGGACTGGC GACCGTACTC TTCGGCGCCT ATCTCTTCAC CCGTCGGGAA CCGAACGCCC AGCTCGACGG CCTCTGGGCG TTGCTGATCG GCGGAATGCT GCTCATGTCA TCCGGCGCGG TTCTGCGTCA GCAGGCGCTG CGGGACAAGT TGCCTCAGCT CTCCGCCCGG GCCCTCACCC GCCGGGCGCT TCCGGTCACC GCTGACGTGC CGCTGGCCGA AGCGGTACGC CGCGCGCAGG AGTCGAAGGC CGGGGGACTC GTCATTGTCG ACGGGTATGG CCGGCCCACT GCCGTGGTGA GCGAAGCGGC TGTCGTCGCC ACCCCGGAGT CCCGCCGGCC GTGGGTGAGC GTCGCCACCG TGAGCCGCAC AATTTCGCAC TCCGAGCTCC TTCCGGCGGA TCTCGCCGGC GAAGCCCTGC TGGAGCGGCT GCGGTCCACG CCGGCCTCCG AGTACGTCGT CGTTGACACG GACGGCGCGA TCTACGGCGT CCTCGCCGCG GCGGATGTCG CCGCCGCGCT GCGGGGCTGA
|
Protein sequence | MNPASPPEAS RPSSGSGLRL GYVFGAPVIV GPSWLLFLAW ITWQFAPVVA DAVPGIREGR YLVSACFGVF LGLSVLAHEL AHLVAARWFG IPSDRIVLTA LAGHTALSRE PDRPRRMFVV ALSGPLANLL IAAAAAAAHR VLPAHTVSAV LTAGLAWTNG IVGVYNLLPG LPLDGGQMLR ALVWAITRRP RLGVLVGAWS GRLVGLATVL FGAYLFTRRE PNAQLDGLWA LLIGGMLLMS SGAVLRQQAL RDKLPQLSAR ALTRRALPVT ADVPLAEAVR RAQESKAGGL VIVDGYGRPT AVVSEAAVVA TPESRRPWVS VATVSRTISH SELLPADLAG EALLERLRST PASEYVVVDT DGAIYGVLAA ADVAAALRG
|
| |
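The relationship between the Gene sequence, GC content, and Protein sequence fields can be sketched in a few lines. The example below is an illustration under stated assumptions, not the method used to produce this record: it strips the 10-base grouping spaces, computes the G+C fraction, and translates with the standard codon assignments, which translation table 11 shares with the standard code. Treating ATG, GTG, and TTG as initiation codons that encode methionine in the first position is a simplification of table 11, and all names in the block are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for non-initiator codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a simplified subset of the table 11 initiation codons.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove the grouping whitespace used in the record and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(raw: str) -> str:
    """Translate a CDS, reading an assumed start codon as M and stopping at the first stop."""
    seq = clean_sequence(raw)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ASSUMED_START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon terminates translation
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_30_bases = "GTGAATCCAG CGTCCCCGCC GGAAGCGAGC"
    print(translate_cds(first_30_bases))
```

Applied to the first 30 bases of the gene sequence, the sketch yields MNPASPPEAS, the first ten residues of the protein sequence. This shows why the GTG start codon corresponds to M rather than V in the record.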