Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_2362 |
Symbol | |
ID | 7293835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 2652195 |
End bp | 2653247 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643590769 |
Product | peptidase M4 thermolysin |
Protein accession | YP_002488416 |
Protein GI | 220913107 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000000736237 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCACGTTC CGTTCTGCAC CATCGTTCCG CCGTACCTGC TGCGGCGTCT TGCCCGGCAG GACGCGCCGG AGTATTCCGC GGCAGCCAGG GCTGCCAAGG AAGCCCTGGG TCACGTGAAA AGTGTCCAGG CAGCCCGGAT AACCGCCCGG CCAGGGACCC CCACCGGGAT GCTGGCGGCC AAACCAGGCC CGGCAACCAG GACCATCTAT GACGCCCAAG GCGCCGAATC CCTCCCCGGA AAACTGGTGC GGAAGGAAGG CGAACCACAG ACGGGAGACG CCGCTGCCGA TGAAGCCTAT GACGGCCTGG GCCATACCCA CCGCCTCTAC GCGGATGTCT TCGGCCGGGA CTCCATTGAT GGCCGCGGGC TCCCTATGGA CGCCACCGTG CATTTCGGCA GGCTGTATGA CAACGCCTTC TGGGACGGCA AGCAGATGGT CTTTGGTGAC GGTGACGGCG CCGTCTTCCA GCGGTTCACC GCCTCACTGA GCGTTATCGG CCACGAGCTG GCGCACGGGG TCACGCAGTT TTCCGCCGGC CTGGCCTACC GCAACCAGGC CGGAGCCCTG AACGAATCAA TGTCGGATGT CTTCGGCGCG CTCGTGGAGC AGTATGTGAA GGGCCAGTCC GCGGCGGAGG CCAGCTGGCT CATCGGGGAA GGACTCTTCA CCCCGGAGGT GCAGGGGCAG GCGCTGCGGT CCATGAAGGC CCCGGGGACG GCATACGACG ACGATGTGCT GGGCAAGGAT CCCCAGCCCG ATTCCATGGA CGGCTACGTC CGGACCAGCG CTGACAACGG CGGCGTCCAC ATCAACTCAG GCATCCCCAA CAGGGCGTTC TACCTCGTGG CGCAAGCCCT GGGCGGGAAC GCCTGGGATG CTCCGGGACG GATCTGGTAC GAAACGCTGA CCGGGGGCTC GCTCCCCACC ACGGCCACCT TTGCCGTCTT TGCACGCGCC ACGCTCGCAG CGGCCGAGGA ACTTTTTGGG TCCGACAGCA AGGAGCATGA CGCCGTCCGG CAGGCGTGGG AAACTGTGAA GGTCAAGCTT TAA
|
Protein sequence | MHVPFCTIVP PYLLRRLARQ DAPEYSAAAR AAKEALGHVK SVQAARITAR PGTPTGMLAA KPGPATRTIY DAQGAESLPG KLVRKEGEPQ TGDAAADEAY DGLGHTHRLY ADVFGRDSID GRGLPMDATV HFGRLYDNAF WDGKQMVFGD GDGAVFQRFT ASLSVIGHEL AHGVTQFSAG LAYRNQAGAL NESMSDVFGA LVEQYVKGQS AAEASWLIGE GLFTPEVQGQ ALRSMKAPGT AYDDDVLGKD PQPDSMDGYV RTSADNGGVH INSGIPNRAF YLVAQALGGN AWDAPGRIWY ETLTGGSLPT TATFAVFARA TLAAAEELFG SDSKEHDAVR QAWETVKVKL
|
| |