Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3040 |
Symbol | |
ID | 7294520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 3381730 |
End bp | 3383964 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643591450 |
Product | Oligopeptidase B |
Protein accession | YP_002489090 |
Protein GI | 220913781 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1770] Protease II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.873709 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCCA CTCCGCTGCA GAACTCCGCC AGCACGCCGT CCGCCGCCAC CCCGCCCGTG GCCAAAAAGG TGCCTACCGA AAGGACGCAC CACGGGGACA CCTACGTGGA CAACTATGAG TGGCTGCGGG ACAAGGAGTC CGCCGAGGTG GTGGCGCACC TGAAGGCCGA GAACGCCTAC CAGGAAACGG TCACCGCCCA CCAGGAACCG CTGCGCGAAG CCATCTTCCA GGAGATCAAG GGCCGCACCC AGGAAACCGA CCTCTCGGTG CCGCACCGCA AGGACGGCTG GTGGCACTTC AGCCGCTCGG TGGAGGGCAA GGAGTATGGC ATCCAGTGCC GGGTCCGTGC CAGCGACACC GGTGACAGGA TTGCCGACTG GACGCCGCCG GCCGTAGAGC CCGGCGTCGA AATTCCCGGC GAGGAAGTCC TGCTGGACGG CAACATCGAG GCCGAGGGTA AGCCGTTCTT TTCCGTGGGC GGCACCGCCG TGACAGTGGA CGGCACCCTC TACGCCTACG CCGTGGACAA CTCCGGCGAC GAACGCTTCA CCCTCCGGAT CAAGGACCTC CGCACCGGCG AACTGCTGCC GGACGTCATT GAGAACATCT TCTACGGCGT CAGCTTCTCC CCCGACGGCA CCCGCATTTT CTACACCGTG GTGGATGACT CGTGGCGCCC GTACCAGGTG AAAGCGCACG TCCTGGGCAC CCCGGTCAGC GAGGACACGG TGATCTACCA AGAGGACGAC ACCGCCATGT GGCTCGGCTT CGAGCTCTCC TCGGACCGGC GTTACCTGGT ACTGGGCATC GGCTGCTCCG AGTACAGCGA GACCCGGCTG CTCCGCTTCG ATGATCCCGC GCAGGAGGTC ACCACTGTCA TCTCCCGGGA CGAGCGCGTC CTGTATGAGG CAGAGCCGTT CCTGCTCGAG GGCCCGGACG GTGCCAAGGC CGAGAAGATC CTCCTCACCC ACAACCGCGG CGCCATCAAC TCCATGGTCT CCCTGGCCGA CCCGGCCGAG CTGGCAAAGC CGCTGGCCGA GCAGGCCTGG CAGACCGTCG TCGAACATTC CGACGACGTC CGCGTCAACG GCGCCGGCGT CACCTCCACG CACCTCATCG TCTCCATCCG CAAGGACACT ATCGAGCGCG TCCAGGTGAT GGGACTCGCC GGGCTTGGCA CGGCCGCACA GCAGGAACCG GTGGAGCCGG CGTTTGACGA GGAGCTTTAC ACCGCCGGGG TGGGCGGCTC GGACTATGAG GCACCCGTGA TCCGGCTCGG CTACACGTCC TACTTCACGC CGTCGCGCAT TTACGACTTC GTCCTGCCTA CCGCGGAGCA GCCCGCCGGC GAGCTGCTGC TCCGCAAGGA AAGCCCGGTG CTGGGCGGTT ACGACGGCAG CGACTACGTG GCAACACGGG AATGGGCCAC GGCGGCGGAC GGCACGCGCA TCCCGCTGTC CGTCCTCCGC CACAAAAGCG TCAGGCAGGA TTCGACGGCG GCCGGCGTGG TCTACGGGTA CGGCTCCTAC GAGCTGAGCA TGGATCCGAA CTTCGGCATC GCGCGGCTCT CGCTGCTGGA CCGCGGCGTG GTGTTCGTGA TCGCCCACAT CCGCGGCGGC GGTGAGCTCG GCCGGCACTG GTACGAGGAC GGCAAGAAGC TCACCAAGAA GAACACGTTC ACGGACTTCG TGGACGCCAC GGACTGGCTG GCCAACTCGG GATGGGTGGA CCCTGCACGG ATCGCGGCGC TGGGCGGCTC GGCGGGTGGC CTGCTGATGG GTGCCATTGC CAACATGGCG CCGGAAAAGT ACGCGGCCGT GGTGGCCCAG GTGCCGTTCG TGGACCCGCT CACCAGCATC CTGGACCCGG ACCTGCCGCT CTCGGCCCTG GAGTGGGAGG AGTGGGGCAA CCCGATCACC GACGCCAATG TGTACGCGTA CATGAAGTCC TACTCCCCGT ACGAAAACGT GCGGGAGGTG GCCTATCCCA AGATCGCCGC GGTGACGTCC TTCAACGACA CCCGCGTCCT CTACGTGGAG CCCGCCAAGT GGGTGCAGGA ACTGCGGAAC CGCACCACCG GGTCCGAGCC CATCCTCATG AAGATCGAGA TGGACGGCGG CCACGGCGGC GCGTCCGGCC GGTACGTGCA GTGGCGTGAA CGGGCCTGGG ACTACGCGTT CATCGCCGAC TCCCTCGGCG CCTCGGAACT GCTGCCGGGG GCCGGACTGA AGTAG
|
Protein sequence | MTSTPLQNSA STPSAATPPV AKKVPTERTH HGDTYVDNYE WLRDKESAEV VAHLKAENAY QETVTAHQEP LREAIFQEIK GRTQETDLSV PHRKDGWWHF SRSVEGKEYG IQCRVRASDT GDRIADWTPP AVEPGVEIPG EEVLLDGNIE AEGKPFFSVG GTAVTVDGTL YAYAVDNSGD ERFTLRIKDL RTGELLPDVI ENIFYGVSFS PDGTRIFYTV VDDSWRPYQV KAHVLGTPVS EDTVIYQEDD TAMWLGFELS SDRRYLVLGI GCSEYSETRL LRFDDPAQEV TTVISRDERV LYEAEPFLLE GPDGAKAEKI LLTHNRGAIN SMVSLADPAE LAKPLAEQAW QTVVEHSDDV RVNGAGVTST HLIVSIRKDT IERVQVMGLA GLGTAAQQEP VEPAFDEELY TAGVGGSDYE APVIRLGYTS YFTPSRIYDF VLPTAEQPAG ELLLRKESPV LGGYDGSDYV ATREWATAAD GTRIPLSVLR HKSVRQDSTA AGVVYGYGSY ELSMDPNFGI ARLSLLDRGV VFVIAHIRGG GELGRHWYED GKKLTKKNTF TDFVDATDWL ANSGWVDPAR IAALGGSAGG LLMGAIANMA PEKYAAVVAQ VPFVDPLTSI LDPDLPLSAL EWEEWGNPIT DANVYAYMKS YSPYENVREV AYPKIAAVTS FNDTRVLYVE PAKWVQELRN RTTGSEPILM KIEMDGGHGG ASGRYVQWRE RAWDYAFIAD SLGASELLPG AGLK
|
| |