Gene Information |
Locus tag | Achl_2039 |
Symbol | |
ID | 7293500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 2299572 |
End bp | 2301584 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643590443 |
Product | Peptidyl-dipeptidase Dcp |
Protein accession | YP_002488102 |
Protein GI | 220912793 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
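The coordinates and lengths in the gene record above agree with one another, and a short sketch can check this. It assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon (the listed gene sequence ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2299572, 2301584)
print(length)                  # 2013
print(protein_length(length))  # 670
```

Both values match the Gene Length (2013 bp) and Protein Length (670 aa) fields.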
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000000000153602 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACTAACC CCCTGCTGTC TCCCAGCCCC CTGCCGTACG GGCTCCCGCC CTTCGCCCGC ATCGAGGCCG CCCATTACGC CGAGGCCATC GAAGCGGGCC TGTCTGAGCA CCTCGCTGAA ATAGACCGCA TTGTGCAGAA TCCGGAGGTG CCCACCTTCG CGAACACGGC TGTGGCGATG GAGCAGGCCG GCCGGTTGCT GGACCGTGCA GCCGCATCGT TCTTCACCCT GGTTTCGGCG GATGCTTCGC CGGAGATCAG GGAGCTGGAA ACAAAGTTCT CGCCGCGTTT TTCTGCCCAC CAGGATGAGC TCTACCTGAA CCATGCACTG TATGAACGTT TCCGGGGCAT CGACACCTCC GCCTGCGACC CCGAGTCGGC CCGGCTGGTG GATGAGTACC TGAAGGAGTT CCGGCAAACG GGTATCCAGC TCGATCCCGC CGGACAGGAC CGGCTCAGGG CCGTTAACGC AGAACTCGCC CGGCTGGGCA CGGAGTTCGG CCAGCGCGTC AAGGAGGGGA TGAAGTCCGC GGCGCTGCTT GTCGAGGATG CGGAAGAACT GGCCGGCCTG CCCGCCGACG ACGTCGCCGC CGCAGCGGAG GCGGCCCGGG CAGCAGGACA TGAGGGGCAG TTCCTGCTGG GGCTTATCCA GCCAAGCAAC CAGCCTGCCC TTGCCTCGCT CACGGACCGG GCCGTCCGGC GTCGGCTGTT CGAAGCATCT GCTGCCCGGG GCAGCAACGG CGGACCCCTG GATGTCCTGG ACCTGGCCAG GTCCACGGCC CGGCTGCGGG CAGAGAAAGC CTCGCTTCTC GGTTTCGCGA ATTACGCGGA ACTGGTGGCG GACCGCCAGA CTGCACCCGA CTTCGGGGCA GTTCAGTCCA TGCTGAACCG CATGGCCCCC GCCGCCGTCC GCAACGCGGA CCGTGAAGCG GCCGCCCTTG CCGAATCCGC CGGGCACCCT CTGGAACCGT GGGACTGGGC CTACTACTCG GCGAAAGTCC GCCGGGAGAA GTACAGTGTG GACGAACAGG CCCTCCGCCC CTACTTCGAG CTGGAGCGCG TGCTGCGGGA CGGCGTCTTC TTCGCGGCCG GATCCCTGTA CGGGACCAGC TTCCATGAGC GGGAGGACTT GGTTGGCTAC CATCCCGACG TTCGGGTCTG GGAGGTCAGG GATTCCGACG GCGGCGCGCT GGGGCTGTTC CTGGGCGACT ATTATTCCCG CGAGTCCAAG CGGGGCGGAG CATGGATGAA CTCGCTGGTG GACCAGAACT CCCTGCTGGG TACCAGGCCC GTGGTGATGA ACACCCTCAA CATCGCCAAA CCGGCGCCGG GTGAACCGAC ACTCCTGACG CTTGACGAGG TCCGGACGGT CTTCCACGAG TTCGGCCACG CACTGCACGG GCTCTTTTCC GACGTCACCT ACCCGCGGTT TTCCGGGACG GCTGTCCCCC GCGACTTCGT GGAGTTCCCG TCCCAGGTCA ACGAAATGTG GATCATGTGG CCGGAGGTCC TCACCAACTA CGCCCGCCAC CACGCCACCG GCGAACCGTT GCCGCAGGAC GTTGTGGACC GGCTGGAAGA GTCCAGGCTT TGGGGCGAAG GTTTTGCCAC CACCGAGTAC CTGGGCGCCG CCTTGCTGGA CCTGGCGTGG CATGTGCTGG AACAGGATGC CGTCCCGGAC GATGCGCTCG CGTTTGAGGC GAAGTCCCTC GCTGCGGCGG GAATTGCGCA CGCCCTCATC CCGCCGCGGT ACCGGACCGG TTACTTCCAG CACATTTTCG CCGGTGCGGG ATACGCCGCC 
GGCTACTACT CCTACATTTG GAGCGAGGTC CTGGATGCCG AGACGGTGGA CTGGTTCACG GAGAACGGCG GGCTTACCAG GGCCAACGGC GACCGGTTCC GGCAGGAACT GCTTTCGCGC GGCAACAGCC GCGACCCCCT GGAGTCCTTC CGGATCCTGA GGGGCCGCGA CGCCAAACTG GAACCCCTGC TCAAGCGCCG CGGCCTGGAG TAA
|
Protein sequence | MTNPLLSPSP LPYGLPPFAR IEAAHYAEAI EAGLSEHLAE IDRIVQNPEV PTFANTAVAM EQAGRLLDRA AASFFTLVSA DASPEIRELE TKFSPRFSAH QDELYLNHAL YERFRGIDTS ACDPESARLV DEYLKEFRQT GIQLDPAGQD RLRAVNAELA RLGTEFGQRV KEGMKSAALL VEDAEELAGL PADDVAAAAE AARAAGHEGQ FLLGLIQPSN QPALASLTDR AVRRRLFEAS AARGSNGGPL DVLDLARSTA RLRAEKASLL GFANYAELVA DRQTAPDFGA VQSMLNRMAP AAVRNADREA AALAESAGHP LEPWDWAYYS AKVRREKYSV DEQALRPYFE LERVLRDGVF FAAGSLYGTS FHEREDLVGY HPDVRVWEVR DSDGGALGLF LGDYYSRESK RGGAWMNSLV DQNSLLGTRP VVMNTLNIAK PAPGEPTLLT LDEVRTVFHE FGHALHGLFS DVTYPRFSGT AVPRDFVEFP SQVNEMWIMW PEVLTNYARH HATGEPLPQD VVDRLEESRL WGEGFATTEY LGAALLDLAW HVLEQDAVPD DALAFEAKSL AAAGIAHALI PPRYRTGYFQ HIFAGAGYAA GYYSYIWSEV LDAETVDWFT ENGGLTRANG DRFRQELLSR GNSRDPLESF RILRGRDAKL EPLLKRRGLE
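The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be stripped before any computation. The following sketch shows one way to do that and to compute a GC fraction like the one in the GC content field. The names `clean_sequence` and `gc_content` are hypothetical, and the example runs only on the first two blocks of the gene sequence, so it does not reproduce the 66% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown in the record.
fragment = "ATGACTAACC CCCTGCTGTC"
print(f"{gc_content(fragment):.0%}")  # 55% for this 20-base fragment
```

Passing the complete gene sequence instead of the fragment would give the whole-gene value to compare with the record.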