Gene Achl_2039 details

Gene Information

Locus tag: Achl_2039
Symbol:
ID: 7293500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 2299572
End bp: 2301584
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 66%
IMG OID: 643590443
Product: Peptidyl-dipeptidase Dcp
Protein accession: YP_002488102
Protein GI: 220912793
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
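
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the listed values, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2299572
end_bp = 2301584

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop codon
# and is assumed not to count toward the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```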


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000153602
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACTAACC CCCTGCTGTC TCCCAGCCCC CTGCCGTACG GGCTCCCGCC CTTCGCCCGC 
ATCGAGGCCG CCCATTACGC CGAGGCCATC GAAGCGGGCC TGTCTGAGCA CCTCGCTGAA
ATAGACCGCA TTGTGCAGAA TCCGGAGGTG CCCACCTTCG CGAACACGGC TGTGGCGATG
GAGCAGGCCG GCCGGTTGCT GGACCGTGCA GCCGCATCGT TCTTCACCCT GGTTTCGGCG
GATGCTTCGC CGGAGATCAG GGAGCTGGAA ACAAAGTTCT CGCCGCGTTT TTCTGCCCAC
CAGGATGAGC TCTACCTGAA CCATGCACTG TATGAACGTT TCCGGGGCAT CGACACCTCC
GCCTGCGACC CCGAGTCGGC CCGGCTGGTG GATGAGTACC TGAAGGAGTT CCGGCAAACG
GGTATCCAGC TCGATCCCGC CGGACAGGAC CGGCTCAGGG CCGTTAACGC AGAACTCGCC
CGGCTGGGCA CGGAGTTCGG CCAGCGCGTC AAGGAGGGGA TGAAGTCCGC GGCGCTGCTT
GTCGAGGATG CGGAAGAACT GGCCGGCCTG CCCGCCGACG ACGTCGCCGC CGCAGCGGAG
GCGGCCCGGG CAGCAGGACA TGAGGGGCAG TTCCTGCTGG GGCTTATCCA GCCAAGCAAC
CAGCCTGCCC TTGCCTCGCT CACGGACCGG GCCGTCCGGC GTCGGCTGTT CGAAGCATCT
GCTGCCCGGG GCAGCAACGG CGGACCCCTG GATGTCCTGG ACCTGGCCAG GTCCACGGCC
CGGCTGCGGG CAGAGAAAGC CTCGCTTCTC GGTTTCGCGA ATTACGCGGA ACTGGTGGCG
GACCGCCAGA CTGCACCCGA CTTCGGGGCA GTTCAGTCCA TGCTGAACCG CATGGCCCCC
GCCGCCGTCC GCAACGCGGA CCGTGAAGCG GCCGCCCTTG CCGAATCCGC CGGGCACCCT
CTGGAACCGT GGGACTGGGC CTACTACTCG GCGAAAGTCC GCCGGGAGAA GTACAGTGTG
GACGAACAGG CCCTCCGCCC CTACTTCGAG CTGGAGCGCG TGCTGCGGGA CGGCGTCTTC
TTCGCGGCCG GATCCCTGTA CGGGACCAGC TTCCATGAGC GGGAGGACTT GGTTGGCTAC
CATCCCGACG TTCGGGTCTG GGAGGTCAGG GATTCCGACG GCGGCGCGCT GGGGCTGTTC
CTGGGCGACT ATTATTCCCG CGAGTCCAAG CGGGGCGGAG CATGGATGAA CTCGCTGGTG
GACCAGAACT CCCTGCTGGG TACCAGGCCC GTGGTGATGA ACACCCTCAA CATCGCCAAA
CCGGCGCCGG GTGAACCGAC ACTCCTGACG CTTGACGAGG TCCGGACGGT CTTCCACGAG
TTCGGCCACG CACTGCACGG GCTCTTTTCC GACGTCACCT ACCCGCGGTT TTCCGGGACG
GCTGTCCCCC GCGACTTCGT GGAGTTCCCG TCCCAGGTCA ACGAAATGTG GATCATGTGG
CCGGAGGTCC TCACCAACTA CGCCCGCCAC CACGCCACCG GCGAACCGTT GCCGCAGGAC
GTTGTGGACC GGCTGGAAGA GTCCAGGCTT TGGGGCGAAG GTTTTGCCAC CACCGAGTAC
CTGGGCGCCG CCTTGCTGGA CCTGGCGTGG CATGTGCTGG AACAGGATGC CGTCCCGGAC
GATGCGCTCG CGTTTGAGGC GAAGTCCCTC GCTGCGGCGG GAATTGCGCA CGCCCTCATC
CCGCCGCGGT ACCGGACCGG TTACTTCCAG CACATTTTCG CCGGTGCGGG ATACGCCGCC
GGCTACTACT CCTACATTTG GAGCGAGGTC CTGGATGCCG AGACGGTGGA CTGGTTCACG
GAGAACGGCG GGCTTACCAG GGCCAACGGC GACCGGTTCC GGCAGGAACT GCTTTCGCGC
GGCAACAGCC GCGACCCCCT GGAGTCCTTC CGGATCCTGA GGGGCCGCGA CGCCAAACTG
GAACCCCTGC TCAAGCGCCG CGGCCTGGAG TAA
 
Protein sequence
MTNPLLSPSP LPYGLPPFAR IEAAHYAEAI EAGLSEHLAE IDRIVQNPEV PTFANTAVAM 
EQAGRLLDRA AASFFTLVSA DASPEIRELE TKFSPRFSAH QDELYLNHAL YERFRGIDTS
ACDPESARLV DEYLKEFRQT GIQLDPAGQD RLRAVNAELA RLGTEFGQRV KEGMKSAALL
VEDAEELAGL PADDVAAAAE AARAAGHEGQ FLLGLIQPSN QPALASLTDR AVRRRLFEAS
AARGSNGGPL DVLDLARSTA RLRAEKASLL GFANYAELVA DRQTAPDFGA VQSMLNRMAP
AAVRNADREA AALAESAGHP LEPWDWAYYS AKVRREKYSV DEQALRPYFE LERVLRDGVF
FAAGSLYGTS FHEREDLVGY HPDVRVWEVR DSDGGALGLF LGDYYSRESK RGGAWMNSLV
DQNSLLGTRP VVMNTLNIAK PAPGEPTLLT LDEVRTVFHE FGHALHGLFS DVTYPRFSGT
AVPRDFVEFP SQVNEMWIMW PEVLTNYARH HATGEPLPQD VVDRLEESRL WGEGFATTEY
LGAALLDLAW HVLEQDAVPD DALAFEAKSL AAAGIAHALI PPRYRTGYFQ HIFAGAGYAA
GYYSYIWSEV LDAETVDWFT ENGGLTRANG DRFRQELLSR GNSRDPLESF RILRGRDAKL
EPLLKRRGLE
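
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how the blocks could be joined, translated, and checked for GC content. It assumes the codon assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons are permitted, which this sketch ignores). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
# Assumption: table 11 codon-to-amino-acid assignments equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence above.
first_line = "ATGACTAACC CCCTGCTGTC TCCCAGCCCC CTGCCGTACG GGCTCCCGCC CTTCGCCCGC"
last_line = "GAACCCCTGC TCAAGCGCCG CGGCCTGGAG TAA"

print(translate(first_line))
print(translate(last_line))
```

Each full display line holds 60 bases, which is a whole number of codons, so any single line can be translated on its own and compared with the matching stretch of the protein sequence. Passing the complete gene sequence to `gc_fraction` gives a value to compare with the GC content field.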