Gene EcSMS35_1627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1627 
Symboldcp 
ID6144950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1615980 
End bp1618025 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content50% 
IMG OID641616503 
Productdipeptidyl carboxypeptidase II 
Protein accessionYP_001743681 
Protein GI170681397 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000101595 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAA TGAATCCTTT CCTTGTGCAA AGCACACTGC CGTATCTGGC TCCCCATTTT 
GATCAAATTG CCAATCATCA CTATCGCCCG GCATTCGATG AGGGAATGCA GCAAAAGCGG
GCAGAAATTG CTGCCATCGC GCTTAACCCG CAAACACCTG ATTTCAACAA TACTATTCTG
GCACTGGAAC AAAGCGGAGA ATTACTTACC CGCGTTACCA GCGTCTTTTT TGCGATGACT
GCGGCGCATA CCAATGATGA ATTACAGCGT CTTGATGAAC AGTTTTCCGC TGAACTGGCG
GAACTTGCGA ATGATATCTA TCTGAACGGG GAATTATTCG CGCGGGTAGA TGCTGTCTGG
CAGCGCCGTG AATCCCTGGG GCTTGATAGT GAATCCATCC GCCTGGTGGA GGTGATTCAT
CAACGTTTTG TCCTTGCCGG AGCCAAACTT AAGCAAGCTG ATAAAGCAAA ATTAAAAGTA
CTGAATACAG AAGCTGCGAC CCTGACCAGC CAGTTTAACC AGCGGTTACT GGCAGCAAAT
AAATCCGGCG GTCTGGTTGT GAACGATATC GCGCAGCTGG CAGGAATGAG TGAGCAAGAG
ATTGTGCTGG CGGCAGAGGC GGCTCGCGAG AAAGGTCTGT ATAACAAATG GCTGATTCCG
CTGCTGAATA CCACCCAACA ACCAGCGCTT GCTGAACTGT GCGATCGCGC GACGCGTGAA
AAACTGTTTA CTGCGGGCTG GATGCGAGCG GAAAAAAATG ATGCCAATGA CACCCGCGCT
ATCATTCAAC GTCTGGTGGA GATCCGCGCG CAGCAGGCGA AACTGCTTGG TTTTCCTCAT
TATGCCGCAT GGAAAATCGC CGATCAGATG GCAAAAACGC CAGAAGCAGC GCTCAACTTT
ATGCGGGAAA TTGTTCCAGC GGCGCGTCAA CGTGCGAGCG ATGAGTTAGC CTCCATACAG
GCGATTATTG ATAAGCAGCA GGGCGGGTTT AGCGCGCAGC CGTGGGACTG GGCATTTTAT
GCCGAGCAGG TAAGACGTGA GAAATTCGAT CTCGATGAAT CGCAGCTCAA GCCATATTTT
GAATTAAACA CGGTGTTGAA TGAAGGTGTA TTCTGGACCG CGAATCAGCT CTTCGGTATT
AAGTTTGTCG AACGTTTTGA TATTCCTGTC TACCATCCTG ACGTTCGGGT GTGGGAAATT
TTTGATCATA ATGGCGTGGG GCTGGCGTTA TTTTACGGTG ATTTCTTCGC CCGTGATTCA
AAAAGCGGCG GTGCATGGAT GGGCAATTTT GTTGAGCAAT CAACGCTTAA TGAAACGCAT
CCGGTAATTT ATAACGTCTG CAATTATCAG AAACCCGCTG CCGGTGAGCC TGCGTTGTTA
CTCTGGGATG ATGTAATAAC CTTATTCCAT GAATTTGGTC ATACGCTGCA CGGCCTTTTT
GCCCGCCAGC GTTATGCCAC GCTTTCCGGC ACCAACACGC CGCGTGATTT TGTCGAATTT
CCGTCGCAAA TCAACGAACA CTGGGCAACA CATCCGCAGG TATTCGCTCG CTACGCCCGG
CATTATCAGA GCGGGGCAGC AATGCCTGAC GAACTGCAAC AGAAAATGCG TAATGCCAGC
CTGTTCAACA AAGGGTATGA GATGAGCGAA CTGCTTAGCG CCGCACTTCT CGATATGCGC
TGGCATTGCC TGGAAGAAAA CGAAGCAATG CAGGATGTCG ATGATTTTGA ATTGCGGGCG
CTGGTGGCGG AAAATATGGA TCTTCCCGCT ATACCGCCAC GCTATCGCAG CAGTTATTTC
GCCCATATTT TTGGTGGCGG ATATGCCGCG GGTTATTACG CTTATCTGTG GACGCAAATG
TTGGCCGATG ATGGTTATCA GTGGTTTGTT GAGCAGGGCG GATTAACGCG TGAAAATGGG
CTGCGTTTTC GCGAGGCGAT CCTTTCCAGA GGTAACAGCG AGGATCTGGA ACGCCTGTAT
CGACAATGGC GCGGTAAGGC ACCTCAGATT ATGCCGATGC TGCAACATCG TGGCTTGAAC
GTATAA
 
Protein sequence
MTTMNPFLVQ STLPYLAPHF DQIANHHYRP AFDEGMQQKR AEIAAIALNP QTPDFNNTIL 
ALEQSGELLT RVTSVFFAMT AAHTNDELQR LDEQFSAELA ELANDIYLNG ELFARVDAVW
QRRESLGLDS ESIRLVEVIH QRFVLAGAKL KQADKAKLKV LNTEAATLTS QFNQRLLAAN
KSGGLVVNDI AQLAGMSEQE IVLAAEAARE KGLYNKWLIP LLNTTQQPAL AELCDRATRE
KLFTAGWMRA EKNDANDTRA IIQRLVEIRA QQAKLLGFPH YAAWKIADQM AKTPEAALNF
MREIVPAARQ RASDELASIQ AIIDKQQGGF SAQPWDWAFY AEQVRREKFD LDESQLKPYF
ELNTVLNEGV FWTANQLFGI KFVERFDIPV YHPDVRVWEI FDHNGVGLAL FYGDFFARDS
KSGGAWMGNF VEQSTLNETH PVIYNVCNYQ KPAAGEPALL LWDDVITLFH EFGHTLHGLF
ARQRYATLSG TNTPRDFVEF PSQINEHWAT HPQVFARYAR HYQSGAAMPD ELQQKMRNAS
LFNKGYEMSE LLSAALLDMR WHCLEENEAM QDVDDFELRA LVAENMDLPA IPPRYRSSYF
AHIFGGGYAA GYYAYLWTQM LADDGYQWFV EQGGLTRENG LRFREAILSR GNSEDLERLY
RQWRGKAPQI MPMLQHRGLN V