Gene SbBS512_E1702 details

Gene Information

Locus tag: SbBS512_E1702
Symbol: dcp
ID: 6272168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1548752
End bp: 1550797
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 50%
IMG OID: 641725782
Product: dipeptidyl carboxypeptidase II
Protein accession: YP_001880280
Protein GI: 187732722
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
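
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both are assumptions about this record, not statements from the source database. The function names are illustrative.

```python
# Values copied from the Gene Information fields.
START_BP = 1548752
END_BP = 1550797
GENE_LENGTH_BP = 2046
PROTEIN_LENGTH_AA = 681


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2046
    print(expected_protein_length(GENE_LENGTH_BP))  # 681
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.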


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0043073
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACAA TGAATCCTTT CCTTGTGCAA AGCACACTAC CGTATCTGGC TCCCCATTTT 
GATCAAATTG CCAATCATCA CTATCGCCCG GCATTCGATG AGGGAATGCA GCAAAAGCGG
GCAGAAATTG CTGCCATCGC GCTTAACCCG CAAACGCCTG ATTTCAACAA TACTATTCTG
GCACTGGAAC AAAGCGGAGA ATTACTTACC CGCGTTACCA GCGTCTTTTT TGCGATGACT
GCGGCGCATA CCAATGATGA ATTACAGCGT CTTGATGAAC AGTTTTCCGC TGAACTGGCG
GAACTGGCTA ATGATATCTA TCTGAACGGT GAATTATTCG CGCGGGTAGA TGCTGTCTGG
CAGCGCCGTG AATCCCTGGG GCTTGATAGT GAATCCATCC GCCTGGTGGA GGTGATTCAT
CAACGTTTTG TCCTTGCCGG AGCCAAACTT GCGCAAGCTG ATAAAGCAAA ATTAAAAGTA
CTGAATACAG AAGCTGCGAC CCTGACCAGC CAGTTTAACC AGCGATTACT GGCAGCAAAT
AAATCCGGCG GTCTGGTTGT GAACGATATC GCGCAGCTGG CAGGAATGAG TGAGCAAGAG
ATTGCGCTGG CGGCAGAGGC GGCTCGCGAG AAAGGTCTGG ATAACAAATG GCTGATTCCG
CTGCTGAATA CCACCCAACA ACCGGCGCTT GCCGAGATGC GCGATCGTGC GACGCGTGAA
AAACTGTTTA TTGCGGGCTG GACGCGAGCG GAAAAAAATG ATGCCAATGA TACCCGCGCT
ATCATTCAAC GTCTGGTAGA GATTCGCGCA CAGCAGGCGA AACTGCTTGA TTTTCCTCAT
TATGCCGCAT GGAAAATCGC CGATCAGATG GCAAAAACGC CAGAAGCAGC ACTCAACTTT
ATGCGGGAAA TTGTTCCAGC GGCGCGTCAA CGTGCTAGCG ATGAATTAGC CTCCATACAG
GCGGTTATCG ATAAGCAGCA AGGCGGGTTT AGCGCGCAGC CGTGGGACTG GGCATTTTAT
GCCGAACAGG TACGGCGGGA GAAATTTGAT CTTGATGAGG CGCAGCTCAA GCCATATTTT
GAATTAAACA CGGTGTTGAA TGAAGGTGTA TTCTGGACCG CGAATCAGCT CTTCGGTATT
AAGTTTGTCG AACGTTTTGA TATTCCTGTC TACCATCCTG ACGTTCGTGT GTGGGAAATT
TTTGATCATA ATGGCGTGGG ACTGGCGTTA TTTTACGGTG ATTTCTTCGC CCGTGATTCA
AAAAGCGGCG GTGCATGGAT GGGCAATTTT GTTGAGCAAT CAACGCTTAA TGAAACGCAT
CCGGTAATTT ATAACGTTTG CAATTATCAG AAACCCGCTG CCGGTGAGCC TGCGTTGTTA
CTCTGGGATG ATGTCATAAC CTTATTCCAT GAATTTGGTC ATACGCTGCA CGGCCTTTTT
GCCCGCCAGC GTTATGCCAC GCTTTCCGGC ACCAACACGC CGCGTGATTT TGTCGAATTT
CCGTCGCAAA TCAACGAACA CTGGGCAACG CATCCGCAGG TATTCGCTCG CTACGCCCGG
CATTATCAGA GCGGGGCAGC AATGCCTGAC GAACTGCAAC AGAAAATGCG TAATGCCAGC
CTGTTCAACA AAGGGTATGA GATGAGCGAA CTGCTTAGCG CCGCACTTCT CGATATGCGC
TGGCATTGCC TGGAAGAAAA CGAAGCAATG CAGGATGTCG ATGATTTTGA ATTGCGGGCG
CTGGTGGCGG AAAATATGGA TCTTCCTGCT ATACCGCCAC GCTATCGCAG CAGTTATTTC
GCCCATATTT TTGGTGGCGG ATATGCTGCA GGTTATTACG CTTATCTGTG GACGCAAATG
TTGGCCGATG ATGGTTACCA GTGGTTTGTT GAGCAGGGCG GATTAACGCG TGAAAATGGG
CAGCGTTTTC GCGAGGCGAT CCTTTCCAGA GGTAACAGCG AAGATCTGGA ACGCCTGTAT
CGACAATGGC GCGGTAAGGC TCCTCAGATT ATGCCGATGC TGCAACATCG TGGCTTGAAT
ATATAA
 
Protein sequence
MTTMNPFLVQ STLPYLAPHF DQIANHHYRP AFDEGMQQKR AEIAAIALNP QTPDFNNTIL 
ALEQSGELLT RVTSVFFAMT AAHTNDELQR LDEQFSAELA ELANDIYLNG ELFARVDAVW
QRRESLGLDS ESIRLVEVIH QRFVLAGAKL AQADKAKLKV LNTEAATLTS QFNQRLLAAN
KSGGLVVNDI AQLAGMSEQE IALAAEAARE KGLDNKWLIP LLNTTQQPAL AEMRDRATRE
KLFIAGWTRA EKNDANDTRA IIQRLVEIRA QQAKLLDFPH YAAWKIADQM AKTPEAALNF
MREIVPAARQ RASDELASIQ AVIDKQQGGF SAQPWDWAFY AEQVRREKFD LDEAQLKPYF
ELNTVLNEGV FWTANQLFGI KFVERFDIPV YHPDVRVWEI FDHNGVGLAL FYGDFFARDS
KSGGAWMGNF VEQSTLNETH PVIYNVCNYQ KPAAGEPALL LWDDVITLFH EFGHTLHGLF
ARQRYATLSG TNTPRDFVEF PSQINEHWAT HPQVFARYAR HYQSGAAMPD ELQQKMRNAS
LFNKGYEMSE LLSAALLDMR WHCLEENEAM QDVDDFELRA LVAENMDLPA IPPRYRSSYF
AHIFGGGYAA GYYAYLWTQM LADDGYQWFV EQGGLTRENG QRFREAILSR GNSEDLERLY
RQWRGKAPQI MPMLQHRGLN I
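
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks might be joined, checked for GC content, and translated is given below. The helper names are hypothetical. The codon table is the standard genetic code; translation table 11 uses the same amino acid assignments and differs only in which codons may act as start codons, so the sketch does not model alternative starts.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = (
        "ATGACAACAA TGAATCCTTT CCTTGTGCAA AGCACACTAC CGTATCTGGC TCCCCATTTT"
    )
    print(translate(join_blocks(first_line)))  # MTTMNPFLVQSTLPYLAPHF
```

The twenty residues produced from the first line match the start of the protein sequence listed in the record. Passing the full joined gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.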