Gene SbBS512_E4317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4317 
SymbolpepQ 
ID6273273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4035752 
End bp4037083 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content53% 
IMG OID641728127 
Productproline dipeptidase 
Protein accessionYP_001882547 
Protein GI187732539 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00026447 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCAC TGGCCTCGCT CTATAAAAAT CATATAGCTA CCTTACAAGA ACGGACTCGC 
GATGCGCTGG CGCGCTTCAA GCTGGATGCA TTACTTATTC ACTCCGGCGA GCTGTTCAAC
GTTTTTCTCG ACGATCATCC CTATCCGTTT AAAGTGAACC CGCAATTCAA AGCGTGGGTG
CCGGTAACTC AGGTGCCAAA CTGCTGGTTG CTGGTGGATG GCGTGAACAA GCCGAAACTG
TGGTTCTATC TGCCGGTTGA TTACTGGCAC AACGTCGAAC CGCTGCCGAC CTCCTTCTGG
ACTGAAGATG TGGAAGTGAT CGCACTGCCG AAAGCCGATG GCATTGGTAG CCTGCTGCCC
GCTGCACGCG GCAATATCGG TTATATCGGT CCGGTGCCGG AACGTGCGCT GCAACTGGGT
ATTGAGGCCA GCAACATCAA CCCGAAAGGG GTGATCGACT ACCTGCATTA CTACCGCTCC
TTCAAAACCG AGTACGAACT GGCCTGTATG CGTGAAGCGC AGAAAATGGC GGTCAACGGT
CATCGTGCGG CAGAAGAAGC GTTCCGTTCT GGCATGAGCG AGTTCGATAT CAACATCGCC
TATCTGACCG CGACCGGTCA TCGTGATACC GACGTACCTT ACAGCAACAT TGTGGCGCTT
AACGAACACG CTTCGGTGCT GCATTACACC AAACTGGATC ATCAGGCACC GGAAGAGATG
CGCAGCTTCC TGCTGGATGC CGGGGCCGAA TATAACGGCT ATGCGGCTGA CCTGACTCGT
ACCTGGTCGG CAAAAAGCGA CAACGACTAC GCACAGCTGG TGAAAGACGT AAATGATGAA
CAACTTGCGC TGATCGCCAC CATGAAAGCT GGCGTCAGCT ATGTGGATTA CCACCTCCAG
TTCCATCAGC GCATTGCCAA ATTGCTGCGT AAACATCAAA TCATCACCGA TATGAGTGAA
GAAGCGATGG TCGAAAACGA TCTCACCGGA CCGTTTATGC CGCACGGTAT CGGCCATCCG
CTGGGCCTGC AGGTGCATGA CGTAGCCGGT TTTATGCAGG ATGATAGCGG TACACACCTC
GCGGCACCGG CAAAATATCC GTACCTGCGC TGCACCCGTA TTCTCCAGCC GGGCATGGTG
TTAACCATCG AACCGGGTAT CTACTTCATC GAATCGCTAC TGGCACCGTG GCGTGAAGGG
CAGTTCAGCA AGCACTTCAA CTGGCAGAAA ATTGAAGCAC TGAAACCGTT CAGCGGCATT
CGTATCGAAG ACAACGTGGT GATCCACGAA AATAACGTGG AAAACATGAC CCGGGATCTG
AAACTGGCGT GA
 
Protein sequence
MESLASLYKN HIATLQERTR DALARFKLDA LLIHSGELFN VFLDDHPYPF KVNPQFKAWV 
PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPTSFW TEDVEVIALP KADGIGSLLP
AARGNIGYIG PVPERALQLG IEASNINPKG VIDYLHYYRS FKTEYELACM REAQKMAVNG
HRAAEEAFRS GMSEFDINIA YLTATGHRDT DVPYSNIVAL NEHASVLHYT KLDHQAPEEM
RSFLLDAGAE YNGYAADLTR TWSAKSDNDY AQLVKDVNDE QLALIATMKA GVSYVDYHLQ
FHQRIAKLLR KHQIITDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDSGTHL
AAPAKYPYLR CTRILQPGMV LTIEPGIYFI ESLLAPWREG QFSKHFNWQK IEALKPFSGI
RIEDNVVIHE NNVENMTRDL KLA