Gene SO_0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_0022 
SymbolpepQ 
ID1167920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp27069 
End bp28388 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content47% 
IMG OID637342031 
Productproline dipeptidase 
Protein accessionNP_715664 
Protein GI24371622 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCACT TGGCTCATTA CTATCATGCC CACATCGACG AGTTAAATCG TCGAGTCGCT 
GAAATTGTAT CGCGTGAAGC ATTATCAGGT TTAGTGATCC ACTCGGGTCA GCCACACAGA
ATGTTTTTGG ATGACATTAA TTATCCTTTT AAAGCCAACC CTCATTTTAA GGCGTGGTTA
CCTGTGTTGG ATAATCCCAA CTGCTGGTTG GTGGTGAATG GTCGCGATAA GCCACAACTG
ATCTTTTATC GCCCGGTGGA TTTTTGGCAT AAGGTTGCCG ACGTACCGGA TATGTTCTGG
ACCGAGCATT TTGACATCAA GTTGCTCACT AAGGCCGATA AAGTCGCCGA ATTATTGCCT
ACCGATATCG CCAACTGGGC TTATTTGGGT GAGCATTTAG ATGTGGCTGA GGTGCTTGGG
TTTACTAGTC GTAATCCTGA TTCGGTGATG AGTTATCTGC ATTTTCATCG CACCACAAAA
ACCGAATATG AACTTGAATG TATGCGCCGT GCCAACCAAA TTGCAGTACA AGGTCATCAG
GCGGCTAAAA ATGCCTTTTA TAATGGCGCG AGCGAGTTTG AAATTCAGCA GCAGTATTTA
TCCGCTGTGG GACAGGGTGA GAACGAAGTG CCCTATGGCA ATATTATTGC CCTAAACCAA
AATGCGGCGA TTTTGCATTA CACCGCGCTT GAGCATCAAA ATCCTGCGCG CCGTTTGTCA
TTCTTAATTG ATGCAGGTGC CAGTTATTTT GGCTATGCAT CAGATATCAC TCGTACCTAT
GCGTTCGAGA AGAATCGTTT CGATGAGTTG ATCACTGCTA TGAACAAAGC GCAGCTTGAG
CTGATTGATA TGATGCGCCC AGGAGTGCGT TATCCCGATT TACATTTAGC GACCCATGGC
AAAGTGGCGC AAATGCTATT GGATTTTGAC TTGGCAACAG GCGATGCTCA AGGGTTAGTC
GAGCAAGGCA TTACCAGCGC CTTCTTCCCC CACGGATTGG GCCATATGTT AGGCTTGCAA
GTGCATGATG TTGGCGGCTT CGCCTTTGAT GAGCGTGGCA CCCATATTCC TGCCCCTGAG
GCGCATCCAT TTTTGCGTTG CACGCGCATT TTAGCGCCAA ACCAAGTGCT AACTATGGAG
CCAGGTTTGT ACATTATCGA CTCCTTACTC AATGAGTTAA AACAAAATAG CCGTGGCAAG
CAGATCAATT GGAACAGCGT TGATGAGCTG CGACCTTTCG GCGGTATTCG TATTGAGGAT
AATGTGGTTG TGCATCAGGA TAGAAATGAG AATATGACCC GCGAACTGGG CTTAGCGTGA
 
Protein sequence
MDHLAHYYHA HIDELNRRVA EIVSREALSG LVIHSGQPHR MFLDDINYPF KANPHFKAWL 
PVLDNPNCWL VVNGRDKPQL IFYRPVDFWH KVADVPDMFW TEHFDIKLLT KADKVAELLP
TDIANWAYLG EHLDVAEVLG FTSRNPDSVM SYLHFHRTTK TEYELECMRR ANQIAVQGHQ
AAKNAFYNGA SEFEIQQQYL SAVGQGENEV PYGNIIALNQ NAAILHYTAL EHQNPARRLS
FLIDAGASYF GYASDITRTY AFEKNRFDEL ITAMNKAQLE LIDMMRPGVR YPDLHLATHG
KVAQMLLDFD LATGDAQGLV EQGITSAFFP HGLGHMLGLQ VHDVGGFAFD ERGTHIPAPE
AHPFLRCTRI LAPNQVLTME PGLYIIDSLL NELKQNSRGK QINWNSVDEL RPFGGIRIED
NVVVHQDRNE NMTRELGLA