Gene Shewmr4_0317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_0317 
Symbol 
ID4251931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp344065 
End bp345021 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content49% 
IMG OID638116872 
Productprolyl aminopeptidase 
Protein accessionYP_732454 
Protein GI113968661 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.463031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACTG ACGCCCATCA GCTTGCCCCT TTTATCCGCC GTGATTGGCT GATGATGGGC 
AATGGGCAGC AGTTGCATCT CGCTCAATAT GGTAATCCGC GGGGGATTCC GCTGCTGTAT
TTGCATGGTG GTCCAGGAGC GGGGGCTTCG GTCAGTGAAC TGAGTTTATT TAACCCCGAG
CACTATTGGA TTTTACTTTT AGATCAACGT GGTGCGGGAC AATCACTGCC AGCTGGCGAG
TTGGAACATA ACCATTTAAA TGGGCTTATC TGTGATATTG AGGCGATTCG TATTCACTTA
GGCATAGAGC GTTGGTGCCT AGCGGGTGGC TCATTTGGCG CCACCTTAGC CTTAATTTAC
AGTGGCTTAT TTCCCCATCG AGTGATTGCC CAAGTGCTGT GGGCAATGTT TATTCCCTCC
AAGGCCGGCA TTGAATGGCT CTATGCACCC TCGGGCGCGG CGCAGTTATA TTCGCAGGCT
TACCGTGAAT TTGCTGCACC TTCTATTGGA TTGGCGGATT TATTTACGCA CTATCAGCTA
GGATTGAACG CCCAAGATGA AGTAATTCGT CATGAATTTG CCCGCCGTTG GATCCAATGG
GAATTAATCT TAGCGGGCGT ACCAACTGGC CTACCTAAGC GGCTTACAAC GCCGTTATTG
GCCTTAGCTC AAATCGAGCT GCACTATGCA AAAAATGATT ACTTCAATAT GTTCAGCGTA
TTACAACGGG TGACATCTCA AGTCACAGCG CGTACCCTGC TATTACAAGG CACGCAGGAT
GCTGTCTGTC CGGCACGCTT ATTAGCCGCA TTTTTGGCAA AAATCGACAA CCCTAGGATA
CAAATTCATA CCATTGTCGA TGGTGGGCAT TCGCTGAATA GTGACATACT CTCACTTGCC
GTCACACTCG AGATCCAAGC CATGTGGACT TGGATAAAGC GACAGGAGCT AGCATGA
 
Protein sequence
MITDAHQLAP FIRRDWLMMG NGQQLHLAQY GNPRGIPLLY LHGGPGAGAS VSELSLFNPE 
HYWILLLDQR GAGQSLPAGE LEHNHLNGLI CDIEAIRIHL GIERWCLAGG SFGATLALIY
SGLFPHRVIA QVLWAMFIPS KAGIEWLYAP SGAAQLYSQA YREFAAPSIG LADLFTHYQL
GLNAQDEVIR HEFARRWIQW ELILAGVPTG LPKRLTTPLL ALAQIELHYA KNDYFNMFSV
LQRVTSQVTA RTLLLQGTQD AVCPARLLAA FLAKIDNPRI QIHTIVDGGH SLNSDILSLA
VTLEIQAMWT WIKRQELA