Gene Shewmr4_2589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2589 
Symbol 
ID4253160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3083745 
End bp3085778 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content51% 
IMG OID638119224 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_734717 
Protein GI113970924 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00363766 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGGTTAG GACGCTATTT ACCATATCTG CTGAGTGCAG TTTTACTGCT TGGTGGGTGT 
GAGCGTACGG ATACACAAGA TTCAAGTGCC AAGGAAAACG GCGATATAAA AGTTGCCCCC
TATGGCAGCT GGCAATCACC TCTGTCTGCA GCCGAGGTGT TTGAACGGGC CGATGATATT
GCTGAGCTGC AAAGCGTCGG CAATGCGATT TATTTTGCTG AATCCAGTGG CAGCGCACAG
GGCAAAGTCG GCATTAAGCG CCTCGATGGC GTTGGTAAAG TGACTGAAGT CGTTCCCCCT
GATTTTAATG TCAAATCTAC CGTGCATGAG TATGGTGGCG CGGCATTTTT GGGCATAGGC
CAGAGCCTAT TTGCCACTAA ACTACAGGAT CAGCTGTTTT ACCGTTTCGC CCCGAATCAG
CCGCCATTGC CCTTAACCCC CAATGGTACC CGCCATGCCG ATTGCGTGGC GTACCCTAAG
GGCTCGCGGA TTATTTGTGT GCGTGAAGAC CACCGTCAGG GCGGCGAACC TAAAGCCAGC
TTAGTGACCA TCAATCTTAA CTTTGCTGGT GAAGGCGATA CCTTTGTCAG TGGCCATGAC
TTTATTTCCT CTCCCACGAT TTCCCCCGAT AACACGCAAT TGGCGTGGAT CACCTGGGAA
CATCCCTATA TGCCTTGGGA TAACAGCGTG CTCTGGCTTG GGGATCTTGA CCGCAAAGGC
CAGTTAAAAA ATATTCGTAA GGTGAATACG CCCAAGGATT CTTCGGTGAC TCAGCCCTTA
TTCGGCCCCG ACGGCAACTT ATATGTGGTG TCCGATCTCA GTAACTGGTG GAATATTTAC
CGCGTCACCC CAAAAGAAAC CTTAAAGCCG GTACTGAGTA AAAATGCCGA GTTTGCCGTG
CCCGATTGGC GCTTAGGCAA TCATAACTAT GCCTTTGAAA ATGCCTCGAC CTTGATTGCC
AGCTATGTTG AAGGTAATCA GGCGGCATTG CTGCGTATGC ACTTAGATTC GGGATTGACC
GAATCCCTTG CCGTTGATTT TGCTGAGATT ACTCAGGTGG TGAAGGGCGA AGATGGGGTT
TATTTTGTCG GCGCTAAGGC GACGCCAGAG AAGGGTATTT ATCGGGTTGT CGGCCGTGGC
ACTGAGTTAG TCTATGCGCC AGCGCTGCCG AATCTTGACC CTAACTATGT GTCGCGGGCG
AGAAATATCG CCTTCAATAC GGGCAAAAAT CAGCAGGCTT ACGGTTATTT TTATGGTCCG
GTGAATCCCA ATTACATTGC GCCCCATGAT ACAAGGCCGC CGCTTATCGT GATGTTACAT
GGCGGGCCGA CGGCGCGCGC TTCCCTTGCC TATCGCAGTG AAATCCAGTT TTGGACCAGC
CGTGGGTTTG CGGTGCTGGA TTTAAACTTT CGTGGTAGCA GTGGCTTTGG CCGCGCCTAC
CGCCAGAGCT TATATGGCAA ATGGGGGGAA AGCGATGTGG AAGATGCGGT CAATGCGGCC
AAGTATTTAG TGACTAAGGG CTGGGTCGAT GCGAAAAAAC TGGCGATCCG CGGGATCAGT
GCTGGCGGCT TAACCGCCAT GTCCTCCCTA GCGTTTTACG ATGTGTTTCA GGCGGGGGTG
AGCTATGAGG GGATCAGTGA TTTTGAACAG CTCGCTAAGG GCACCCATAA GTTTGAGTCG
GGCTATTTAG ATCAGCTTAT TGGCCCCTAT CCAGAGCTGA AACAACGTTA TCGCGAGTTA
TCGCCACTCA ATCACTTAAA TGGTTTAAAT GAACCCTTGC TGATTTTCCA AGGTTTGAGA
AACAAGATAG TGCCGACGGC GCAGTCGCGG CAAATTTATG ATGCGCTGAA AGCCAAAGGC
GTACCGACGG CCTATATCGA TTATGGTGAT GATTCCGACG AGGGGCGCAC ACCTGAGCAT
AAAGCCGCGG GGTTAGAGAC CGAGTTAGCC TTCTATGGCC AAGTGTTTAA GTTTACCCCT
GCGGGTAAAC TGCCCAAATT AACCCTAGAT AATGCGATGG CGCTAAAGCA CTAG
 
Protein sequence
MGLGRYLPYL LSAVLLLGGC ERTDTQDSSA KENGDIKVAP YGSWQSPLSA AEVFERADDI 
AELQSVGNAI YFAESSGSAQ GKVGIKRLDG VGKVTEVVPP DFNVKSTVHE YGGAAFLGIG
QSLFATKLQD QLFYRFAPNQ PPLPLTPNGT RHADCVAYPK GSRIICVRED HRQGGEPKAS
LVTINLNFAG EGDTFVSGHD FISSPTISPD NTQLAWITWE HPYMPWDNSV LWLGDLDRKG
QLKNIRKVNT PKDSSVTQPL FGPDGNLYVV SDLSNWWNIY RVTPKETLKP VLSKNAEFAV
PDWRLGNHNY AFENASTLIA SYVEGNQAAL LRMHLDSGLT ESLAVDFAEI TQVVKGEDGV
YFVGAKATPE KGIYRVVGRG TELVYAPALP NLDPNYVSRA RNIAFNTGKN QQAYGYFYGP
VNPNYIAPHD TRPPLIVMLH GGPTARASLA YRSEIQFWTS RGFAVLDLNF RGSSGFGRAY
RQSLYGKWGE SDVEDAVNAA KYLVTKGWVD AKKLAIRGIS AGGLTAMSSL AFYDVFQAGV
SYEGISDFEQ LAKGTHKFES GYLDQLIGPY PELKQRYREL SPLNHLNGLN EPLLIFQGLR
NKIVPTAQSR QIYDALKAKG VPTAYIDYGD DSDEGRTPEH KAAGLETELA FYGQVFKFTP
AGKLPKLTLD NAMALKH