Gene Shewmr4_2104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2104 
Symbol 
ID4252677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2509396 
End bp2511489 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content49% 
IMG OID638118728 
Productprolyl oligopeptidase 
Protein accessionYP_734234 
Protein GI113970441 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.258358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.949573 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCGTA AATTACTGCC TTGGTGCATC GCAGGGGTAC TCACTATGAG TGGTCAACTC 
CACGCCGAAG AAGACAAATA CCTTTGGCTT GAAGAAGTTG AAGGCGCAAA GCCGATGGAG
TGGGTTAAGG CCCAAAATGC CACTTCTGCC GCTGAAATTA AAGCCTTCAA AGGCTTCGAT
ACCTTAGTAG CCAACAGCCT TGCTATCCTC AATGACAAAG AGCGTATTCC TTACGCCACC
CATATTGGCG ATAAGCTGTA TAACTTCTGG AAGGATGATA CCCATGTGCG GGGGATTTAC
CGTCGCACTA CGATGGAAGA ATACGCCAAG GCGGATCCAA AGTGGGAAAC CGTGCTCGAT
GTCGATGCCT TAGGCAAAAC CGAAGCGGTG AACTGGGTGT TCAAAGGCAT TGATTGCCAG
TATCCACAGA ATCAGCGCTG CTTTGTGTCC TTATCCCGTG GCGGCGCCGA TGCGGTCGAA
GTGCGTGAAT TTGATTTAAC CACCAAAGAC TTTGTGCCTG CGAAAGACAA ACCTTTCTTC
TTAAAAGAAG CGAAATCTAG CCTTAGTTGG ATTAATACCG ATCAGGCATT TGTCGGTACC
GATTTTGGCG ATGGCCAAAG TATGACGGAC TCGGGCTATC CCCGCGTGGT AAAGTTATGG
CAACGTGGTA CGCCGCTTGA GCAAGCGAAA ACCATTTTCA GTGGCGACAA AACTTCGGTC
GCGGTATCGG GTTGGGTGAT ATTTGACGAT AAAACCCCAC TGAGCCTAGT CACCGAGGCG
CATACCTTTT ACACCGCTAC TCAATATGTT TACCAAGACG GCAAACTCAT AAAGCTGCCA
CTCCCACAGG ATGCCGAGAT TAAAGGCTAT TTCCAAGGCA AGTTGTTTAT TGAGCTTAAG
AGTGAGTTAG CAACCCCTGC GGCGACATTC AGCCAAGGCG CAGTGGTGTA CGCTAATGTG
GCGGATTTGA TTGCCCAAAA AGCCGCCTTT ACTGAATTTG TCAGCCCAAC GCCGACCGCA
TCAATCGCCC AGTTAACGTT CAGTAAGAGC GCAATTTTTG TTAATTGGCT CGATAACGTG
AAAAGCAAAC TGGTTCGCTA TGAGCAAGAC GCGAAAGGCG CATGGCAAAG CACGCCCGTA
CCCTTTGAGG CCAATGGCGC GCTAACCGTG ATGGATATGG AGCGTGACAG TGATGATTTC
TTTGTCAATT ACACGAGTTT CCTTGAACCA TCGAGCCTGT ATACCGTCAA TGCTAAGGCG
CTAAAACCGC AAAAAATGAA AGGTATGCCT CAGCAATTTG CGGCGGATAA ATTTACCACC
GAGCAGTATT TTGCAACCTC AAAGGATGGC ACTAAAGTGC CGTATTTTGT GGTGATGGCA
AAAGATCTTA AGCTCGATGG CAGTAATCCA ACACTGCTTT ATGGTTATGG TGGTTTTGAA
GTGTCACTGC GCCCAGCCTA TTCTGCAACC ATTGGTAAAA ACTGGTTAGA GCAGGGCGGC
GTGTATGTGC TGGCGAATAT CCGTGGTGGC GGCGAGTATG GGCCTGCGTG GCATCAAGCG
GCGCTTAAGG AAAATCGTCA TAAGGCTTAC GAGGACTTTG AAGCCATCGC CGAGGATCTC
ATTGCCCGTA AGATTACCTC GAGCAAGCAC TTAGGTATTC AAGGTGGCAG CAATGGTGGT
TTGCTGATGG GCGCCGCTTT TACCCGCCGT CCCGATCTCT ACAATGCCGT GGTCTGCCAA
GTGCCGTTAC TCGACATGTA CCGCTTCAAT AAGTTACTCG CCGGCGCAAG TTGGATGGGG
GAATACGGAA ATCCCGATGT TCCAGAGGAA TGGGCTTACA TTAAAACTTA TTCGCCATAC
CATAATCTGC ACAAGGATAC GCATTATCCA AAGGTGTTCT TCACCACCTC AACCCGTGAT
GACAGGGTTC ACCCAGGACA CGCGCGTAAG ATGGTGGCCA AGATGAAGGA CATGGGTATC
GATGTGCTTT ACTATGAAAA TATCGAAGGT GGGCATGCTG GGGCTGCAGA TAATAATCAA
GCTGCGGAAC TAAATTCAAT GGCGTTTGCC TACTTATTAC AGCAGTTAAA ATAA
 
Protein sequence
MNRKLLPWCI AGVLTMSGQL HAEEDKYLWL EEVEGAKPME WVKAQNATSA AEIKAFKGFD 
TLVANSLAIL NDKERIPYAT HIGDKLYNFW KDDTHVRGIY RRTTMEEYAK ADPKWETVLD
VDALGKTEAV NWVFKGIDCQ YPQNQRCFVS LSRGGADAVE VREFDLTTKD FVPAKDKPFF
LKEAKSSLSW INTDQAFVGT DFGDGQSMTD SGYPRVVKLW QRGTPLEQAK TIFSGDKTSV
AVSGWVIFDD KTPLSLVTEA HTFYTATQYV YQDGKLIKLP LPQDAEIKGY FQGKLFIELK
SELATPAATF SQGAVVYANV ADLIAQKAAF TEFVSPTPTA SIAQLTFSKS AIFVNWLDNV
KSKLVRYEQD AKGAWQSTPV PFEANGALTV MDMERDSDDF FVNYTSFLEP SSLYTVNAKA
LKPQKMKGMP QQFAADKFTT EQYFATSKDG TKVPYFVVMA KDLKLDGSNP TLLYGYGGFE
VSLRPAYSAT IGKNWLEQGG VYVLANIRGG GEYGPAWHQA ALKENRHKAY EDFEAIAEDL
IARKITSSKH LGIQGGSNGG LLMGAAFTRR PDLYNAVVCQ VPLLDMYRFN KLLAGASWMG
EYGNPDVPEE WAYIKTYSPY HNLHKDTHYP KVFFTTSTRD DRVHPGHARK MVAKMKDMGI
DVLYYENIEG GHAGAADNNQ AAELNSMAFA YLLQQLK