Gene Shewmr4_3803 details

Gene Information

Locus tag: Shewmr4_3803
Symbol:
ID: 4254366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 4540190
End bp: 4542127
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 48%
IMG OID: 638120448
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_735923
Protein GI: 113972130
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
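
The coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The Python sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions agree with the listed values but are not stated by the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record: Start bp and End bp.
length = gene_length_bp(4540190, 4542127)
print(length)                     # 1938
print(protein_length_aa(length))  # 645
```

Both results match the Gene Length and Protein Length fields, which supports the inclusive-coordinate reading.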


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.6032
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGGA TTTTACCTTT GTTACTCCTG TGGCTAAGTC TGCCACTGCT TGCTCAAACG 
GTACCACAGC TTCCGGTCGA GGCTTTTGCC AGTATTCCCG ATGTCAGCTC AGTCCAACTG
TCGCCCGATG GCAAAAAAAT TGCCTCTATC GTTCGGGTCG ATCAGCCCAA ACTTAAGGGC
ACTGTGGTCA GTATTCTCGA TTTAGAAACG GGCAACAAAG ACTACGCGAT TCACACCGAT
AACCAGAAGT TTGTCCTACT GTCATTACAG TGGGCAAATG ACACTACTTT GCTGATCAGC
GCTAAGTTTC CGGCAAACCG CTACGGTACA CCGACCACAG AAACTCGCTT GGTGAAGTAC
GATTTAACTA CACGAAAAAC CACGAGCGTA CTCGCCCGTA GCGTCATCGA CCGACTCAGT
TGGATCCCCC AACATCAGGG ACAGATTATC GATATGATGC CGGATGACCC CGATAATATT
TTGCTCTCCC TCGATGGTAT GGGCGAAGAA GTAGGTGAAG ACAGCGTGCT GAGAGTCAAT
CTCTCTCAAG GAAAATCCGC ATTTATTCAA AACTCTAAAC GTAAAATCAT CGGCTGGATT
ACCGACAGAC AGCACAAGGT CCGTATCTCC ATCTATAACG ATGACACTGA ATATAGGATC
TACGAGCAAC CAGAACAAAA GGCAGAGCCA CGCTTACTCT GGACCTTTAA AGCCTTCTCC
GACGAGAGCG TCTGGCCACT GGGCTTTGAT GCCGACCCCA ACATCTTATT CGTACGCGCC
TACCACCAAG GCTTTGAAGC TATCTTCAAG GTGAATTTAA CCGCTCCTAA GCTCACGAAG
GAGCTGGTGT ACGCCAACGA AGATACCGAT GTTGAAGGCA ATCTCATCTA TTCAGAACTC
AAGAAAAAAG TCATTGGGAT CAGTGAAGGC GACGGCGAAG AATATACCTT CTGGGATCCT
GAATATGCGG GTCTGCAAAA TGGGTTGAAA GCGGTATTGC CCAATGCCCA TAACTACATC
ACCCAATTTA GCGCGGATGA ACGCCGCTAT ATCGTCTACT CCACCAGTTC GACTCAACCT
GGCACCTATT ACTTCGGCGA TAGGGATGAA AAGGCGCTGT TTCCGATTGC CGATAGATAT
AGCCAGCTAA GCAGCGAGCA ACTCGCCGAC ACTCAATATC TGAGCTACGA AGCGCGGGAT
AAACTCAAAA TCGATGCTTA CCTGACAGTG CCAAAGGGCC TCGAAGCCAA GCAACTCCCC
ACCATTATCT TCCCCCACGG CGGGCCGATT AGTTACGACA GTAACGATTT CGATTATTGG
TCGCAGTTTT TCGCTAACCG TGGCTACGCG GTGTTTCGAA TGAACTTTAG GGGCTCGGCG
GGCTACGGCT ATGAGTTTAT GAAGGCCGGC CTCAAAAGCT GGGGACTCGA AATGCAAAAC
GATGTGGAAG ATGGTACACG TTACTTAATC AATCAAGGGA TTAGCGATCC ACAGCGTATC
TGTATCGTCG GCGCGAGTTA TGGTGGTTAT GCCGCCTTAA TGGGCGCGGC GATGACACCT
GATCTGTACC GCTGCGCGGT CAGTGTGGCA GGCGTAACAG ACGTGGCTTA CCTGGTAAAA
TCCAGTCGCC GTTTCACCAA TTATGAAGTT GTTAAAGAAC AAATTGGCGA TGATTTTAGC
GCCCTCTATG AACGCTCACC CGTTAGTAAG GCCGATAAGA TTACTATCCC GGTATTACTA
CTGCATGGCA ATAAGGATCG AGTGGTTAAG GTGCAACATA GTCGCGAAAT GTTCGATGAA
CTGAAATCAC GTAAGAAAAA CGTCGAATAT ATTGAGCTCG AAAATGGCGA CCATTACTTA
AGTAACAATG ACCATAGGCT AACCACCTTT AAAGCGCTCG ATAAGTTTTT AGCCGACAAT
CTTAAAACCC AGCTGTAG
 
Protein sequence
MKRILPLLLL WLSLPLLAQT VPQLPVEAFA SIPDVSSVQL SPDGKKIASI VRVDQPKLKG 
TVVSILDLET GNKDYAIHTD NQKFVLLSLQ WANDTTLLIS AKFPANRYGT PTTETRLVKY
DLTTRKTTSV LARSVIDRLS WIPQHQGQII DMMPDDPDNI LLSLDGMGEE VGEDSVLRVN
LSQGKSAFIQ NSKRKIIGWI TDRQHKVRIS IYNDDTEYRI YEQPEQKAEP RLLWTFKAFS
DESVWPLGFD ADPNILFVRA YHQGFEAIFK VNLTAPKLTK ELVYANEDTD VEGNLIYSEL
KKKVIGISEG DGEEYTFWDP EYAGLQNGLK AVLPNAHNYI TQFSADERRY IVYSTSSTQP
GTYYFGDRDE KALFPIADRY SQLSSEQLAD TQYLSYEARD KLKIDAYLTV PKGLEAKQLP
TIIFPHGGPI SYDSNDFDYW SQFFANRGYA VFRMNFRGSA GYGYEFMKAG LKSWGLEMQN
DVEDGTRYLI NQGISDPQRI CIVGASYGGY AALMGAAMTP DLYRCAVSVA GVTDVAYLVK
SSRRFTNYEV VKEQIGDDFS ALYERSPVSK ADKITIPVLL LHGNKDRVVK VQHSREMFDE
LKSRKKNVEY IELENGDHYL SNNDHRLTTF KALDKFLADN LKTQL
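
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any analysis. The Python sketch below shows one way to strip the spacing, compute GC content, and translate the DNA. It is an illustration under stated assumptions, not the method this database uses: the helper names are hypothetical, and the codon table is the standard one, which translation table 11 shares for amino-acid assignments (table 11 differs in the start codons it permits).

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
dna = clean_sequence("ATGAAAAGGA TTTTACCTTT GTTACTCCTG")
print(len(dna))        # 30
print(translate(dna))  # MKRILPLLLL
```

The translated fragment matches the first ten residues of the protein sequence. Running `gc_percent` on the full cleaned gene sequence would give a value to compare with the GC content field.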