Gene Shewmr4_3740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3740 
Symbol 
ID4254303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp4470075 
End bp4472120 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content50% 
IMG OID638120385 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_735860 
Protein GI113972067 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAGGA GACATCTAGT GAAGAAGGCA GGAATTATAC TGCTGAGTCT GCTAATGCCC 
CTCAGCACGG CATTTGCAGA AGCCAGCAAA CCACTCTCAG TCGAATTACT TTGGCAACTC
AAACGCATAG GTAGCCCAGT GGTATCCTCC ACGGGCGAGC ACATTATTGC GCCTGTGACT
GAATACGACC TTAAGGAAGA CAAGGGCTCA ACCCAACTGT GGCGCTTCGA CGGTGAAGGT
AAAAATAACC GTGCGATTAC CGCCAAAGGC TTAAAAGTCA GCGAGCCAGT ATTTTCGCCA
GACGGCAAAA CCTTAGCTTT TATCAGCGAA CGTAACGATG ATGACGCGGG CCAAATTTAC
CTGTTACCTA TGGATGGCCC GGGTGAAGCC AGCAAACTCA CGGATATACC CACCGGCGTC
AATGGCATCA AGTGGGTTGG CAAGCATTTG TATTTTATCA GCAATATCTT CCCTGAGCAG
AACTGGGAGC AAATGAAGGC CCAGCTTAAA ACAGACAAAG ATAATAAAGT CTCCGCCCGC
CAGTGGAATG CGCTGCCCTA TTCGCAGTTC GATCATTGGT TAGATGAGCG CCGTCAGGCC
CATGTATTTA GGATCCCAGC CACAGGCGGC GCGGTTGAAG CCGTGACTCA GCCGTTAGGC
CATGAACTGC CACGCTCAAG CCAAAGCAGC TCAAGTTATG ATATTTCCCC CGATGAACGC
TTAATTGCCT TCAGCACCTA TGGTTCGGAT AATCGCGTCG ACCCTAAGTT AGATCTGTTC
CTCGCCACCA TTGGCGGCAG CAAGGCTGAA AACATCACTC CAGATAACCC AGCGCCGGAT
CTAAACCCAA CATTTAGCCC TAACGGCAAT ACCCTCGCCT TTACTCGCCA AAAAATCCCC
GGATTTTATG CCGATACCGC AAGGTTGATG CTGATGGATG TGAGCAGTCG CAAACTCACC
ACGCTCACCG CGGATTGGGA TCGCTCAGTG TCTCAATTCG AGTGGACACC CGACAGCAAG
GGCTTCTATG CCAGCATTGA TGATGCAGCG ACAAACCGTA TTTATCATAT CGATGCCAAG
AGCGGTAAGG CCAAGGCCAT CACCCAAGCC ACCGACTTTA GCCAGCCAGC TATCGCTAAA
GATGGTCAGT TGATTGCGAC CAATCAGAGC TTTTTGTATC CCGCGCGTTT AGTCAGCATC
AACCCTCGCA ATGGTAAAAC CGAGCGTTTA GAGCAGTTTA ACGACGAAAT TTTAAAGGAT
GTCGATTTAG GCACCTATGA GTCGGTCACC TACAAGGGTT ATCAGGGCCA AGATATTCAA
ATGTGGGTGC ACTATCCACC TGGGTTCGAT CGCAGCAAGA AGTACCCGCT ATTTATGTTG
ATCCACGGTG GCCCACACAA CGCCATTAGC GATGGTTTCC ACTTCCGCTG GAACGCACAA
ACCTTCGCCT CGTGGGGTTA TGTCACCGCC TGGCCAAACT TCCACGGCTC TAGCGGCTTT
GGACAAGAGT TTGCCGATGC CATTAACCCA GATTGGAAAA ACAAATCCCT CGAAGATGTG
CTCAAAGCCG CAGACTGGTT TAAGCAACAA AGTTGGATCG ATAGCGATCG TATGGTCGCC
GGCGGCGCCA GCTATGGCGG CTACTTAACC TCGATTATTT TAGGTCAGCC GCATCCCTTC
AAGGCGCTGC TGATCCACGC AGCGGTGTAC GACATGTACT CACAAATGTC GGCGGACTTT
GCCGTCCACA GCACTCGCTT TGGTAACTAC TGGGATAATC CTGAGCTGTA TAAAGCCATT
TCGCCCCATT ACTTTGCCGC CAACTTTAAC ACTCCAACCT TAGTCAGCCA TGGGCAACTC
GATTATCGCG TACCCGTCGG CCAAGGTTTT GAGCTGTTCC GCACCTTGCA AACACGTAAT
GTCGAATCGC GGATGATTTA TTTTCCCGAT GAAAACCATT GGATCATGAA ACCCAACAAC
TCCATCTATT GGTATAACCA AGTGAAGGAT TGGATGACTC ATTACGCTAA GCCCGGTGCG
CAATAA
 
Protein sequence
MIRRHLVKKA GIILLSLLMP LSTAFAEASK PLSVELLWQL KRIGSPVVSS TGEHIIAPVT 
EYDLKEDKGS TQLWRFDGEG KNNRAITAKG LKVSEPVFSP DGKTLAFISE RNDDDAGQIY
LLPMDGPGEA SKLTDIPTGV NGIKWVGKHL YFISNIFPEQ NWEQMKAQLK TDKDNKVSAR
QWNALPYSQF DHWLDERRQA HVFRIPATGG AVEAVTQPLG HELPRSSQSS SSYDISPDER
LIAFSTYGSD NRVDPKLDLF LATIGGSKAE NITPDNPAPD LNPTFSPNGN TLAFTRQKIP
GFYADTARLM LMDVSSRKLT TLTADWDRSV SQFEWTPDSK GFYASIDDAA TNRIYHIDAK
SGKAKAITQA TDFSQPAIAK DGQLIATNQS FLYPARLVSI NPRNGKTERL EQFNDEILKD
VDLGTYESVT YKGYQGQDIQ MWVHYPPGFD RSKKYPLFML IHGGPHNAIS DGFHFRWNAQ
TFASWGYVTA WPNFHGSSGF GQEFADAINP DWKNKSLEDV LKAADWFKQQ SWIDSDRMVA
GGASYGGYLT SIILGQPHPF KALLIHAAVY DMYSQMSADF AVHSTRFGNY WDNPELYKAI
SPHYFAANFN TPTLVSHGQL DYRVPVGQGF ELFRTLQTRN VESRMIYFPD ENHWIMKPNN
SIYWYNQVKD WMTHYAKPGA Q