Gene Shewmr4_2123 details

Gene Information

Locus tag: Shewmr4_2123
Symbol:
ID: 4252696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2533569
End bp: 2535632
Gene Length: 2064 bp
Protein Length: 687 aa
Translation table: 11
GC content: 49%
IMG OID: 638118747
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_734253
Protein GI: 113970460
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
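
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 2533569
END_BP = 2535632


def gene_length(start, end):
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


print(gene_length(START_BP, END_BP))                  # 2064
print(protein_length(gene_length(START_BP, END_BP)))  # 687
```

Both results match the Gene Length and Protein Length fields listed in the record.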


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000494152
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATAA CAATGAAAAT AGCTCCTCTC CTATTGGCGC TTGGCGCGGC TGGTTTGGCC 
TGCTCAGCCC ACGCGGCCGA TCCTAAGCCT TTTACTGTGC AACAGCTGGT TAAACTCAAC
AAATTGCATT CAGCCGCGGT GTCCCACGAC GGCACAAAAT TGGTCTACGG CCTGAAAACC
GTTAATGACA AGGGAGAGGC GAGCTCAGAT TTATATATTC TCGATTTGAC GCAAGCGGAC
GCGAAACCGA TGCAGATCAC TTCTGCAGCG GGTACTGAGC ACGATGTCAG CTTCGCAAAC
GATGACAAAT CGATTTATTT CCTCGCCAGC CGCAGTGGTT CAAGCCAACT GTTCCAACTG
CCATTAACGG GCGGTGAAGC GCAGCAAGTT TCTGATCTGC CATTGGATAT TGATGGTTAC
AAACTCTCTA ACGATGGTAA GCAAATCGTG CTCAGCATGC GTGTTTTCCC CGAGTGTAAA
GACTTAGCTT GTTCAAAAGA CAAATTTAAG GCCGAAGAAG AGCGTAAATC GACGGGCCGT
GAATACAAGC AGTTGATGGT GCGTCACTGG GATACCTGGG AAGATCATGC CCGTAACCAC
TTATTTGTGG GTGCCCTTAA TGGCGAGAAG CTGACCAAAG TGGTGGACAT CACCCAAGGT
TTAGACACAG AAACCCCACC TAAGCCATTC TCAGGCATGG AAGAAGTGTC CTTCACTCCT
GATGGCAAAT ATGTGGTGTA CAGCGCCAAA GCGCCAAGCA AAGATCAAGC TTGGACGACA
AACTACGATC TGTGGCAGGT GAGTGTAAAC GGTGGAAAAG CCACTAACTT AACCGCCGAT
AACATCGCTT GGGACGCCCA GCCAATATTC TCAAGCGATG GTCGCTATAT GGCGTACCTC
GCGATGACTA AACCCGGCTT CGAAGCTGAC CGTTACCGCA TTATGCTGCG TGATACTAGC
ACTGGACAGT CGAAGGAAGT GGCACCGCTG TGGGACAGAA GCCCAAGCTC GCTGATGTTT
GCACCAGACA ACCGTACTCT GTATGTGACG GCTCAAGACA TTGGTCAAGT GTCTATTTTC
AAAGTGAATA CTCAGTTTGG TGATGTGCAG TCTGTCTACA GCGACGGCAG CAATAGCCTG
ATTGCGATCG CCGACGATCA ACTGATCTTC GACAGCAAAA CCTTAGTTGA GCCGGGCGAT
CTGTACCGCA TCAACACCGA CGGCCAAGGC CTGAAACGTC TGACTGAAGT TAACAAAGAC
AAACTGGCCG AAATCAAATT CGGTGAATTC CAACAATTTA GCTTTAAGGG TTGGAACAAC
GAAGATGTTT ACGGTTACTG GATCAAACCT GCCAACTACC AAGAAGGCAA AAAGTATCCG
ATTGCATATC TAGTCCACGG TGGTCCGCAG GGGTCATTTG GTAACGCCTT CAGTGGTCGT
TGGAACGCCC AGTTATGGGC TGGCGCGGGT TATGGCGTTG TGATGGTGGA CTTCCACGGT
TCAACTGGTT ACGGCCAAGC CTTTACCGAT TCTATCAGCC AAGATTGGGG TGGTAAGCCA
TTAGAAGACT TACAAAAAGG TCTGGCAGCG GTGAGCCAAC AACAAAAATG GCTCGATCCA
CAAAATGCCT GTGCATTGGG CGGCTCTTAC GGCGGCTACA TGATGAACTG GATCCAAGGC
AACTGGAACG ATGGCTTTAA GTGCCTCGTT AACCACGCGG GTCTGTTCGA TATGCGCTCT
ATGTACTATG TGACCGAAGA AGTATGGTTC CCAGAGCATG AGTTTGGTGG CACTTACTCA
GATAACAAAG CCTTATATGA GAAGTTTAAC CCAGTAAACT ATGTGGAAAA CTGGAAAACG
CCAATGTTGG TTATCCATGG CGAGAAGGAC TTCCGTGTGC CTTATGGTCA AGGTTTAGCC
TCATTTAGCT ATATGCAACG CAAGGGAATT CCATCAGAGC TGCTGATTTT CCCTGATGAA
AACCACTGGA TCTTAAAGCC TGAAAACCTC GAACAATGGT ACGCGAACGT GTTCCGTTGG
ATGGACAGCT GGACGAAAAA GTAA
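
A GC percentage like the one in the GC content field can be computed from a sequence pasted in the blocked layout used above. This is a minimal sketch; it assumes the reported figure is computed over the gene sequence alone, which the record does not state, and `gc_content` is a hypothetical helper name.

```python
def gc_content(dna):
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Small illustrative inputs, not taken from the record.
print(gc_content("ATGC"))       # 50.0
print(gc_content("GGCC AATT"))  # 50.0
```

Passing the full gene sequence to `gc_content` and rounding the result gives a value that can be compared with the GC content field.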
 
Protein sequence
MDITMKIAPL LLALGAAGLA CSAHAADPKP FTVQQLVKLN KLHSAAVSHD GTKLVYGLKT 
VNDKGEASSD LYILDLTQAD AKPMQITSAA GTEHDVSFAN DDKSIYFLAS RSGSSQLFQL
PLTGGEAQQV SDLPLDIDGY KLSNDGKQIV LSMRVFPECK DLACSKDKFK AEEERKSTGR
EYKQLMVRHW DTWEDHARNH LFVGALNGEK LTKVVDITQG LDTETPPKPF SGMEEVSFTP
DGKYVVYSAK APSKDQAWTT NYDLWQVSVN GGKATNLTAD NIAWDAQPIF SSDGRYMAYL
AMTKPGFEAD RYRIMLRDTS TGQSKEVAPL WDRSPSSLMF APDNRTLYVT AQDIGQVSIF
KVNTQFGDVQ SVYSDGSNSL IAIADDQLIF DSKTLVEPGD LYRINTDGQG LKRLTEVNKD
KLAEIKFGEF QQFSFKGWNN EDVYGYWIKP ANYQEGKKYP IAYLVHGGPQ GSFGNAFSGR
WNAQLWAGAG YGVVMVDFHG STGYGQAFTD SISQDWGGKP LEDLQKGLAA VSQQQKWLDP
QNACALGGSY GGYMMNWIQG NWNDGFKCLV NHAGLFDMRS MYYVTEEVWF PEHEFGGTYS
DNKALYEKFN PVNYVENWKT PMLVIHGEKD FRVPYGQGLA SFSYMQRKGI PSELLIFPDE
NHWILKPENL EQWYANVFRW MDSWTKK
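
The protein sequence can be reproduced from the gene sequence with a plain codon-table translation. The sketch below uses the codon assignments of NCBI translation table 11, which are identical to those of the standard code; table 11 differs only in which codons may act as start codons, and since this gene begins with ATG no special handling of the first codon is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
import itertools

# Amino acids in NCBI order for codons built from T, C, A, G (table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGGATATAA CAATGAAAAT AGCTCCTCTC"))  # MDITMKIAPL
print(translate("ATGGACAGCT GGACGAAAAA GTAA"))        # MDSWTKK
```

The two outputs correspond to the first ten and last seven residues of the protein sequence listed above.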