Gene Shewmr4_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3601 
Symbol 
ID4254165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp4306580 
End bp4308565 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content47% 
IMG OID638120244 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_735722 
Protein GI113971929 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACTT TGCCATTTGC CGCGCTGGCT GTAATTTGCT TGCCTATTAT GCCTTTCAGC 
ACATTAGCGG CCGAAACCTC GGCGACAAAT GCCCTGTCAC AATCCCAATT ATTCAGTCGC
GGTAACGAGT ACTCTAACGT CAAGATCTCG CCTACCGGCA AGTACTTAAG TGCAATCACT
AGTGTCGAAG GCAAGAATGT CCTCTTAGTA CTCGATGCCC AAACCAAAAA ACTGCTTAAC
GCAATTCGCT TCCCGAGCAA CGCGCAGGTA GGCACCTATG AATGGGCAAA TAGTGAGCGT
ATTGTACTTG CAAAAGAATA CCTTAAGGGC TGGAGCGATG TGCCCCAATA CTACGGCGAA
TTAATGGCGG TCAATGCCGA TGGTTCTCGC CCTAAATATC TATTTGGATA TAACAGCGGC
GAGCAGCAAA CCGGCTCGAA TATCAAGAAA AATACTCCTA TAAGTGCTAC CGCCTTTATT
CTCGATCCTC TGCCTGATGA CGAGCGTTAT ATGCTGGTCA ATGCGATCCC ATGGGGTGGT
GCCCCAGACT TGAGTGAAAC ACTTCAGGAT GTTTACCGCG TAGACCTTTT TAGTGGGGTT
CGTAAACGCA TCACAGGCTC CCCCATTGGC CGAGCACGCT TTATGACAGA TCATGAAGGT
GAAGTCCGCT TTGTGGCTGG GGAAGATGGC AAAAACATCA CTAAAGTCTT TTACCGCAAA
GATGGCGAAT GGTTAAACAC CGATAAACTC AACTTAGGCT TAAGTGATTT TACACCTATC
TCTTTCGCCG ATAATAAAAA TAGTATTTAC GCCGCAGGCC GAGTGGGCAC TGAAACCTTA
GGTGTCTATC GCATCAATCT CGAAACAGGG GAGAAGGCCG AGATTATTCA AGATGAAGTG
GTCGATCCAA GCAACTTCTG GATAAATGGC ACTAACAAAC AGCTCTATGC CGTTGAGTTT
GAAAATGGCT ATCCAAGCTA TGCCTTTGTC GATAACAACG ATAACCATGC CAAACTGCTT
AAGGATTTAC TCGCGGCCCT GCCGGGGCAT CAAGTACAAA TCGTCAGCGA AACCCGTAAT
GGCGAACAAT TGGTGGTGAT TGCATTTAAC GATCGCAATC CCGGTGATTA CTATTTGTTT
GATACGAAAA AGCTCAAGCT AGAGTATCTG GCCGCCGCCC GTAAGTGGCT CGACCCAGAA
AAAATGGCTG AGGTTAAACC TATTAGTTTC ACTAACCGTG ATGGCCAGAA AATCCATGGC
TACTTAACCT TACCCAATGG AAAAGAAGCC AAAAATTTAC CTTTGGTCGT CAATCCCCAT
GGTGGCCCCC ATGGCATTCG TGACTGGTGG GGGTTTGACC CACAAAACCA ACTACTTGCT
CAAAATGGTA TGGCGGTTTT ACAGGTTAAC TTCCGTGGCT CAGGTGGTTA TGGCGAACGT
TTCGAGCAAG CTGGCTATCA AAAATGGGGC TCGGATATTC AGCACGATAT TATCGATGCC
ACTCAATATG TGATTGACCA AGGCCTTGCC GATAAGGAAC GGGTGTGTAT CGCAGGCGGT
AGCTTTGGCG GCTATAGCGC CTTGCAAAGT GCGGTATTAG CACCCGATAT GTTTAAATGC
GCGGTTGGTT TTGCCGGTGT GTATGATCTG GAATTGATGT TTGATGAAGG TGACGTCGCC
AGAACACGTT CAGGCACAAG CTATCTTAAG GACGTACTTG GCCAAGACAA AGCCACCCTA
AAAGCCATGT CTCCCTCTGA GAACGTAGCA AAATTAAAAG CGAACCTCTT ACTGGTGCAC
GGTGGTGACG ATGAGCGAGC ACCGATTGAG CAACTCGAAT CACTCGAAAA AGCCCTCAAG
GCCCATAATT ATCCCTATCA AAAACTGGTG ATGGATAACG AAGGCCATGG TTTTTATGAT
GATAGCCATA GAGCCAAGTA TTACGATCAG ATGCTAAGCT TCTTAAAAAC CAACCTGAAA
CTTTAG
 
Protein sequence
MKTLPFAALA VICLPIMPFS TLAAETSATN ALSQSQLFSR GNEYSNVKIS PTGKYLSAIT 
SVEGKNVLLV LDAQTKKLLN AIRFPSNAQV GTYEWANSER IVLAKEYLKG WSDVPQYYGE
LMAVNADGSR PKYLFGYNSG EQQTGSNIKK NTPISATAFI LDPLPDDERY MLVNAIPWGG
APDLSETLQD VYRVDLFSGV RKRITGSPIG RARFMTDHEG EVRFVAGEDG KNITKVFYRK
DGEWLNTDKL NLGLSDFTPI SFADNKNSIY AAGRVGTETL GVYRINLETG EKAEIIQDEV
VDPSNFWING TNKQLYAVEF ENGYPSYAFV DNNDNHAKLL KDLLAALPGH QVQIVSETRN
GEQLVVIAFN DRNPGDYYLF DTKKLKLEYL AAARKWLDPE KMAEVKPISF TNRDGQKIHG
YLTLPNGKEA KNLPLVVNPH GGPHGIRDWW GFDPQNQLLA QNGMAVLQVN FRGSGGYGER
FEQAGYQKWG SDIQHDIIDA TQYVIDQGLA DKERVCIAGG SFGGYSALQS AVLAPDMFKC
AVGFAGVYDL ELMFDEGDVA RTRSGTSYLK DVLGQDKATL KAMSPSENVA KLKANLLLVH
GGDDERAPIE QLESLEKALK AHNYPYQKLV MDNEGHGFYD DSHRAKYYDQ MLSFLKTNLK
L