Gene Shewmr4_3271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3271 
Symbol 
ID4253839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3902971 
End bp3904323 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content52% 
IMG OID638119911 
Productpeptidase Do 
Protein accessionYP_735396 
Protein GI113971603 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.137957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000509076 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAACGA AACTATCTGT ACTTTCAGCC GCAATGTTAG CCGCAACTCT GACAATGATG 
CCCGCTGTCT CACAGGCGGC TATTCCGCAA TCTGTTGAGG GTCAATCCAT TCCAAGTCTT
GCGCCTATGT TAGAGCGCAC GACCCCCGCC GTGGTTTCCG TGGCGGTATC TGGCACGCAC
GTCTCCAAAC AACGTGTACC CGATGTGTTC CGTTATTTCT TCGGCCCCAA TGCACCGCAG
GAACAAGTAC AAGAACGCCC TTTTAGAGGC TTAGGCTCAG GCGTAATAAT CGACGCGGAT
AAAGGTTATA TCGTCACCAA CAACCATGTG ATCGACGGTG CCGATGATAT CCAGGTCGGT
CTGCACGATG GCCGTGAAGT CAAAGCCAAG CTGATTGGTA CCGACTCAGA GTCCGACATT
GCCTTGCTGC AAATCGAAGC GAAAAATCTA GTCGCAATCA AAACCTCAGA TTCTGATGAA
CTGCGCGTGG GTGACTTTGC CGTCGCCATT GGTAACCCCT TTGGTTTAGG TCAAACCGTC
ACATCAGGGA TCGTCAGCGC CCTAGGCCGT AGCGGTTTAG GCATTGAAAT GCTTGAAAAC
TTTATCCAAA CCGACGCTGC GATTAACAGC GGCAACTCGG GTGGTGCGCT AGTAAACCTG
AAAGGTGAGC TGATCGGTAT TAACACCGCC ATCGTAGCGC CTGGCGGTGG TAACGTGGGT
ATCGGTTTTG CGATCCCCGC CAATATGGTG AAAAACCTCG TGGCACAAAT TGCCGAGCAC
GGTGAAGTGC GCCGTGGCGT ACTGGGGATT TCGGGCCGCG ACCTAGATAG CCAACTCGCC
CAAGGTTTTG GTTTAGATAC CCAGCACGGT GGGTTTGTGA ATGAAGTTAC CGCGGGCAGC
GCAGCTGAAA AAGCCGGCAT CAAAGCGGGT GATATTATCG TCAGTGTCGA TGGCCGTGCG
ATTAAATCGT TCCAAGAGCT GCGTGCGAAA GTCGCGACTA TGGGCGCTGG TGCTAAGGTT
GAACTGGGAC TTATCCGCGA TGGCGATAAG AAAACCGTGA ATGTCACCTT AGGTGAAGCA
AGCCAAACCA CTGAAAAAGC TGCAGGTGCT GTGCACCCCA TGTTACAAGG TGCCTCGCTA
GAAAACGCTT CGAAAGGGGT GGAAATTACT GAGGTGGCCC AAGGCTCGCC TGCGGCAATG
AGTGGCTTAC AAAAAGGCGA TGTGATTGTC GGTATTAACC GTTCGGCGGT GAAAGATCTG
AAATCACTCA AGGAGCAGCT CAAAGATCAA GAAGGCGCTG TCGCCCTGAA GATCCTCCGT
GGTAAGAGCT TGTTGTACTT AGTGCTGCGT TAA
 
Protein sequence
MKTKLSVLSA AMLAATLTMM PAVSQAAIPQ SVEGQSIPSL APMLERTTPA VVSVAVSGTH 
VSKQRVPDVF RYFFGPNAPQ EQVQERPFRG LGSGVIIDAD KGYIVTNNHV IDGADDIQVG
LHDGREVKAK LIGTDSESDI ALLQIEAKNL VAIKTSDSDE LRVGDFAVAI GNPFGLGQTV
TSGIVSALGR SGLGIEMLEN FIQTDAAINS GNSGGALVNL KGELIGINTA IVAPGGGNVG
IGFAIPANMV KNLVAQIAEH GEVRRGVLGI SGRDLDSQLA QGFGLDTQHG GFVNEVTAGS
AAEKAGIKAG DIIVSVDGRA IKSFQELRAK VATMGAGAKV ELGLIRDGDK KTVNVTLGEA
SQTTEKAAGA VHPMLQGASL ENASKGVEIT EVAQGSPAAM SGLQKGDVIV GINRSAVKDL
KSLKEQLKDQ EGAVALKILR GKSLLYLVLR