Gene Shewmr4_1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1143 
Symbol 
ID4251811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1344148 
End bp1347429 
Gene Length3282 bp 
Protein Length1093 aa 
Translation table11 
GC content52% 
IMG OID638117724 
Productpeptidase S41 
Protein accessionYP_733280 
Protein GI113969487 
COG category[S] Function unknown 
COG ID[COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTC GTCATTCTGT TGCCTGCTGC ATGCTTGCCC TTGGCACTTT GTCCGCAGCC 
CACGCCATCG CTCAACCCGC CAGCAATCAA GGTTATTACC GTGCGCCAGC ATTGCATGAC
CAGACCTTAG TCTTTACGGC CGAAGGTGAT CTTTGGACGC AAACGCTCGG GCAAAAGGCG
GCGACGCGCC TGACCACTCT TCCTGCCGAA GAGCTAGGCG CCGCCATCTC CGCCGATGGT
AAATGGGTGG CCTATGTCGC CAATTATGAG GGCGCGAGTG AGGTTTATGT TATTCCTGTT
GCGGGTGGTG TGGCCAAACG AGTGAGTTTC GAAAATAGCC GAGTGCGCGT GCAAGGCTGG
ACCGCAAAGG GCGAAGTGCT TTATTCCACC GATAGTGGTT TTGGCCCTGC GAACAATTGG
ATGTTGCGAC TGGTGAATCC AGAAACCTTG GCGACGACCG ACTTACCACT CGCCGATGCG
GTGGAAGGGG TGGTCGATGC CAATAATCAA TATGTGTACT TTACCCGTTT TGGCCTGCAG
GTCACCGGCG ATAACGCTAA GGTTTATCGC GGCGGCGCTA AGGGTGAGTT ATGGCGCTTT
AAACTCGGTG GCAAAGACGA GGCACAGTTG CTCAGTGGTC AGCATCAAGG TTCGGTGCGT
CAACCTATGC TATGGCAAGA CAGACTCTAC TTTATCAGCG ACAGCGATGG TAACGACAAT
CTCTGGTCCA TGGCTCTGGA TGGCAGTGAC GCGAAACAGT TGACCCAATA TAAAGATTGG
CAGGTGCGCG GCGCTCGCAT GGACCAAGGT AAAGTCGTGT TCCAGCAGGG CGCAGATATT
CATGTTTTTG ATATCGCCGC TGCCAAAGAC TCATTATTAG ATATTGAATT AAAGTCCGAT
TTCGCCCAAC GCCGCGAACA TTGGGTAAAA GATCCTATGG ATTATGCGAC CTCAGCGAAC
CTTGCGCTTG CGGGCGATAA AGTGGTGATC ACCGCCCGTA GCCATGTGGC GATTGCGGGC
ATTGATGGTT CACGTTTAGT CCAAGTGGCA CTGCCTGGTA CCTATCGCGT GCGCAATGCG
ATTATGAGTC AGGATGGTAA ATCGGTTTAT GCCATCAGTG ACATGAGTGG CCAACAGGAA
ATTTGGCAAT TCCCTGCCGA TGGCAGCAGC GGCGCGAAGC AACTCACCAA AGATGGCCAT
ACCTTAAGAA TGACGCTGAG TCTGTCAAAC GATGGCCGCT ACCTTGCCCA CGATGATAAC
GATGGCAATG TGTGGTTGCT GGATCTGAAG AAAAACTCCA ATCAAAAAAT CATCAGCAAC
GGTGAGGGAC TCGGCCCCTA TGCGGATATT CGCTGGTCGG CAGACAGTCG TTTTATCGCG
TTAACTAAAT CCGAAATCGG TAAGCAAAGG CCACAAATTG TGCTGTATTC AGTGGACGAA
AATAAGGCCC AAGCGCTCAC CAGTGACAAG TATGAGTCCT ATTCGCCGAC CTTTAGTCGC
GATGGCCAGT GGTTATATTT CCTCTCCAAT CGCCAGTTTA CTGCGACACC AAGCTCACCT
TGGGGCGACC GCAATATGGG GCCAGTGTTT GATAAACGCA GCCAGATTTT TGCGATTGCC
TTAGTGAAGA ATGCCAAGTT CCCCTTCAGC AAACCCACAG AGCTGACGGC TAAGACGGCG
GAAAAGGCAG ACTCTAAAGA TAAGCCTACG CCTGTAAAAA TTGATTGGGC GGGAATTGGC
GAGCGTTTAT GGCAAGTGCC TTTCGATTCA GGCAATTACA GCCAGTTAAC TGCCATTGAT
GGCCGTCTCT ATGTGCTCGA CCAAGCCATT GGCGATGACA CCGAGCCTAG CTTAATGACG
ATTAAGTTTA GCGAGCAGCG CCCAAAAGCC GAAGTGTTTG CCGAAGACGT GGCGAATTAC
AGAGTGTCCG CCGATGGTAG CAAGTTGTTG CTGCGCAAAA AAAGCAATGA AAAGTCGCTG
CTGATCGTCG ATGCGGGCGA CAAGTTGGGC GATACCGAAA ATGCCAAGGT GCAAACGGAT
CAGTGGCAAT TAGCGATTTC ACCCACCCTA GAATGGCAAC AAATGTTTGA AGATGCCTGG
TTAATGCACA GGGACTCCTT TTTCGATAAG AAGATGCGCG GCCTCGATTG GCAAGCGACC
AAGGCCAAGT ACCAACCGCT ACTTGATCGT TTGACCGACC GTAACGAGTT AAACGATATC
TTTATGCAGA TGATGGGTGA GCTAGATTCG TTGCACTCGC AAGTGCGGGG TGGCGATCTG
CCCAAAGACC CTGACGCGGC CAAAGGTTCG AGTTTAGGCG CGCGGCTACA ACAAACTAAC
GATGGGGTAA AAATTGCCCA TATCTATCGT AATGATCCTG AACTGCCAAG CCAGGCATCG
CCCTTAAGCC GTATCGAAGT CGATGCGAAA GAAGGCGATA TGTTACTCGC CATCAATGGC
ACACCTGTGA CTAATGTGGC CGATGTGACC CGTTTATTGC GTAATCAGCA GGATAAGCAG
GTGTTGCTTG AGCTTAAACG CGGCGGCCAA AACCATAAAA CCGTGGTGAT GCCGGTTAGC
ACTCAGGTCG ATAGCCAATT ACGTTATTTA GATTGGGTTA ACCACAACGC AGGTGTCGTG
ACCGAGGCGA GTAAGGGCAA GATTGGTTAC CTGCATTTAT ACGCCATGGG CGGCGGCGAT
ATTGAGAGTT TTGCCCGTGA GTTTTACACC AATTACGACA AGGACGGTTT GATTATCGAC
GTGCGTCGTA ACCGTGGTGG CAATATTGAT AGTTGGATCA TCGAAAAACT GTTACGCCGC
GCTTGGGCCT TCTGGCAGCC AACCCATGGC ACGCCTAATA CCAATATGCA GCAAACCTTC
CGTGGCCATT TAGTGGTGTT AACCGACGAG TTGACATACT CAGATGGTGA AACCTTCTCG
GCGGGGATTA AGGCGCTGGG CATTGCTCCG CTCATTGGTA AGCAAACCGC GGGCGCAGGC
GTGTGGTTAT CGGGTCGTAA CTCTCTCACA GATAAAGGTA TGGCGCGGGT CGCCGAATAT
CCGCAATATG CGATGGATGG CCGCTGGGTA CTCGAAGGAC ATGGGGTAAC ACCGGATATC
GAGGTGGATA ACTTACCCTT TGCGACCTTT AATGGCCACG ATGCGCAGCT CGAAACGGCC
ATCAGCTATC TTAAGGATGA GTTGATTAAG CAGCCTATCC CCGCATTGAA GGCGCAGCCT
ATGCCGGCGA AAGGAATGGC AGAAGACATA AAAGCTAAGT AA
 
Protein sequence
MKLRHSVACC MLALGTLSAA HAIAQPASNQ GYYRAPALHD QTLVFTAEGD LWTQTLGQKA 
ATRLTTLPAE ELGAAISADG KWVAYVANYE GASEVYVIPV AGGVAKRVSF ENSRVRVQGW
TAKGEVLYST DSGFGPANNW MLRLVNPETL ATTDLPLADA VEGVVDANNQ YVYFTRFGLQ
VTGDNAKVYR GGAKGELWRF KLGGKDEAQL LSGQHQGSVR QPMLWQDRLY FISDSDGNDN
LWSMALDGSD AKQLTQYKDW QVRGARMDQG KVVFQQGADI HVFDIAAAKD SLLDIELKSD
FAQRREHWVK DPMDYATSAN LALAGDKVVI TARSHVAIAG IDGSRLVQVA LPGTYRVRNA
IMSQDGKSVY AISDMSGQQE IWQFPADGSS GAKQLTKDGH TLRMTLSLSN DGRYLAHDDN
DGNVWLLDLK KNSNQKIISN GEGLGPYADI RWSADSRFIA LTKSEIGKQR PQIVLYSVDE
NKAQALTSDK YESYSPTFSR DGQWLYFLSN RQFTATPSSP WGDRNMGPVF DKRSQIFAIA
LVKNAKFPFS KPTELTAKTA EKADSKDKPT PVKIDWAGIG ERLWQVPFDS GNYSQLTAID
GRLYVLDQAI GDDTEPSLMT IKFSEQRPKA EVFAEDVANY RVSADGSKLL LRKKSNEKSL
LIVDAGDKLG DTENAKVQTD QWQLAISPTL EWQQMFEDAW LMHRDSFFDK KMRGLDWQAT
KAKYQPLLDR LTDRNELNDI FMQMMGELDS LHSQVRGGDL PKDPDAAKGS SLGARLQQTN
DGVKIAHIYR NDPELPSQAS PLSRIEVDAK EGDMLLAING TPVTNVADVT RLLRNQQDKQ
VLLELKRGGQ NHKTVVMPVS TQVDSQLRYL DWVNHNAGVV TEASKGKIGY LHLYAMGGGD
IESFAREFYT NYDKDGLIID VRRNRGGNID SWIIEKLLRR AWAFWQPTHG TPNTNMQQTF
RGHLVVLTDE LTYSDGETFS AGIKALGIAP LIGKQTAGAG VWLSGRNSLT DKGMARVAEY
PQYAMDGRWV LEGHGVTPDI EVDNLPFATF NGHDAQLETA ISYLKDELIK QPIPALKAQP
MPAKGMAEDI KAK