Gene Shewmr4_1791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1791 
Symbol 
ID4252365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2124035 
End bp2127247 
Gene Length3213 bp 
Protein Length1070 aa 
Translation table11 
GC content52% 
IMG OID638118402 
Producthypothetical protein 
Protein accessionYP_733922 
Protein GI113970129 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.684722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAACA CGCCGACATC GCCAGATCAG GAGACTGCCG AGCACGCGCA TCAGGCGATG 
CCTCAAGAAG CGACTGCGGC CGCAGCGCCA GATTCGGCGG CGTACCGCCG CAGCTCTACA
CTCAAAAAGC GCTTAAAACA AAGCCTGATT GCCACCACTG TGGCCGGCCT GTTAGCTGGC
GCGGCGCTAA CCATTTGGAT AATAAACAGT CAAACCGAAG CCCTGATCCT TCGCGTCGCG
AATTATGCCT TAAGCGGCAT GGACGGTGAG CTTAGCGATA TTCGTTTAGG CCCCATGGGC
TTAGAACATT GGCACATTCG CTCTGCGAGC CTGCGTGTAC ATGACTCGCA TTTAGTGATC
AATAATCTGG ATATTCAGCT TGAACTCAAC TGGCCCAAAA GCCTTGAAGA ACTTAAGCAA
CTTGTCCAAG TTGAGAGCTT GACTCAAAAA ATCAAGCGCA TTAGCACGGG CGAGATAGAC
GTTGAACTCG GCGCATCGCT GCTGGAGCGA AGCCCCACCA TCGCCGATGA ACAAACGCCA
GCGTTGGCGT TAAATATCAA ATCCTTACCT TTAATTGATA TAGGTAAAAC CACGCTCAGA
TTAGCGCCGC AGGCAGAGTT TCCTGCCTAT CAACTGGTGA TGGATAAACT CAGCCTGAAT
CATCAAGCTG AATTAACCAC AGCCTTTAGC AGCCCTGAGG GTGAACCGTT AGCCCAGCTT
GCCGCGACGC TTGGCAACGA ACAATGGCGC TTGAAGAGCG AACTGAATAT CGCGCCGCTA
CTGGAAAACC TGCATCAAAT TGGCCTGCGC CAAACCCAAG GGAGCATCCT CAGCCAATTA
ACCCTGTGGG ATCAACAGTG GCAACAACTT GGGATAGGAT TAAGTGGGCA ACTCAGTTCT
GAGAGCACGA TGACACTCGC CAGTGGCGAA ATAACAAGCC ACCACCGCAT TCAGCAGCCC
AGCATCAGCT TAAGCCATTT TGCCGACTTA ACGCTCGCGC CGCAGCCTGC TTTGGGGTTT
GAGCTTAGTG GATCACTTGC TTCACTTAAT CTCACCCTCG AGCCATTTCG TCTTGCGCTC
ACACCCAATG CCGCACAGCA CACGCAGCTA TTAGCGGTGC TTAATCAGTC TCTGCAATTG
AGCGATGAAA ACTCTCAAGC GCTCCTTACC CTGCTGTCGG GGCTCAAAAG CACCGAGGCG
CCCGTGGGCC TTGCCTTTTC GATGACAGCG CCACTGCACT ATGCACTGGC ATCTCAGGCG
AACACCCATG AACCCATAGC GCTGCCCGCA TTCGAGTTAA CCACCCTAGG CAGTAAGCTT
GAGACGCGTA TTAGCTTACA AGATATCCAG TTAACGCCCA CGCCAGATGC TTGGAAGGTC
GCGAGCCGCT GGCAACTGGC GCTTACGCAA ACGACCCCGC TGACACTGCG GGAGCTTTGG
CATGCAGCCC CGCAGGATCT CAGTTGGGGA GCGGGAATGC TGCAAACAGC AGGTCATATC
AGTGTTGCTC AGTCAGCTCA AGGACTGAAT TGGCAAATCA GCACAGCGCC AGTGACGAGT
GACTCGAATG TCTCAAGCGA TACCTTACAA TTTGCACTTG AAGATCTGCA GCTACAGCAA
ACTGCGCAAG CCGCCGAGCA TCAAACTAAA CAAACGCAGT TAAGCCTTGG CAGTATTCAG
CTCAACGCTA AGGCCCCCAT GGCCGCGAGT GCAACCCCGT TAGCGACTAA GGATACGACT
GGCGCGCAAT CGACCGAGTT TGCCTTGAAT CTGCCGCCAT TATCACTCGC CCTGTCGCAC
TTGCGCGTGA GCCAAGCGGT AGAGAATCTT ACTGGCAACA ACAACAGCAA CAGCGCCGCC
GCAGTGCAGA GCAGTCGCAA CGATATTAGC CTCAAGGCGT TCTCCCTTGA GACATCAAAG
GCGATGACCC TCGATTACTC CTCGTTGCAA TCCATTGAAA ATGCTATCCA ATCGAGTCAG
TTGAGTAATC AAGTGAACTG GCAAGCGCAG CAACTCTTGA TTGAAAAGCA GCTCAGCGCC
AAAGGCCGCA CACGTAAACA GACTGTGCTT AAACTGGATA ATTTGGCACT CGCGCAGACG
TTAAACTGGC AAAACCAACG TCTTCACGGC CATGAACAGT GGCAAGTCGG CACGGTTGAG
CTGCAAAGCG ACCATCAATT ACAGTTAGCC GCGCCTCATA AACCACTGCT ATTAACGGGC
CAATGGGTTG TCGATACCAG CATGACAGAA GCGCTATCTT TGCTGAATCA AACCCAGCCC
TTGCCCGCTG AGTTAAATGT GACAGGCCAT AACCAATTAC AGGCACAATT TAAGCTGACA
CATGAGCCAG AACAGACTCA ATTTGCCATG CAAATTACCC AGTCGATGAC AGAGCTGGAA
GGTTTTTATC AAGACACGAC CTTTGAAGGC GGGAAATTAC AGGCCCAATG CGAGTTCACT
TGGGGGCAAT CCTATAAGGC GCCGCAAGCG CCAGGCTATT TCAGCAGTTT AAGCCGACTG
AACTGCCCGC AGACCATGAT GACCTTTAAC TTGTTCGATC CCGGCTTTCC CTTGACCGAT
ATCGAAGTGG AGGCCGATAT TGTCCTCGCC AAGGATGCCG AAAAGCTTCC CGACAATTGG
CTGCAACAAC TCACGGGCTT GAGTGATACC GACGTGTCGA TGACCGCCAA GGGCAAAGTA
TTGAGCGGCC AGTTTTTACT GCCTGATTTT AATTTAAAAT TGCAGGACAA ATCCCACGCC
TATCTATTAT TGCAGGGCAT GAGCCTTGAG GAAGTACTGC GCATTCAGCC GCAAATTGGT
ATCTATGCCG ATGGAATTTT CGATGGTGTA TTACCCGTGG ATTTGGTCGA TGGCAAAGTC
TCGATCAGCG GTGGCCAGTT GGCGGCCCGT GCGCCCGGTG GCCTTATTGC GATTTCAGGC
AATCCGGCGG TCGATCAAAT GCGTCAATCG CAGCCTTATC TCGACTTTGT ATTTTCGGCA
CTAGAACATT TAGAATACAG CCAGTTATCC AGTAGTTTCG ATATGGATCA AACCGGCGAT
GCCAACCTCT TAGTGGAAGT CAAAGGCCGC AGCCGAGGGA TTGAACGCCC CATCCACTTG
AATTACTCTC ATGAAGAGAA CATGCTGCAA TTATTCAGGA GTCTTCAGAT TGGTAACGAT
CTGCAGGACA GAATCGAAAA ATCCGTGAAG TAA
 
Protein sequence
MVNTPTSPDQ ETAEHAHQAM PQEATAAAAP DSAAYRRSST LKKRLKQSLI ATTVAGLLAG 
AALTIWIINS QTEALILRVA NYALSGMDGE LSDIRLGPMG LEHWHIRSAS LRVHDSHLVI
NNLDIQLELN WPKSLEELKQ LVQVESLTQK IKRISTGEID VELGASLLER SPTIADEQTP
ALALNIKSLP LIDIGKTTLR LAPQAEFPAY QLVMDKLSLN HQAELTTAFS SPEGEPLAQL
AATLGNEQWR LKSELNIAPL LENLHQIGLR QTQGSILSQL TLWDQQWQQL GIGLSGQLSS
ESTMTLASGE ITSHHRIQQP SISLSHFADL TLAPQPALGF ELSGSLASLN LTLEPFRLAL
TPNAAQHTQL LAVLNQSLQL SDENSQALLT LLSGLKSTEA PVGLAFSMTA PLHYALASQA
NTHEPIALPA FELTTLGSKL ETRISLQDIQ LTPTPDAWKV ASRWQLALTQ TTPLTLRELW
HAAPQDLSWG AGMLQTAGHI SVAQSAQGLN WQISTAPVTS DSNVSSDTLQ FALEDLQLQQ
TAQAAEHQTK QTQLSLGSIQ LNAKAPMAAS ATPLATKDTT GAQSTEFALN LPPLSLALSH
LRVSQAVENL TGNNNSNSAA AVQSSRNDIS LKAFSLETSK AMTLDYSSLQ SIENAIQSSQ
LSNQVNWQAQ QLLIEKQLSA KGRTRKQTVL KLDNLALAQT LNWQNQRLHG HEQWQVGTVE
LQSDHQLQLA APHKPLLLTG QWVVDTSMTE ALSLLNQTQP LPAELNVTGH NQLQAQFKLT
HEPEQTQFAM QITQSMTELE GFYQDTTFEG GKLQAQCEFT WGQSYKAPQA PGYFSSLSRL
NCPQTMMTFN LFDPGFPLTD IEVEADIVLA KDAEKLPDNW LQQLTGLSDT DVSMTAKGKV
LSGQFLLPDF NLKLQDKSHA YLLLQGMSLE EVLRIQPQIG IYADGIFDGV LPVDLVDGKV
SISGGQLAAR APGGLIAISG NPAVDQMRQS QPYLDFVFSA LEHLEYSQLS SSFDMDQTGD
ANLLVEVKGR SRGIERPIHL NYSHEENMLQ LFRSLQIGND LQDRIEKSVK