Gene Shewmr4_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1049 
Symbol 
ID4251122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1224605 
End bp1227658 
Gene Length3054 bp 
Protein Length1017 aa 
Translation table11 
GC content53% 
IMG OID638117622 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_733186 
Protein GI113969393 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR02956] TMAO reductase sytem sensor TorS 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCCTT TATCTTTATC GCGAAGTAGT TTAACCGGCA GACTCATGCT CGCCTTCGGT 
GTGCTTGGGG TATTGCTGTT GCTACTGGTT AGCTTAGGTA GCCTGAGTTT GCATTGGCTC
AAACAGGCCG ACAGTTATCT GTATGAAGAA TCCTTACCCG CCTCCCAAGC CGCGCGCCAA
TTAGTGCAGG CTTCTAATGC CTTATCCGAT GGTGTGACCC AATTAGGCCA AGTGGAGGAC
GAGCGGCAGC GGGAATTTAT CGGCCGTAAG CTCAGCCTTG AAAGCGCCAC AATGTTAAAT
AGCATCAAAG TGTTACTAAC ATTGGATGTG AACGAAGATA ATCATTTATC CGTATTAGCA
GGGCAAATCA TCGAGCAACT AACCTTATTG GGTAAGAGTG TCGGCCAGCG CATTACCTTA
GGCGATGAGT TACAGCTTAG GGCCCGCGAA TTATCGGTTG CCGCAGGTCA TGCCTCTGAA
TTATTGCAAT CCGAATTGGC GGTCGTCGAC TCGGCGATTT TGGCAAAACT CAGCCAATCC
TATCCGGATA TGGCGGGGGA TAAACGCAGT GGCCAATTGC TCGATGATGT GATTGAGCGA
GAGTTAGATG TGCAGGAGCA GCTCAACCGC GCACTCAAGT TAGTGCATCA AATTGCGCTG
CTCAGTCAGT TATTTGAGGC ATCAGAATTA CAGTCCGAGC TACAGTTGAG TGTTCCCCGT
TTATTGGCCA CCTTTGCCAG CACTACGCCT TCCCATCCGG CGGCGCAGAG TAAACCCGTA
CTCGGAACGG CAATCGAGCA GGCATCGAGC GCGGTCACAC AGCCGCCAAG GGGTGATGTT
GGCATCGATT TAATGGCGCT CACCACAGTA TCTGAGCTTA TTCGCGATCC CGGCAGGCTC
AAGGCGCTAA AGGCGGAGCT GGCAATCCTT CTTCATACCC CAAAAATCAT TAAATTACAG
CGAGAACTGA GCCAAAGCCT GCAGCGCCAA CAGCGGCAGC AACAGGAGTT AGCGGAAAAA
CTCTATAGCC TCAACACGCT GGTGGACAGT GCGCTTAATC AACAGCAGCA GCGGGCAGAG
CTTGCACGTA GCGATTATTT AATGCAGTTG TCCTACGCCC GTTTGGGGCT GTGGGGCACG
GGCATCTTAA TGTTGGTGAT CATCGGTGTG GTGGTTTACC GGGTGATCTA TCGCGGCATT
GCCCTTAGAT TAAATCAGGC GACCGAAGCC ATGTCGCGCT TAAGTTTAGG GGACACCAAT
GTGAGCCTCG ATGCCCACGG CGATGATGAA TTAACCGCCA TGGCCAATGC TATCGAGGCA
TTTAAGCGAA AAACCGCCCA TAATCTGAAA TTGCAGGCGG ATTTACGCCA GGTGGCTGAT
GAGCTGACCG AGCATAAAAA GGCTCTCGAA CAAACCGTGG CCACGCGGAC CCAGGAATTG
GCCGAAGCCA ACCTGCGCCT CGATGCGGAG GCCAAAGGCC ACGCTAAGGC AAGAATCGTT
GCCGAGGAGG CGAGTCAGGC AAAATCCCAA TTCTTGGCGA CCATGAGTCA TGAAATTCGC
ACCCCGCTCA ATGGTTTACT CGGGACGCTT ACCCTGCTCG GCCATAGCCA GTTACCGCCC
GCGCAGCAAC AGATGCTCGC GTTGTCGCAA TATAGCGGCA CGCTGCTGCA AACCGTGCTG
AGCGATATTC TCGATTTCTC CCGCCTCGAG CAGGGTAATT TAACCAATGA GCCGCGGCCA
ACGGATATCA ACGCCCTGCT CGATGAAGTG CTGGCGATCA TGGTGGCGGG CGCCAATTTG
GCGGGATTGA GCCTAAGGCT GAATCGTCCC CAGTTACCCG CGTGTATTCA GATCGATGGC
CCTAAGCTGA GGCAAGTGCT GTTTAACTTG ATTGGTAACG GCATTAAATT CACCCCCGAA
GGCGGCGTGA GTCTCAATGT CAGCGTACGA GGCGATAAGT TAGCCTTTGT GGTGGCCGAT
ACCGGCGTGG GGATTGCGCC CGAAGTGCGC GAGCAGTTAT TTATGCCTTA CTGCGTATTG
CCTAATCAGG GGCGCAGCCG TGGGACAGGC TTAGGGCTTG CCATCTGCAA ACAGTTAGTT
GAACTGATGG ATGCCGAGGG GCCGGGAATT TGGGTCAAGA GCGAGCCAGG CAAGGGCAGC
GAGTTTGGTT TTGAGCTGAG CTTTACCCAA TGTGACAAGG CATTGGATAC GCAAACTCAG
GTGCAAAAAC GGGTTAATCC CCAACGGGTG TTAGTGATTG AAGACAACAA GGTCAATGCC
ATGGTCGCGC AGGGATTTTT GGCGCATTTA GGCCACAGCT CCAGCTTGGC CGTCAGTTGT
CAGCAGGCGC TGGCGCATGT GAGTGGCGAT AAGGCATTCG ATGCCGTGAT GCTCGATATT
CAGCTGAGTG ATGGCTCGGG ACTCACGCTA TTACCCCAGT TAAAGGCGTT ATTCGCCAAT
GACAATGTGA AGTTCGCTGC CTTTACCGCG CAGATGCAAA CGGAGGATCT TAGCCTGTAT
CGGGAGGCTG GGTTTGATAC TGTGTTAGCC AAACCCTTGA GCCTGCAAAC CCTGACAGAA
TGGTTAGGCG TTGCGCGTGT GCCAGCTTCA ATGCCTGCTT TACCGTATGA GTCATCGTCG
CAATCGGCAC AATCACTTCA ATCGCCAAAT CAGGCAGAAT CTACCAGCCA GAGCCATCAA
GCTGAAACCT TGTTAGATCT TAACCAGTTA CAGCAAGATC TTGAGGTCTT AGGCGTGAAA
GCCGTGAGTG ATATTTTGGC GCTCTATCGA TGCTCCAGTG CCGAGCAAAT TGAGCGACTC
TCGGCGCTAA CCTCTGTGGC GCATTTCAGC GAGGGCGCAA AACTATTGCA TGCGCTTAAG
GGCAGCAGTG CCAGCATGGG GCTAAAAGCA CTGACCGAGT GTTGTCAGCA GTGGGAGAAA
ACACTCAACA CCACAGGGGA AAATGTATTG GATAGTAAAG TCGTGGCTGA ATTAACCGCC
TGCTGGCAGG TGTCGATGAC AGCGCTTGAG CAATGGCTCG CAAGACAGAA TTAG
 
Protein sequence
MAPLSLSRSS LTGRLMLAFG VLGVLLLLLV SLGSLSLHWL KQADSYLYEE SLPASQAARQ 
LVQASNALSD GVTQLGQVED ERQREFIGRK LSLESATMLN SIKVLLTLDV NEDNHLSVLA
GQIIEQLTLL GKSVGQRITL GDELQLRARE LSVAAGHASE LLQSELAVVD SAILAKLSQS
YPDMAGDKRS GQLLDDVIER ELDVQEQLNR ALKLVHQIAL LSQLFEASEL QSELQLSVPR
LLATFASTTP SHPAAQSKPV LGTAIEQASS AVTQPPRGDV GIDLMALTTV SELIRDPGRL
KALKAELAIL LHTPKIIKLQ RELSQSLQRQ QRQQQELAEK LYSLNTLVDS ALNQQQQRAE
LARSDYLMQL SYARLGLWGT GILMLVIIGV VVYRVIYRGI ALRLNQATEA MSRLSLGDTN
VSLDAHGDDE LTAMANAIEA FKRKTAHNLK LQADLRQVAD ELTEHKKALE QTVATRTQEL
AEANLRLDAE AKGHAKARIV AEEASQAKSQ FLATMSHEIR TPLNGLLGTL TLLGHSQLPP
AQQQMLALSQ YSGTLLQTVL SDILDFSRLE QGNLTNEPRP TDINALLDEV LAIMVAGANL
AGLSLRLNRP QLPACIQIDG PKLRQVLFNL IGNGIKFTPE GGVSLNVSVR GDKLAFVVAD
TGVGIAPEVR EQLFMPYCVL PNQGRSRGTG LGLAICKQLV ELMDAEGPGI WVKSEPGKGS
EFGFELSFTQ CDKALDTQTQ VQKRVNPQRV LVIEDNKVNA MVAQGFLAHL GHSSSLAVSC
QQALAHVSGD KAFDAVMLDI QLSDGSGLTL LPQLKALFAN DNVKFAAFTA QMQTEDLSLY
REAGFDTVLA KPLSLQTLTE WLGVARVPAS MPALPYESSS QSAQSLQSPN QAESTSQSHQ
AETLLDLNQL QQDLEVLGVK AVSDILALYR CSSAEQIERL SALTSVAHFS EGAKLLHALK
GSSASMGLKA LTECCQQWEK TLNTTGENVL DSKVVAELTA CWQVSMTALE QWLARQN