Gene Shewmr4_2091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2091 
Symbol 
ID4252664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2491581 
End bp2493551 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content51% 
IMG OID638118715 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_734221 
Protein GI113970428 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.684691 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATTT TCAATGTGTT TGGAAATGAA GAGGCGCGCC AGCAAAAGGA AGAGGAGCTG 
CAACGCTATT TCCAATTATT AGATAACAGC GGCAACAGCT TTATGATCGC CGACAGCAAC
CGCAATATTA TCTACGCCAA TAAAGCGGTA TTGAATATGT TATCCGAGGC TGAGGCGGAC
ATTCGCAAAG AGCTGCCGCA GTTCTCTGTG GCCAAGGTGG TGGGCAGTAA TATCGATATT
TTCCATAAAA ACCCGGCCCA TCAACGCAAT ATGCTCGAGC GCCTAACCCA ATCCCATACG
GCGCAAATCA CCATTGGGAA GCGGACTTTT AAACTGATCC TGACGCCCAT TATCACCCGC
GAGAATAAGC ACTTAGGTAC GGGGGTTGAG TGGATTGATA GAACGGAGAG CATAGAGTCC
GAGCGGGCGA CGCAGCGCAT TTTAGAAGCG CTGAATAATA CCAGTACCAA TGTGATGATC
GCCGATGCCA ACCGCACCAT CATCTATATG AACCGCTCGG TGGAATTGAT GCTGCGCCAA
TCAGAGAATG AGATCAGGCA AGCGCTGCCG CATTTCTCCG TCGATAAAAT TCTTGGCAGC
TCGATGGATA TTTTCCATAA GAATCCTGCC CATCAAGCCA GTCTGTTAGA CAAGCTCGAC
CGTAAATATG AATCGCAAAT CAAAGTGGCC AGTTGTCACT TCCGCTTAAC CGCAAGCCCG
ATTATTTCCA AAACGGGTGA GCGGTTAGGT TCTGTCGTCG AATGGCTCGA CCGCACGGTC
GAAGTGCAAA TCGAGCAGGA AATCTCGCGG ATCGTCAATG CGGCGGCGGC GGGAGATTTC
TCCCAACGGG CTGAGACACA AGGTAAGCAG GGCTTCTTCT TAATGCTGGC CAACAGCCTC
AATGCTTTGA TTGAAACCTC GGATCGCGGT CTACAGGATG TGGCGCGGGT GTTGATGGCG
CTCGCCGAAG GCGATTTAAC CACGCGTATC TATAACGATT ACGAAGGTAC CTTTAATGAT
TTGAAAAACT ATTCAAATCA AACCGCTGAA AAGCTCTCTT ACATGATAAG AGACATTCAA
AAGGCCGCCG ATACCATCAA TACCGCCTCT TCCGAAATCG CTCAGGGCAA TGCTGATCTG
TCGAGCCGCA CCGAGGAGCA AGCCTCGAGT CTTGAACAAA CCTCGGCGAG TATGGAAGAA
CTGACGGGCA CAGTGAAGCT AAATGCCGAC AACGCTAGCC AGGCCAATGC ACTTGCCTCT
AAGGCCGCCG ATGTCGCCGT CGATGGGGGC GAACTTATCC AGCAAGTGGT GCAAACCATG
GCATCGATTA ACGAATCGGC ACGCAAGATT GCCGATATTA TCGGTGTAAT TGATGGCATA
GCCTTCCAAA CCAATATCTT AGCGCTAAAT GCGGCGGTCG AAGCGGCCAG AGCCGGTGAG
CAAGGTCGTG GATTTGCGGT GGTGGCCTCA GAGGTGAGAA GTTTGGCCCA ACGTTCGGCC
AACGCAGCCA AAGACATTAA AGCCTTGATC TCCGACTCGG TGACCAAAAT CGAAAGCGGT
AACAGCTTAG TCGGCAAATC CGGCGACACC ATGAAAGAAA TCGTCATCGC CATTAAACGG
GTGAACGACA TTATGGCCGA AATTGCCTCA GCCTCGAATG AGCAGGCGAT CGGTATCGAT
GAGATCAGTA AAGCCGTAGT GCAAATGGAT GAAATGACGC AGCAAAACGC GGCCTTAGTG
GAAGAGGCCG CCGCAGCAGC CGAAAGCATG CAATCACAGG CACAGCAGTT AGCGGACAGT
GTGGCCAACT TTACCGTGGA CGAAGACACC ACAGCTGCGC CCAAACCCGT GGCGAGTCAC
AAGAAGTTAG CGGTTAAACC GCCATCGACT GTGACTCGCA TGCCCGTTAA ACCTAAGGCG
ATGGCGCCAA AGGTCAATAA AGCCGACCAA GATGAATGGG AAGATTTTTG A
 
Protein sequence
MGIFNVFGNE EARQQKEEEL QRYFQLLDNS GNSFMIADSN RNIIYANKAV LNMLSEAEAD 
IRKELPQFSV AKVVGSNIDI FHKNPAHQRN MLERLTQSHT AQITIGKRTF KLILTPIITR
ENKHLGTGVE WIDRTESIES ERATQRILEA LNNTSTNVMI ADANRTIIYM NRSVELMLRQ
SENEIRQALP HFSVDKILGS SMDIFHKNPA HQASLLDKLD RKYESQIKVA SCHFRLTASP
IISKTGERLG SVVEWLDRTV EVQIEQEISR IVNAAAAGDF SQRAETQGKQ GFFLMLANSL
NALIETSDRG LQDVARVLMA LAEGDLTTRI YNDYEGTFND LKNYSNQTAE KLSYMIRDIQ
KAADTINTAS SEIAQGNADL SSRTEEQASS LEQTSASMEE LTGTVKLNAD NASQANALAS
KAADVAVDGG ELIQQVVQTM ASINESARKI ADIIGVIDGI AFQTNILALN AAVEAARAGE
QGRGFAVVAS EVRSLAQRSA NAAKDIKALI SDSVTKIESG NSLVGKSGDT MKEIVIAIKR
VNDIMAEIAS ASNEQAIGID EISKAVVQMD EMTQQNAALV EEAAAAAESM QSQAQQLADS
VANFTVDEDT TAAPKPVASH KKLAVKPPST VTRMPVKPKA MAPKVNKADQ DEWEDF