Gene SbBS512_E4893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4893 
Symboltsr 
ID6272504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4564701 
End bp4566356 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content55% 
IMG OID641728625 
Productmethyl-accepting chemotaxis protein I 
Protein accessionYP_001883019 
Protein GI187732801 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAC GTATCAAGAT CGTGACCAGC TTACTGCTGG TTTTGGCCGT TTTTGGCCTT 
TTACAACTGA CATCAGGCGG TCTGTTCTTT AATGCCTTAA AGAATGACAA AGAAAATTTC
ACTGTTTTAC AAACCATTCG CCAGCAGCAA TCCACGCTGA ATGGCAGCTG GGTCGCGTTG
TTGCAGACGC GTAACACCCT CAACCGCGCG GGTATCCGCT ACATGATGGA TCAGAATAAT
ATTGGTAGCG GTTCAACCGT TGCTGAGCTG ATGCAGAGTG CCAGTATTTC GCTGAAACAG
GCGGAAAAAA ACTGGGCGGA TTACGAAGCG TTGCCGCGTG ACCCGCGTCA GAGCACCGCC
GCAGCGGCAG AGATCAAACG TAATTACGAT ATTTATCACA ATGCGCTGGC GGAGCTGATC
CAACTGTTAG GTGCAGGCAA AATCAACGAG TTCTTTGATC AGCCGACCCA GGGATATCAG
GACGGTTTCG AGAAGCAGTA TGTGGCTTAC ATGGAGCAAA ACGATCGGCT CTATGATATC
GCCGTCAGCG ATAACAATGC CTCCTACAGC CAGGCGATGT GGATTCTGGT GGGCGTGATG
ATCGTCGTAC TGGCGGTCAT CTTCGCCGTC TGGTTCGGTA TTAAAGCCTC GCTGGTAGCG
CCAATGAATC GCCTGATTGA CAGCATTCGT CATATTGCAG GCGGCGATCT GGTGAAACCG
ATTGAGGTGG ATGGCTCTAA TGAGATGGGG CAACTGGCAG AGAGTTTGCG CCATATGCAG
GGAGAGCTGA TGCGTACCGT CGGTGATGTG CGCAACGGGG CCAATGCCAT CTATAGCGGT
GCCAGCGAAA TCGCCACCGG CAATAACGAT CTCTCTTCGC GCACCGAGCA ACAGGCCGCT
TCGCTGGAAG AGACGGCAGC CAGCATGGAG CAACTGACCG CAACGGTGAA GCAGAACGCC
GAAAATGCGC GCCAGGCCAG CCATCTGGCG TTAAGTGCTT CTGAAACGGC GCAACGCGGC
GGCAAAGTGG TAGATAACGT GGTGCAGACT ATGCGCGATA TCTCCACCAG TTCGCAGAAA
ATCGCCGATA TTATCAGCGT AATTGACGGC ATTGCCTTCC AGACCAATAT TCTGGCTTTG
AACGCGGCGG TTGAGGCTGC GCTTGCGGGT GAGCAAGGGC GCGGTTTTGC GGTGGTCGCG
GGAGAAGTGC GTAATCTGGC CCAGCGCAGC GCCCAGGCGG CTCGTGAAAT TAAAAGCCTG
ATTGAAGACT CGGTGGGGAA AGTGGATGTT GGCTCTACGC TGGTCGAAAG CGCCGGGGAA
ACAATGGCGG AGATTGTCAG CGCCGTGACC CGCGTGACGG ACATTATGGG CGAAATTGCT
TCTGCTTCTG ATGAGCAGAG CCGTGGTATC GATCAGGTTG GCTTAGCGGT TGCTGAGATG
GACCGGGTAA CTCAACAGAA CGCCGCGCTG GTGGAAGAGT CTGCCGCTGC CGCCGCCGCG
CTGGAAGAGC AGGCCAGTCG CCTGACCGAA GCAGTGGCAG TGTTCCGGAT TCAGCAACAG
CAGCGTGAAA CATCGGCTGT GGTAAAAACC GTGACGCCAG CTGCGCCGCG TAAAATGGCC
GTGGCAGATA GCGAGGAGAA CTGGGAAACA TTTTAA
 
Protein sequence
MLKRIKIVTS LLLVLAVFGL LQLTSGGLFF NALKNDKENF TVLQTIRQQQ STLNGSWVAL 
LQTRNTLNRA GIRYMMDQNN IGSGSTVAEL MQSASISLKQ AEKNWADYEA LPRDPRQSTA
AAAEIKRNYD IYHNALAELI QLLGAGKINE FFDQPTQGYQ DGFEKQYVAY MEQNDRLYDI
AVSDNNASYS QAMWILVGVM IVVLAVIFAV WFGIKASLVA PMNRLIDSIR HIAGGDLVKP
IEVDGSNEMG QLAESLRHMQ GELMRTVGDV RNGANAIYSG ASEIATGNND LSSRTEQQAA
SLEETAASME QLTATVKQNA ENARQASHLA LSASETAQRG GKVVDNVVQT MRDISTSSQK
IADIISVIDG IAFQTNILAL NAAVEAALAG EQGRGFAVVA GEVRNLAQRS AQAAREIKSL
IEDSVGKVDV GSTLVESAGE TMAEIVSAVT RVTDIMGEIA SASDEQSRGI DQVGLAVAEM
DRVTQQNAAL VEESAAAAAA LEEQASRLTE AVAVFRIQQQ QRETSAVVKT VTPAAPRKMA
VADSEENWET F