Gene SeHA_C4939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4939 
Symbol 
ID6489244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4821428 
End bp4823089 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content55% 
IMG OID642744984 
Productmethyl-accepting chemotaxis protein I 
Protein accessionYP_002048556 
Protein GI194451921 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.188217 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAAGC GAATTAAAAT TGTTACCAGC TTACTGCTGG TATTGGCGCT ATTTGGCCTT 
TTACAACTGA CATCCGGCGG GCTGTTCTTC AACTCGCTGA AGAATGACAA AGAAAACTTC
ACCGTATTGC AAACTATTCG TCAGCAGCAG TCTGCCCTGA ATGCAACCTG GGTGGAGCTG
TTGCAAACGC GTAATACCCT GAATCGCGCG GGTATCCGCT GGATGATGGA CCAGAGCAAT
ATTGGCAGCG GCGCAACTGT CGCTGAACTG ATGCAGGGGG CGACCAATAC GCTGAAGCTG
ACCGAAAAAA ACTGGGAGCA GTATGAGGCG TTACCGCGCG ATCCACGTCA GAGTGAAGCG
GCTTTCCTTG AGATCAAACG AACCTATGAT ATCTACCACG GCGCGTTGGC GGAGCTTATT
CAGCTTCTTG GCGCGGGTAA GATTAACGAG TTTTTTGATC AACCGACTCA AAGCTATCAG
GACGCTTTTG AGAAGCAGTA CATGGCCTAT ATGCAGCAAA ACGATCGTCT GTACGATATT
GCTGTTGAGG ATAACAACAG TTCCTACAAC CAGGCGATGT GGGTACTGGT CAGTGTGCTG
ATTGCCGTTC TGGTGGTCAT TATCGCCGTC TGGTTCGGCA TCAAACTGTC GCTTATCGCC
CCGATGAATC GTCTGATTGA AAGCATTCGT CATATCGCCA GCGGCGATCT GGTGAAGCGT
ATCGACGTGG AAGGCTCCAA CGAAATGGGG CAGTTGGCTG AAAACCTGCG TCATATGCAA
AGTGAACTGA TGCGTACCGT GGGCGATGTA CGTAACGGCG CGAATGCGAT CTATAGCGGC
GCCAGCGAGA TTGCGATGGG CAACAACGAT CTCTCTTCCC GTACTGAGCA GCAGGCAGCG
TCTCTGGAAG AGACCGCCGC CAGTATGGAA CAACTGACCG CCACCGTGAA ACAGAACGCC
GAAAACGCCC GTCAGGCCAG TCACCTGGCG CTGAGTGCGT CAGAGACAGC GCAAAAAGGC
GGCAAAGTGG TGGATAACGT CGTACAAACA ATGCGCGATA TCGCCTCCAG TTCGCAGAAA
ATCGCCGATA TTATCAGCGT AATCGACGGT ATTGCTTTCC AGACCAATAT TCTGGCGCTG
AATGCGGCGG TAGAAGCGGC GCGCGCAGGC GAGCAGGGAC GCGGGTTCGC AGTGGTGGCC
GGTGAAGTCC GTAATCTGGC CCAGCGTAGC GCGCAGGCGG CACGGGAGAT CAAGAGTCTG
ATTGAGGATT CCGTGAGCCG TGTTGATGTA GGTTCGACGC TGGTCGAAAG CGCCGGTGAA
ACCATGGATG AGATCGTCAA TGCAGTGACC CGCGTGACCG ATATCATGGG CGAGATTGCC
TCGGCGTCTG ACGAGCAAAG CCGTGGTATC GACCAGGTGG GCCTGGCGGT AGCGGAGATG
GATCGCGTAA CGCAGCAGAA CGCCTCGCTG GTGGAAGAGT CCGCCGCCGC GGCTGCGGCG
CTGGAAGAGC AAGCCAGCCG TCTGACCCAG GCCGTCGCGG TGTTCCGTAT TCACCAGCAA
CAGCAGCGTG CGCGTGAAGT GGCTGCGGTA AAAACCCCGG CAGCCGTGTC GTCACCAAAG
GCCGCAGTGG CCGACGGCAG CGATAATTGG GAAACATTTT AA
 
Protein sequence
MLKRIKIVTS LLLVLALFGL LQLTSGGLFF NSLKNDKENF TVLQTIRQQQ SALNATWVEL 
LQTRNTLNRA GIRWMMDQSN IGSGATVAEL MQGATNTLKL TEKNWEQYEA LPRDPRQSEA
AFLEIKRTYD IYHGALAELI QLLGAGKINE FFDQPTQSYQ DAFEKQYMAY MQQNDRLYDI
AVEDNNSSYN QAMWVLVSVL IAVLVVIIAV WFGIKLSLIA PMNRLIESIR HIASGDLVKR
IDVEGSNEMG QLAENLRHMQ SELMRTVGDV RNGANAIYSG ASEIAMGNND LSSRTEQQAA
SLEETAASME QLTATVKQNA ENARQASHLA LSASETAQKG GKVVDNVVQT MRDIASSSQK
IADIISVIDG IAFQTNILAL NAAVEAARAG EQGRGFAVVA GEVRNLAQRS AQAAREIKSL
IEDSVSRVDV GSTLVESAGE TMDEIVNAVT RVTDIMGEIA SASDEQSRGI DQVGLAVAEM
DRVTQQNASL VEESAAAAAA LEEQASRLTQ AVAVFRIHQQ QQRAREVAAV KTPAAVSSPK
AAVADGSDNW ETF