Gene SeD_A4946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4946 
Symbol 
ID6871938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4779606 
End bp4781267 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content55% 
IMG OID642787816 
Productmethyl-accepting chemotaxis protein I 
Protein accessionYP_002218409 
Protein GI198244112 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.556415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAAGC GAATTAAAAT TGTTACCAGC TTACTGCTGG TATTGGCGCT ATTTGGCCTT 
TTACAACTGA CATCCGGCGG GCTGTTCTTC AACTCGCTGA AGAATGACAA AGAAAACTTC
ACCGTATTGC AAACCATTCG TCAGCAGCAG TCTGCCCTGA ATGCAACCTG GGTGGAGCTG
TTGCAAACGC GTAATACCCT GAATCGCGCG GGTATCCGCT GGATGATGGA CCAGAGCAAT
ATTGGCAGCG GCGCAACTGT CGCTGAACTG ATGCAGGGGG CGACCAATAC GCTGAAGCTG
ACCGAAAAAA ACTGGGCGCA GTATGAGGCG TTACCGCGCG ATCCACGTCA GAGTGAAGCG
GCTTTCCTTG AGATCAAACG AACCTATGAT ATCTACCACG GCGCGTTGGC GGAGCTTATT
CAGCTTCTTG GCGCGGGTAA GATTAACGAG TTTTTTGATC AACCGACTCA AAGCTATCAG
GACGCTTTTG AGAAGCAGTA CATGGCCTAT ATGCAGCAAA ACGATCGTCT GTACGATATT
GCTGTTGAGG ATAACAACAG TTCCTACAAC CAGGCGATGT GGGTACTGGT CAGTGTGCTG
ATTGCCGTTC TGGTGGTCAT TATCGCCGTC TGGTTCGGCA TCAAACTGTC GCTTATCGCC
CCGATGAATC GTCTGATTGA AAGCATTCGT CATATCGCCA GCGGCGATCT GGTGAAGCGT
ATCGACGTGG AAGGCTCCAA CGAAATGGGG CAGTTGGCTG AAAACCTGCG TCATATGCAA
AGTGAACTGA TGCGTACCGT TGGCGATGTA CGTAACGGCG CGAATGCGAT CTATAGCGGC
GCCAGCGAGA TTGCGATGGG CAACAACGAT CTCTCTTCCC GTACTGAGCA GCAGGCAGCG
TCTCTGGAAG AGACCGCCGC CAGTATGGAA CAACTGACCG CCACCGTGAA ACAGAACGCC
GAAAACGCCC GTCAGGCCAG TCACCTGGCA TTGAGCGCCT CTGAAACAGC GCAAAAAGGC
GGCAAAGTGG TGGATAACGT CGTACAAACA ATGCGCGATA TCGCCTCCAG TTCGCAGAAA
ATCGCCGATA TTATCAGCGT AATCGACGGT ATTGCTTTCC AGACCAATAT CCTGGCGCTG
AATGCGGCGG TAGAAGCGGC GCGCGCAGGC GAGCAGGGAC GCGGGTTCGC GGTGGTGGCC
GGTGAAGTCC GTAATCTGGC CCAGCGGAGC GCGCAGGCAG CGCGGGAGAT CAAGAGTCTG
ATTGAGGATT CCGTGAGCCG TGTTGATGTA GGTTCGACGC TGGTCGAAAG CGCCGGTGAA
ACCATGGATG AGATCGTCAA TGCAGTGACC CGCGTGACCG ATATCATGGG CGAGATTGCC
TCGGCGTCTG ACGAGCAAAG CCGTGGTATC GACCAGGTGG GCCTGGCGGT AGCGGAGATG
GATCGCGTAA CGCAGCAGAA CGCCTCGCTG GTGGAAGAGT CCGCCGCCGC GGCTGCGGCG
CTGGAAGAGC AAGCCAGCCG TCTGACCCAG GCCGTCGCGG TGTTCCGTAT TCAACAGCAA
CAGCAGCGTG CGCGTGATGT GGCTGCGGTA AAAACCCCGG CAGCCGTGTC GTCACCAAAG
GCCGCTGTGG CCGACGGCAG CGATAATTGG GAAACGTTTT AA
 
Protein sequence
MLKRIKIVTS LLLVLALFGL LQLTSGGLFF NSLKNDKENF TVLQTIRQQQ SALNATWVEL 
LQTRNTLNRA GIRWMMDQSN IGSGATVAEL MQGATNTLKL TEKNWAQYEA LPRDPRQSEA
AFLEIKRTYD IYHGALAELI QLLGAGKINE FFDQPTQSYQ DAFEKQYMAY MQQNDRLYDI
AVEDNNSSYN QAMWVLVSVL IAVLVVIIAV WFGIKLSLIA PMNRLIESIR HIASGDLVKR
IDVEGSNEMG QLAENLRHMQ SELMRTVGDV RNGANAIYSG ASEIAMGNND LSSRTEQQAA
SLEETAASME QLTATVKQNA ENARQASHLA LSASETAQKG GKVVDNVVQT MRDIASSSQK
IADIISVIDG IAFQTNILAL NAAVEAARAG EQGRGFAVVA GEVRNLAQRS AQAAREIKSL
IEDSVSRVDV GSTLVESAGE TMDEIVNAVT RVTDIMGEIA SASDEQSRGI DQVGLAVAEM
DRVTQQNASL VEESAAAAAA LEEQASRLTQ AVAVFRIQQQ QQRARDVAAV KTPAAVSSPK
AAVADGSDNW ETF