Gene SNSL254_A3846 details


Gene Information

Locus tag: SNSL254_A3846
Symbol:
ID: 6482823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3719397
End bp: 3721040
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 56%
IMG OID: 642739111
Product: methyl-accepting chemotaxis protein I
Protein accession: YP_002042822
Protein GI: 194444489
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
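
The Start bp, End bp, Gene Length and Protein Length values in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(3719397, 3721040)
print(length, protein_length_aa(length))  # prints "1644 547"
```

Both results match the Gene Length and Protein Length fields of the record.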


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 90
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATA TCAAAGTCAT CACCGGCGTT ATCGCGACGC TGGGCATATT TAGCGCCTTA 
TTGTTGGTGA CAGGAATACT GTTTTATTCC GCCGTCAGCA GCGATCGGCT GAATTTCCAG
AATGCGAGCG CGCTGAGTTA CCAACAACAG GAACTGGGCG GCAGTTTTCA GACATTGATC
GAAACCCGCG TCACCATTAA CCGCGTGGCG ATACGCATGT TAAAAAATCA GCGCGATCCC
GCCTCACTGG ACGCCATGAA CACGCTGTTA ACCAACGCTG GCGCGTCGCT CAACGAAGCG
GAAAAGCATT TCAACAACTA CGTGAACTCC GAAGCGATCG CGGGCAAAGA TCCGGCGTTG
GATGCCCAGG CCGAAGCCAG CTTTAAGCAG ATGTATGACG TTTTGCAGCA GTCTATCCAC
TATCTTAAAG CCGATAATTA CGCCGCCTAT GGCAACCTTG ACGCGCAAAA AGCGCAGGAT
GACATGGAGC AGGTATATGA CCAGTGGCTC TCTCAAAATG CGCAATTAAT AAAATTAGCC
AGCGATCAGA ATCAGAGCAG TTTTACCCAG ATGCAATGGA CGCTGGGGAT AATTCTACTT
ATCGTGCTCA TCGTGCTGGC GTTTATCTGG CTGGGGCTGC AACGCGTTCT ACTCCGCCCG
CTGCAACGGA TTATGGCGCA CATTCAAACG ATCGCCGACG GCGATCTTAC CCATGAGATA
GAGGCCGAAG GACGCAGTGA AATGGGCCAA CTGGCCGCCG GTCTTAAAAC GATGCAGCAG
TCGTTAATCC GTACCGTCAG CGCGGTGCGC GATAACGCAG ACTCTATCTA TACTGGCGCA
GGCGAAATTT CCGCCGGCAG CAGCGATCTC TCTTCCCGTA CCGAACAGCA GGCCTCGGCG
CTGGAGGAGA CCGCCGCCAG CATGGAACAG TTAACCGCCA CGGTACGGCA AAACACTGAT
AACGCACGAC AGGCGACGGG TCTGGCGAAA ACCGCATCAG AAACCGCGCG TAAAGGAGGA
CGCGTGGTGA ATAACGTAGT GAGCACCATG AACGATATCG CCGAAAGCTC GGAAAAAATC
GTGGACATCA CCAGCGTGAT TGACGGTATC GCCTTCCAGA CTAATATCCT GGCGCTGAAC
GCCGCGGTAG AAGCCGCCCG CGCCGGCGAA CAGGGGCGAG GATTCGCGGT CGTGGCCGGA
GAGGTACGCA CGTTGGCCAT CCGTAGCGCG CAGGCCGCCA AAGAGATCAA AGTACTGATT
GAAAACTCCG TGTCGCGCAT TGATACCGGC TCTACGCAGG TACGCGAAGC GGGAGAAACC
ATGAAAGAGA TCGTTAACGC CGTGACCCGC GTGACCGATA TTATGGGCGA AATCGCCTCT
GCCTCCGATG AGCAAAGCAA AGGCATTGAG CAGGTGGCGC AGGCGGTATC GGAAATGGAC
AGCGTGACGC AGCAAAACGC CTCGCTGGTA GAGGAATCCG CAGCAGCAGC GGCGGCGCTG
GAAGATCAGG CTAACGAACT TCGTCAGGCG GTCGCCGCGT TCCGCATCCA GAAACAACCT
CGTCGGGAGG CGTCGCCGAC GCCGTTAAGC AAAGGTTTAA CGCCGCAGCC CGCCGCAGAA
CAGGCGAACT GGGAAAGCTT CTAA
 
Protein sequence
MKNIKVITGV IATLGIFSAL LLVTGILFYS AVSSDRLNFQ NASALSYQQQ ELGGSFQTLI 
ETRVTINRVA IRMLKNQRDP ASLDAMNTLL TNAGASLNEA EKHFNNYVNS EAIAGKDPAL
DAQAEASFKQ MYDVLQQSIH YLKADNYAAY GNLDAQKAQD DMEQVYDQWL SQNAQLIKLA
SDQNQSSFTQ MQWTLGIILL IVLIVLAFIW LGLQRVLLRP LQRIMAHIQT IADGDLTHEI
EAEGRSEMGQ LAAGLKTMQQ SLIRTVSAVR DNADSIYTGA GEISAGSSDL SSRTEQQASA
LEETAASMEQ LTATVRQNTD NARQATGLAK TASETARKGG RVVNNVVSTM NDIAESSEKI
VDITSVIDGI AFQTNILALN AAVEAARAGE QGRGFAVVAG EVRTLAIRSA QAAKEIKVLI
ENSVSRIDTG STQVREAGET MKEIVNAVTR VTDIMGEIAS ASDEQSKGIE QVAQAVSEMD
SVTQQNASLV EESAAAAAAL EDQANELRQA VAAFRIQKQP RREASPTPLS KGLTPQPAAE
QANWESF
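
The gene and protein sequences above can be checked against each other, and against the GC content field, with a few lines of Python. This is a minimal sketch, not the database's own method. It builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs in which start codons it permits; this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAAAAATA TCAAAGTCAT CACCGGCGTT"
print(translate(first_blocks))  # prints "MKNIKVITGV"
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` gives values to compare with the Protein Length and GC content fields.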