Gene SNSL254_A2079 details

Gene Information

Locus tag: SNSL254_A2079
Symbol:
ID: 6486833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2019705
End bp: 2021366
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 56%
IMG OID: 642737435
Product: methyl-accepting chemotaxis protein II
Protein accession: YP_002041185
Protein GI: 194444788
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
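
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2019705
end_bp = 2021366

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1662 bp) and Protein Length (553 aa).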


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.429347
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 1.96177e-16
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTTAACC GTATCCGCGT TGTCACAATG CTGATGATGG TGCTGGGGGT TTTCGCACTG 
CTACAGCTTG TTTCCGGTGG TTTGCTGTTT TCTTCATTAC AGCATAACCA GCAAGGTTTT
GTTATTTCTA ACGAATTACG TCAGCAACAA AGCGAACTCA CGTCGACATG GGACTTAATG
CTGCAAACGC GCATTAACCT GAGCCGCTCC GCCGCACGCA TGATGATGGA CGCTTCTAAC
CAGCAGAGCA GCGCCAAAAC GGATTTACTC CAGAATGCAA AAACGACCCT CGCACAGGCG
GCGGCGCACT ACGCCAATTT CAAAAACATG ACGCCATTGC CAGCGATGGC GGAGGCCAGC
GCGAACGTCG ATGAAAAATA TCAGCGCTAT CAGGCCGCAT TAGCCGAACT TATTCAGTTT
CTGGACAATG GCAATATGGA TGCCTACTTC GCCCAGCCAA CCCAGGGAAT GCAAAACGCG
TTGGGCGAGG CGCTGGGCAA TTACGCCCGG GTGAGCGAAA ACCTGTACCG CCAGACATTT
GATCAAAGCG CTCATGACTA CCGTTTTGCG CAATGGCAAC TGGGGGGGCT TGCGGTCGTG
CTGGTGCTGA TTTTGATGGT GGTTTGGTTC GGCATTCTTC ATGCCTTGCT TAACCCATTA
GCGCGAGTGA TTACTCATAT CCGTGAAATT GCCAGCGGCG ATCTGACGAA AACGCTCACC
GTCTCAGGAC GTAATGAAAT TGGCGAACTG GCGGGAACGG TTGAACATAT GCAACGCTCG
CTGATTGACA CCGTGACGCA GGTTCGTGAA GGTTCGGATG CGATTTATTC CGGCACCAGT
GAAATTGCCG CCGGTAATAC CGACCTCTCT TCCCGTACCG AACAGCAGGC CTCCGCTCTG
GAGGAGACGG CGGCCAGCAT GGAACAACTG ACGGCCACCG TGAAGCAAAA CGCCGATAAT
GCCCGCCAGG CTTCGCAACT GGCGCAAAGC GCCTCCGAGA CCGCGCGTCA TGGCGGCAAA
GTGGTCGACG GCGTAGTAAA CACTATGCAC GAAATTGCCG ACAGTTCGAA AAAAATCGCT
GACATTATCA GCGTTATCGA CGGTATTGCC TTCCAGACTA ACATTCTGGC GCTGAACGCG
GCGGTAGAAG CAGCGCGCGC GGGAGAGCAG GGGCGCGGTT TTGCGGTCGT GGCAGGCGAG
GTGCGTAATC TGGCCAGCCG CAGTGCCCAG GCGGCGAAAG AAATAAAAGC GTTGATTGAA
GATTCCGTCT CGCGTGTCGA TACCGGTTCT GTGCTGGTGG AAAGCGCCGG GGAAACCATG
ACTGACATCG TCAATGCCGT TACGCGCGTC ACGGATATCA TGGGCGAAAT CGCCTCCGCC
TCGGATGAGC AAAGCCGGGG TATTGATCAG GTCGCTTTGG CCGTTTCCGA AATGGATCGC
GTAACGCAAC AGAACGCCTC GCTGGTTCAG GAATCCGCAG CGGCCGCCGC CGCGCTGGAA
GAGCAGGCCA GCCGTCTGAC CCAGGCGGTA TCGGCTTTCC GCCTGGCATC GCGACCGCTG
GCGGTAAATA AACCTGAGAT GCGTTTGTCA GTGGACGCTC AGTCCGGCAA TACGCCGCGG
CCATTAGCCG CCGGGGATGA TGCGAACTGG GAAACCTTCT GA
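
The GC content reported in the Gene Information table can be recomputed from a sequence laid out like the one above. This is a minimal sketch: the helper name `gc_percent` is hypothetical, and it assumes the sequence is pasted as a string with the 10-base spacing and line breaks left in place. The database may round or compute the figure differently.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustration on a toy string; paste the full gene sequence to check the record.
print(gc_percent("ATGC ATGC"))
```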
 
Protein sequence
MFNRIRVVTM LMMVLGVFAL LQLVSGGLLF SSLQHNQQGF VISNELRQQQ SELTSTWDLM 
LQTRINLSRS AARMMMDASN QQSSAKTDLL QNAKTTLAQA AAHYANFKNM TPLPAMAEAS
ANVDEKYQRY QAALAELIQF LDNGNMDAYF AQPTQGMQNA LGEALGNYAR VSENLYRQTF
DQSAHDYRFA QWQLGGLAVV LVLILMVVWF GILHALLNPL ARVITHIREI ASGDLTKTLT
VSGRNEIGEL AGTVEHMQRS LIDTVTQVRE GSDAIYSGTS EIAAGNTDLS SRTEQQASAL
EETAASMEQL TATVKQNADN ARQASQLAQS ASETARHGGK VVDGVVNTMH EIADSSKKIA
DIISVIDGIA FQTNILALNA AVEAARAGEQ GRGFAVVAGE VRNLASRSAQ AAKEIKALIE
DSVSRVDTGS VLVESAGETM TDIVNAVTRV TDIMGEIASA SDEQSRGIDQ VALAVSEMDR
VTQQNASLVQ ESAAAAAALE EQASRLTQAV SAFRLASRPL AVNKPEMRLS VDAQSGNTPR
PLAAGDDANW ETF
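
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below is one way to reproduce that translation without external libraries. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G),
# paired with the amino acids of the standard / table 11 code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGTTTAACC GTATCCGCGT TGTCACAATG"))  # MFNRIRVVTM
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spacing) is a quick consistency check on the record.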