Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3846 |
Symbol | |
ID | 6482823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3719397 |
End bp | 3721040 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642739111 |
Product | methyl-accepting chemotaxis protein I |
Protein accession | YP_002042822 |
Protein GI | 194444489 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATA TCAAAGTCAT CACCGGCGTT ATCGCGACGC TGGGCATATT TAGCGCCTTA TTGTTGGTGA CAGGAATACT GTTTTATTCC GCCGTCAGCA GCGATCGGCT GAATTTCCAG AATGCGAGCG CGCTGAGTTA CCAACAACAG GAACTGGGCG GCAGTTTTCA GACATTGATC GAAACCCGCG TCACCATTAA CCGCGTGGCG ATACGCATGT TAAAAAATCA GCGCGATCCC GCCTCACTGG ACGCCATGAA CACGCTGTTA ACCAACGCTG GCGCGTCGCT CAACGAAGCG GAAAAGCATT TCAACAACTA CGTGAACTCC GAAGCGATCG CGGGCAAAGA TCCGGCGTTG GATGCCCAGG CCGAAGCCAG CTTTAAGCAG ATGTATGACG TTTTGCAGCA GTCTATCCAC TATCTTAAAG CCGATAATTA CGCCGCCTAT GGCAACCTTG ACGCGCAAAA AGCGCAGGAT GACATGGAGC AGGTATATGA CCAGTGGCTC TCTCAAAATG CGCAATTAAT AAAATTAGCC AGCGATCAGA ATCAGAGCAG TTTTACCCAG ATGCAATGGA CGCTGGGGAT AATTCTACTT ATCGTGCTCA TCGTGCTGGC GTTTATCTGG CTGGGGCTGC AACGCGTTCT ACTCCGCCCG CTGCAACGGA TTATGGCGCA CATTCAAACG ATCGCCGACG GCGATCTTAC CCATGAGATA GAGGCCGAAG GACGCAGTGA AATGGGCCAA CTGGCCGCCG GTCTTAAAAC GATGCAGCAG TCGTTAATCC GTACCGTCAG CGCGGTGCGC GATAACGCAG ACTCTATCTA TACTGGCGCA GGCGAAATTT CCGCCGGCAG CAGCGATCTC TCTTCCCGTA CCGAACAGCA GGCCTCGGCG CTGGAGGAGA CCGCCGCCAG CATGGAACAG TTAACCGCCA CGGTACGGCA AAACACTGAT AACGCACGAC AGGCGACGGG TCTGGCGAAA ACCGCATCAG AAACCGCGCG TAAAGGAGGA CGCGTGGTGA ATAACGTAGT GAGCACCATG AACGATATCG CCGAAAGCTC GGAAAAAATC GTGGACATCA CCAGCGTGAT TGACGGTATC GCCTTCCAGA CTAATATCCT GGCGCTGAAC GCCGCGGTAG AAGCCGCCCG CGCCGGCGAA CAGGGGCGAG GATTCGCGGT CGTGGCCGGA GAGGTACGCA CGTTGGCCAT CCGTAGCGCG CAGGCCGCCA AAGAGATCAA AGTACTGATT GAAAACTCCG TGTCGCGCAT TGATACCGGC TCTACGCAGG TACGCGAAGC GGGAGAAACC ATGAAAGAGA TCGTTAACGC CGTGACCCGC GTGACCGATA TTATGGGCGA AATCGCCTCT GCCTCCGATG AGCAAAGCAA AGGCATTGAG CAGGTGGCGC AGGCGGTATC GGAAATGGAC AGCGTGACGC AGCAAAACGC CTCGCTGGTA GAGGAATCCG CAGCAGCAGC GGCGGCGCTG GAAGATCAGG CTAACGAACT TCGTCAGGCG GTCGCCGCGT TCCGCATCCA GAAACAACCT CGTCGGGAGG CGTCGCCGAC GCCGTTAAGC AAAGGTTTAA CGCCGCAGCC CGCCGCAGAA CAGGCGAACT GGGAAAGCTT CTAA
|
Protein sequence | MKNIKVITGV IATLGIFSAL LLVTGILFYS AVSSDRLNFQ NASALSYQQQ ELGGSFQTLI ETRVTINRVA IRMLKNQRDP ASLDAMNTLL TNAGASLNEA EKHFNNYVNS EAIAGKDPAL DAQAEASFKQ MYDVLQQSIH YLKADNYAAY GNLDAQKAQD DMEQVYDQWL SQNAQLIKLA SDQNQSSFTQ MQWTLGIILL IVLIVLAFIW LGLQRVLLRP LQRIMAHIQT IADGDLTHEI EAEGRSEMGQ LAAGLKTMQQ SLIRTVSAVR DNADSIYTGA GEISAGSSDL SSRTEQQASA LEETAASMEQ LTATVRQNTD NARQATGLAK TASETARKGG RVVNNVVSTM NDIAESSEKI VDITSVIDGI AFQTNILALN AAVEAARAGE QGRGFAVVAG EVRTLAIRSA QAAKEIKVLI ENSVSRIDTG STQVREAGET MKEIVNAVTR VTDIMGEIAS ASDEQSKGIE QVAQAVSEMD SVTQQNASLV EESAAAAAAL EDQANELRQA VAAFRIQKQP RREASPTPLS KGLTPQPAAE QANWESF
|
| |