Gene SNSL254_A2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2120 
Symbol 
ID6484908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2051897 
End bp2053405 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content50% 
IMG OID642737475 
Productflagellin 
Protein accessionYP_002041222 
Protein GI194444631 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0000000196793 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCACAAG TCATTAATAC AAACAGCCTG TCGCTGTTGA CCCAGAATAA CCTGAACAAA 
TCCCAGTCTG CTCTGGGTAC CGCTATCGAG CGTCTGTCTT CCGGTCTGCG TATCAACAGC
GCGAAAGACG ATGCGGCAGG TCAGGCAATT GCTAACCGTT TCACCGCGAA CATCAAAGGT
CTGACTCAGG CTTCCCGTAA CGCTAACGAC GGTATCTCCA TTGCGCAGAC CACTGAAGGC
GCGCTGAACG AAATCAACAA CAACCTGCAG CGTGTGCGTG AACTGGCGGT TCAGTCTGCT
AACAGCACCA ACTCCCAGTC TGACCTTGAC TCCATCCAGG CTGAAATCAC CCAGCGCCTG
AACGAAATCG ACCGTGTATC CGGCCAGACT CAGTTCAACG GCGTGAAAGT CCTGGCGCAG
GACAACACCC TGACCATCCA GGTTGGTGCC AACGACGGTG AAACCATCGA TATAGATCTG
AAGCAGATCA ACTCTCAGAC CCTGGGTCTG GATACGCTGA ATGTGCAGAA AGCGTATGAT
GTATCAGCAA CTGCTGCAAT GGATCCGAAA TCATTTACTG ACGGTACTAA AAATCTTACA
GCGCCTGATG CTACTGCTAT CAAAGCCGCG TTGGGAAATC CCGCGGCAAC AGGCGATTCC
TTGTCTGCTA CGCTTTCGTT TAAAGATGGT AAGTATTACG CCACTGTTGC AGGGTATACG
AATGCTGCCG ATACCAGTAA GAATGGTAAA TATGAAGTGA ATGTTGATAG TGCGACAGGT
GCGGTAACTT TCAATGCAGC ACCAACTAAA GCCACAGTAA CTGGGGATAC AACAGTAACC
AAAGTACAGG TTAATGCTCC TGTTGCAGTC AGTACTGATG TTAAAAAAGC GCTAGAAGAT
GGTGGCGTTT CAAATGCGGA CGCTACCGCA GCTAAATTAG TAAAAATGTC TTATACCGAT
AAAAATGGAA AATCTATTGA CGGTGGTTAT GCGCTTGAAG CCGGTGGCAA GTACTATGCT
GCAACCTATG ACGAAGGTAC AGGTAAAATC ACAGCTAATG TAACCACTTA TACTGATTCC
ACGGGAGCCA CAAAAACTGC GGCTAACCAA CTTGGTGGCG TAGACGGTAA AACCGAAGTT
GTTACTATCG ACGGTAAAAC CTACAATGCT AGTAAAGCCG CTGGTCACGA TTTCAAAGCG
CAGCCAGAGC TGGCTGAAGC AGCCGCTAAA ACCACCGAAA ACCCGCTGGC TAAAATTGAT
GCCGCGCTGG CGCAGGTTGA TGCGTTGCGT TCTGACCTGG GTGCGGTACA GAACCGTTTC
AACTCCGCTA TCACCAACCT GGGCAACACC GTAAACAACT TGTCTGAAGC GCGTAGCCGT
ATCGAAGATT CCGACTATGC GACCGAAGTC TCCAACATGT CTCGCGCGCA GATCCTGCAG
CAGGCCGGTA CCTCCGTTCT GGCGCAGGCG AACCAGGTTC CGCAAAACGT CCTCTCTTTA
CTGCGTTAA
 
Protein sequence
MAQVINTNSL SLLTQNNLNK SQSALGTAIE RLSSGLRINS AKDDAAGQAI ANRFTANIKG 
LTQASRNAND GISIAQTTEG ALNEINNNLQ RVRELAVQSA NSTNSQSDLD SIQAEITQRL
NEIDRVSGQT QFNGVKVLAQ DNTLTIQVGA NDGETIDIDL KQINSQTLGL DTLNVQKAYD
VSATAAMDPK SFTDGTKNLT APDATAIKAA LGNPAATGDS LSATLSFKDG KYYATVAGYT
NAADTSKNGK YEVNVDSATG AVTFNAAPTK ATVTGDTTVT KVQVNAPVAV STDVKKALED
GGVSNADATA AKLVKMSYTD KNGKSIDGGY ALEAGGKYYA ATYDEGTGKI TANVTTYTDS
TGATKTAANQ LGGVDGKTEV VTIDGKTYNA SKAAGHDFKA QPELAEAAAK TTENPLAKID
AALAQVDALR SDLGAVQNRF NSAITNLGNT VNNLSEARSR IEDSDYATEV SNMSRAQILQ
QAGTSVLAQA NQVPQNVLSL LR