Gene SeSA_A2114 details


Gene Information

Locus tag: SeSA_A2114
Symbol:
ID: 6517488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 2024797
End bp: 2026317
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 47%
IMG OID: 642747192
Product: flagellin
Protein accession: YP_002114990
Protein GI: 194737030
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
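
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2024797, 2026317)
print(length_bp)                  # 1521, matching "Gene Length"
print(protein_length(length_bp))  # 506, matching "Protein Length"
```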


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.168497
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAAG TCATTAATAC AAACAGCCTG TCGCTGTTGA CCCAGAATAA CCTGAACAAA 
TCCCAGTCCG CTCTGGGCAC CGCTATCGAG CGTCTGTCTT CCGGTCTGCG TATCAACAGC
GCGAAAGACG ATGCGGCAGG TCAGGCGATT GCTAACCGTT TTACCGCGAA CATCAAAGGT
CTGACTCAGG CTTCCCGTAA CGCTAACGAC GGTATTTCTA TTGCGCAGAC CACTGAAGGA
GCGCTGAACG AAATCAACAA CAACCTGCAG CGTGTGCGTG AACTGGCGGT TCAGTCTGCT
AACGGTACTA ACTCCCAGTC TGACCTTGAC TCTATCCAGG CTGAAATCAC CCAGCGTCTG
AACGAAATCG ACCGTGTATC CGGTCAGACT CAGTTCAACG GCGTGAAAGT CCTGGCGCAG
GACAACACCC TGACCATCCA GGTTGGTGCC AACGACGGTG AAACTATTGA TATTGATTTA
AAAGAAATTA GCTCTAAAAC ACTGGGACTT GATAAGCTTA ATGTCCAAGA TGCCTACACC
CCGAAAGAAA CTGCTGTAAC CGTTGATAAA ACTACCTATA AAAATGGTAC AGATACTGTT
ACAGCCCAGA GCAATACTGA TATCGAAACT GCAATTGGCG GTGGTGCAAC GGGGGTTACT
GGGGCTGATA TCAAATTTAA AGATGGTCAA TACTATTTAG ATGTTAAAGG CGGTGCTTCT
GCTGGTGTTT ATAAAGCCAC TTATGATGAA ACTACAAAGA AAGTTAATAT TGATACGACT
GATAAAACTC CGTTAGCAAC TGCGGAAGCT ACAGCTATTC GGGGAACGGC CACTATAACC
CACAACCAAA TTGCTGAAGT AACAAAAGAG GGTGTTGATA CGACCACAGT TGCGGCTCAA
CTTGCTGCTG CAGGGGTTAC TGGTGCCGAT AAGGACAATA CTAGCCTTGT AAAACTATCG
TTTGAGGATA AAAACGGTAA GGTTATTGAT GGTGGCTATG CAGTGAAAAT GGGCGACGAT
TTCTATGCCG CTACATATGA TGAGAAAACA GGTACAATTA CTGCTAAAAC AACCACTTAT
ACAGATGGTG CTGGCGTTGC TCAAACTGGA GCTGTGAAAT TTGGTGGCGC AAATGGTAAA
TCTGAAGTTG TTACTGCTAC CGATGGTAAA ACTTACTTAG CAAGCGACCT TGACAAACAT
AACTTCAGAA CAGGCGGTGA GCTTAAAGAG GTTAATACAG ATAAGACTGA AAACCCACTG
CAGAAAATTG ATGCTGCCTT GGCACAGGTT GATACACTTC GTTCTGACCT GGGTGCGGTA
CAGAACCGTT TCAACTCCGC TATCACCAAC CTGGGCAATA CCGTAAATAA CCTGTCTTCT
GCCCGTAGCC GTATCGAAGA TTCCGACTAC GCGACCGAAG TCTCCAACAT GTCTCGCGCG
CAGATTCTGC AGCAGGCTGG TACTTCCGTT CTGGCGCAGG CGAACCAGGT TCCGCAAAAC
GTCCTCTCTT TACTGCGTTA A
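
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how the listing could be cleaned and how a GC percentage could be computed from it. The helper names are hypothetical, and the example input is only the first line of the listing, not the whole gene, so its result is not expected to equal the GC content reported for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGGCACAAG TCATTAATAC AAACAGCCTG TCGCTGTTGA CCCAGAATAA CCTGAACAAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases (six blocks of ten)
print(round(gc_percent(seq), 1))
```

Passing the full listing through `clean_sequence` first should give a string of 1521 bases, the stated gene length.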
 
Protein sequence
MAQVINTNSL SLLTQNNLNK SQSALGTAIE RLSSGLRINS AKDDAAGQAI ANRFTANIKG 
LTQASRNAND GISIAQTTEG ALNEINNNLQ RVRELAVQSA NGTNSQSDLD SIQAEITQRL
NEIDRVSGQT QFNGVKVLAQ DNTLTIQVGA NDGETIDIDL KEISSKTLGL DKLNVQDAYT
PKETAVTVDK TTYKNGTDTV TAQSNTDIET AIGGGATGVT GADIKFKDGQ YYLDVKGGAS
AGVYKATYDE TTKKVNIDTT DKTPLATAEA TAIRGTATIT HNQIAEVTKE GVDTTTVAAQ
LAAAGVTGAD KDNTSLVKLS FEDKNGKVID GGYAVKMGDD FYAATYDEKT GTITAKTTTY
TDGAGVAQTG AVKFGGANGK SEVVTATDGK TYLASDLDKH NFRTGGELKE VNTDKTENPL
QKIDAALAQV DTLRSDLGAV QNRFNSAITN LGNTVNNLSS ARSRIEDSDY ATEVSNMSRA
QILQQAGTSV LAQANQVPQN VLSLLR
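
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch using only the Python standard library. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code, and it ignores the alternative start codons that table 11 permits (the gene here begins with ATG). The function and variable names are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence listing, copied as printed.
print(translate("ATGGCACAAG TCATTAATAC AAACAGCCTG TCGCTGTTGA CCCAGAATAA CCTGAACAAA"))
# MAQVINTNSLSLLTQNNLNK
print(translate("GTCCTCTCTT TACTGCGTTA A"))
# VLSLLR
```

Both outputs agree with the start and end of the protein sequence above. The last line of the gene listing can be translated on its own only because the 1500 bases before it are a whole number of codons.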