Gene SeHA_C2947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2947 
Symbol 
ID6491716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2887736 
End bp2889256 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content51% 
IMG OID642743105 
Productflagellin 
Protein accessionYP_002046729 
Protein GI194448978 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.46563e-22 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCACAAG TAATCAACAC TAACAGTCTG TCGCTGCTGA CCCAGAATAA CCTGAACAAA 
TCCCAGTCCG CACTGGGTAC CGCTATCGAG CGTCTGTCTT CTGGTCTGCG TATCAACAGC
GCGAAAGACG ATGCGGCAGG TCAGGCGATT GCTAACCGTT TCACCGCGAA CATCAAAGGT
CTGACTCAGG CTTCCCGTAA CGCTAACGAC GGTATCTCCA TTGCGCAGAC CACTGAAGGC
GCGCTGAACG AAATCAACAA CAACCTGCAG CGTGTGCGTG AACTGGCGGT TCAGTCTGCT
AACAGCACCA ACTCCCAGTC TGACCTCGAC TCCATCCAGG CTGAAATCAC CCAGCGCCTG
AACGAAATCG ACCGTGTATC CGGCCAGACT CAGTTCAACG GCGTGAAAGT CCTGGCGCAG
GACAACACCC TGACCATCCA GGTTGGCGCC AACGACGGTG AAACTATCGA TATCGATCTG
AAGCAGATCA ACTCTCAGAC CCTGGGTCTG GACTCACTGA ACGTGCAGAA AGCGTATGAT
GTGAAAGATA CAGCAGTAAC AACGAAAGCT TATGCCAATA ATGGTACTAC ACTGGATGTA
TCGGGTCTTG ATGATGCAGC TATTAAAGCG GCTACGGGTG GTACGAATGG TACGGCTTCT
GTAACCGGTG GTGCGGTTAA ATTTGACGCA GATAATAACA AGTACTTTGT TACTATTGGT
GGCTTTACTG GTGCTGATGC CGCCAAAAAT GGCGATTATG AAGTTAACGT TGCTACTGAC
GGTACAGTAA CCCTTGCGGC TGGCGCAACT AAAACCACAA TGCCTGCTGG TGCGACAACT
AAAACAGAAG TACAGGAGTT AAAAGATACA CCGGCAGTTG TTTCAGCAGA TGCTAAAAAT
GCCTTAATTG CTGGCGGCGT TGACGCTACC GATGCTAATG GCGCTGAGTT GGTCAAAATG
TCTTATACCG ATAAAAATGG TAAGACAATT GAAGGCGGTT ATGCGCTTAA AGCTGGCGAT
AAGTATTACG CCGCAGATTA CGATGAAGCG ACAGGAGCAA TTAAAGCTAA AACCACAAGT
TATACTGCTG CTGACGGCAC TACCAAAACA GCGGCTAACC AACTGGGTGG CGTAGACGGT
AAAACCGAAG TCGTTACTAT CGACGGTAAA ACCTACAATG CCAGCAAAGC CGCTGGTCAT
GATTTCAAAG CACAACCAGA GCTGGCGGAA GCAGCCGCTA AAACCACCGA AAACCCGCTG
CAGAAAATTG ATGCCGCGCT GGCGCAGGTG GATGCGCTGC GCTCTGATCT GGGTGCGGTA
CAAAACCGTT TCAACTCTGC TATCACCAAC CTGGGCAATA CCGTAAACAA TCTGTCTGAA
GCGCGTAGCC GTATCGAAGA TTCCGACTAC GCGACCGAAG TTTCCAACAT GTCTCGCGCG
CAGATTCTGC AGCAGGCCGG TACTTCCGTT CTGGCGCAGG CTAACCAGGT CCCGCAGAAC
GTGCTGTCTC TGTTACGTTA A
 
Protein sequence
MAQVINTNSL SLLTQNNLNK SQSALGTAIE RLSSGLRINS AKDDAAGQAI ANRFTANIKG 
LTQASRNAND GISIAQTTEG ALNEINNNLQ RVRELAVQSA NSTNSQSDLD SIQAEITQRL
NEIDRVSGQT QFNGVKVLAQ DNTLTIQVGA NDGETIDIDL KQINSQTLGL DSLNVQKAYD
VKDTAVTTKA YANNGTTLDV SGLDDAAIKA ATGGTNGTAS VTGGAVKFDA DNNKYFVTIG
GFTGADAAKN GDYEVNVATD GTVTLAAGAT KTTMPAGATT KTEVQELKDT PAVVSADAKN
ALIAGGVDAT DANGAELVKM SYTDKNGKTI EGGYALKAGD KYYAADYDEA TGAIKAKTTS
YTAADGTTKT AANQLGGVDG KTEVVTIDGK TYNASKAAGH DFKAQPELAE AAAKTTENPL
QKIDAALAQV DALRSDLGAV QNRFNSAITN LGNTVNNLSE ARSRIEDSDY ATEVSNMSRA
QILQQAGTSV LAQANQVPQN VLSLLR