Gene SeHA_C2173 details

Gene Information

Locus tag: SeHA_C2173
Symbol:
ID: 6491549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 2095295
End bp: 2096776
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 49%
IMG OID: 642742368
Product: flagellin
Protein accession: YP_002046008
Protein GI: 194451133
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
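
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; the record does not state either convention. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp copied from the record above.
gene_length, protein_length = cds_lengths(2095295, 2096776)
print(gene_length, protein_length)  # 1482 493
```

Both values match the Gene Length and Protein Length fields of the record.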


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.569586
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.00540488
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCACAAG TCATTAATAC AAACAGCCTG TCGCTGTTGA CCCAGAATAA CCTGAACAAA 
TCCCAGTCCG CTCTGGGTAC CGCTATCGAG CGTCTGTCTT CCGGTCTGCG TATCAACAGC
GCGAAAGACG ATGCGGCAGG TCAGGCGATT GCTAACCGTT TCACCGCGAA CATCAAAGGT
CTGACTCAGG CTTCCCGTAA CGCTAACGAC GGTATCTCCA TTGCGCAGAC CACTGAAGGC
GCGCTGAACG AAATCAACAA CAACCTGCAG CGTGTGCGTG AACTGGCGGT TCAGTCTGCT
AACAGCACCA ACTCCCAGTC TGACCTCGAC TCCATCCAGG CTGAAATCAC CCAGCGCCTG
AACGAAATCG ACCGTGTATC CGGCCAGACT CAGTTCAACG GCGTGAAAGT CCTGGCGCAG
GACAACACCC TGACCATCCA GGTTGGTGCC AACGACGGTG AAACCATCGA TATCGATCTG
AAGCAGATCA ACTCTCAGAC CCTGGGTCTG GATACGCTGA ATGTTCAACA AAAATATAAG
GTCAGCGATA CGGCTGCAAC TGTCACTGGC TATACAGATT CTGCTACTGC TATTGACAAA
TCTACGTTTG CTGCATCAGC AACTACCTTA GGTGGTACTC CTGCTATTAC TGGTGATCTG
AAGTTTGATG ATACTACTGG AAAATATTAC GCTGATGTTT CAGGTACTAC GGCTAAAGAT
GGTGTTTATG AAGTAACAGT TGCAGCCGAT GGAAAAGTCA CTTTAACTGG CACACCAACA
GGACCAATTA CTGCTGGCTT CCCTTCAACT GCAACAAAAG ATGTTAAACA AACTCAGCAA
GAAAACGCTG ATTTGACAGA GGCCAAAGCC GCATTGACAG CAGCGGGTGT TGCAGCGGCC
GGCACAGCAT CTGTTGTTAA GATGTCTTAT ACTGATAATA ACGGTAAAAC TATTGATGGT
GGTTTAGCAG TTAAGGTAGG CGATGATTAC TATTCTGCAA CTCAAAATAA AGATGGTTCC
ATAAGTATTA ATACTACGAA ATACACTGCA GATGACGGTA CATCCAAAAC TGCACTAAAC
AAACTGGGTG GCGCAGACGG CAAAACCGAA GTTGTTTCTA TTGGTGGTAA AACTTACGCT
GCAAGTAAAG CCGAAGGTCA CAACTTTAAA GCACAGCCTG ATCTGGCGGA AGCGGCTGCT
ACAACCACCG AAAACCCGCT GCAGAAAATT GATGCTGCTT TGGCACAGGT TGACACGTTA
CGTTCTGACC TGGGTGCGGT ACAGAACCGT TTCAACTCCG CTATTACCAA CCTGGGCAAC
ACCGTAAACA ACCTGACTTC TGCCCGTAGC CGTATCGAAG ATTCCGACTA CGCGACCGAA
GTTTCCAACA TGTCTCGCGC GCAGATTCTG CAGCAGGCCG GTACCTCCGT TCTGGCGCAG
GCGAACCAGG TTCCGCAAAA CGTCCTCTCT TTACTGCGTT AA
 
Protein sequence
MAQVINTNSL SLLTQNNLNK SQSALGTAIE RLSSGLRINS AKDDAAGQAI ANRFTANIKG 
LTQASRNAND GISIAQTTEG ALNEINNNLQ RVRELAVQSA NSTNSQSDLD SIQAEITQRL
NEIDRVSGQT QFNGVKVLAQ DNTLTIQVGA NDGETIDIDL KQINSQTLGL DTLNVQQKYK
VSDTAATVTG YTDSATAIDK STFAASATTL GGTPAITGDL KFDDTTGKYY ADVSGTTAKD
GVYEVTVAAD GKVTLTGTPT GPITAGFPST ATKDVKQTQQ ENADLTEAKA ALTAAGVAAA
GTASVVKMSY TDNNGKTIDG GLAVKVGDDY YSATQNKDGS ISINTTKYTA DDGTSKTALN
KLGGADGKTE VVSIGGKTYA ASKAEGHNFK AQPDLAEAAA TTTENPLQKI DAALAQVDTL
RSDLGAVQNR FNSAITNLGN TVNNLTSARS RIEDSDYATE VSNMSRAQIL QQAGTSVLAQ
ANQVPQNVLS LLR
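
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration and not the database's own pipeline. It assumes that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in which codons may initiate translation), and it only translates the first and last 10-base groups of the gene sequence shown above. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 42 bases of the gene sequence, copied from above.
print(translate("ATGGCACAAG TCATTAATAC AAACAGCCTG"))  # MAQVINTNSL
print(translate("GCGAACCAGG TTCCGCAAAA CGTCCTCTCT TTACTGCGTT AA"))  # ANQVPQNVLSLLR
```

The two outputs correspond to the first ten and last thirteen residues of the protein sequence. Each full line of the gene sequence holds 60 bases, so every line begins in frame.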