Gene SNSL254_A0026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0026 
Symbol 
ID6486157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp28418 
End bp29425 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content50% 
IMG OID642735471 
Productmannose binding protein FimH 
Protein accessionYP_002039253 
Protein GI194446333 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3539] P pilus assembly protein, pilin FimA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.119008 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAC CTCTTTTATT TGCTCTGCTG GCGGGAAGTG TCGTATCGCA GTACGCCTTC 
GCAGACGTGT GTAAAAACGT TAACGGTGTA CCTTCCAGTA TTAATTACGA TTTAACGACC
ACTCTGACGG CAGAACAAAA CCAGGTGGGA AAGACGGTTC AACTGGAAAA AAGCCAGGAA
GTTAATGTAC AGGCGGTGTG TCCCGCCGGC GCGTCGACCT ATAGCCAGAC TTATCGCTCC
TATGTGTCGC CATATCCGGT CGTAGAAACG AGCGGTAACT GGAAATATTT AAAGCTGGAC
CCGGACTACC TTGAAGGCGG AATGCGAATT GAGGATTCTT CGGCGGGCGA TATCTATCCG
CCAATGAACA ATGTCCTGAT GGGATATGAT GAAAATGTGA AAGCGGGTCA ACCGTTTTAC
GTTCGTGACT CAAATCTGGA GTTTCAGCTC AAAATTGTTA AACCGTTCGT CGGCACGGTG
AATATTAGTC CTAAGACTAT GTTCAATGTT TATGTCATGA CCGCCGCAGG CGATCCGCTG
ACAGATGTCG TGTACAGCAT TCTTTATAGT GGAACGGTGA CCGTTCCGCA AAGCTGCGAA
ATCAACGCCG GACAAACGAT TCTGGTGAAT TTCGGCGCAT TATACAGCGG CAATTTCAAC
CATGCAGGCC AAAAGCCGGA GGGGGTACGA GCGAAAAAAT TCAGCGTACC GGTAAAGTGC
AGCGGTCTGG ATTCGCAGGT CAATTTAACA ATGCGTCTTA TCGCTACGCC GGATAGCCAC
GTTCCCCAGG CTATCGCTTC GGATAATGCC GATGTCGGCG TAGTGGTCGA AACCGATGAA
GGAAACGCGC TTATCCCCAA TGATGTACAG AGCGTCGCGC CTTTTATCAC CGATAGCGCC
GGACGCGCTA ACATCACATT GCAAGCCTAC CCGGTGAGTA CAACAGGCGA AACGCCAGCG
GAAGGGGCGT TTACCGCGCT GGCCAGTCTG CGAGTGGACT TTGACTAA
 
Protein sequence
MKIPLLFALL AGSVVSQYAF ADVCKNVNGV PSSINYDLTT TLTAEQNQVG KTVQLEKSQE 
VNVQAVCPAG ASTYSQTYRS YVSPYPVVET SGNWKYLKLD PDYLEGGMRI EDSSAGDIYP
PMNNVLMGYD ENVKAGQPFY VRDSNLEFQL KIVKPFVGTV NISPKTMFNV YVMTAAGDPL
TDVVYSILYS GTVTVPQSCE INAGQTILVN FGALYSGNFN HAGQKPEGVR AKKFSVPVKC
SGLDSQVNLT MRLIATPDSH VPQAIASDNA DVGVVVETDE GNALIPNDVQ SVAPFITDSA
GRANITLQAY PVSTTGETPA EGAFTALASL RVDFD