Gene SNSL254_A0600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0600 
Symbol 
ID6483140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp613055 
End bp614062 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content52% 
IMG OID642736017 
Productmannose binding protein FimH 
Protein accessionYP_002039791 
Protein GI194442424 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3539] P pilus assembly protein, pilin FimA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.282698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.11786 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAT ACTCAGCGCT ATTGCTGGCG GGGACCGCAC TCTTTTTCAC CCATCCCGCG 
CTGGCGACGG TTTGCCGTAA TTCAAACGGG ACGGCGACCG ATATCTTTTA TGACCTGTCA
GATGTTTTCA CCAGCGGCAA TAATCAGCCG GGACAGGTGG TGACGCTGCC GGAAAAATCA
GGTTGGGTCG GCGTAAACGC GACGTGCCCG GCGGGGACAA CGGTAAATTA TACCTACCGA
AGCTATGTAT CAGAATTACC GGTACGAAGC ACCGAAGGAA ATTTTAAATA CCTCAAGCTG
AATGACTACC TTCTGGGCGC GATGAGCATC ACCGATAGTG TCGCTGGCGT ATTTTATCCG
CCCCGTAACT ATATTCGCAT GGGCGTCGAC TCTAACGTGT CGCAGCAAAT GCCGTTTGGC
GTGCAGGACT CAAAGCTGGT TTTTAAATTA AAAGTGATAC GGCCTTTTAT TAATATGGTG
ACGATCCCTC GCCAGACAAT GTTTACTGTC TATGTGACGA CCTCTACCGG CGACGCGTTG
AGCACGCCGG TATATACCAT TAGCTACAGC GGCAAAGTGG AAGTGCCGCA AAACTGCGAA
GTGAATGCCG GACAGGTCGT GGAGTTTGAT TTCGGCGATA TCGGCGCGTC GTTATTTAGT
CAGGCGGGGG CGGGTAATCG TCCGCAAGGC GTCACGCCGC AAACGAAAAC TATCGCTATT
AAATGTACCA ACGTCGCGGC GCAAGCCTAT TTATCGATGC GGCTTGAAGC CGAAAAGGCC
TCAGGGCAGG CGATGGTGTC CGATAATCCG GATTTAGGCT TTGTGGTTGC TAATAGCAAC
GGTACGCCGC TCACACCCAA TAATTTGTCG AGTAAAATTC CGTTTCATCT TGATGATAAC
GCCGCCGCTC GCGTAGGTAT TCGCGCCTGG CCGATCAGCG TGACGGGGAA TAAACCGGTG
GAAGGGCCGT TTACTGCGCG CGGCTATCTA CGAGTCGATT ATGATTAA
 
Protein sequence
MKIYSALLLA GTALFFTHPA LATVCRNSNG TATDIFYDLS DVFTSGNNQP GQVVTLPEKS 
GWVGVNATCP AGTTVNYTYR SYVSELPVRS TEGNFKYLKL NDYLLGAMSI TDSVAGVFYP
PRNYIRMGVD SNVSQQMPFG VQDSKLVFKL KVIRPFINMV TIPRQTMFTV YVTTSTGDAL
STPVYTISYS GKVEVPQNCE VNAGQVVEFD FGDIGASLFS QAGAGNRPQG VTPQTKTIAI
KCTNVAAQAY LSMRLEAEKA SGQAMVSDNP DLGFVVANSN GTPLTPNNLS SKIPFHLDDN
AAARVGIRAW PISVTGNKPV EGPFTARGYL RVDYD