Gene SeD_A0189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0189 
Symbol 
ID6871331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp201644 
End bp202723 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content53% 
IMG OID642783436 
Productputative fimbrial protein precurosr 
Protein accessionYP_002214130 
Protein GI198243284 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3539] P pilus assembly protein, pilin FimA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.558575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATT TATGGATGCT GCTGGCGCTG TCGCTATTTT CAGGCCATGC GCTGGCAGAC 
GGAACGATGG GCAACGGAAG CGGCTGGTGT CAACCCACCA GCGGCACGCA TAATTTCTTT
TTTCCCCTTG ACCAGACCAT TACCGATACG GATGAGAACC AGGCAGGGAA AATAGTCAAA
GAGAGTTGGT CGGTCGGCGG CGAATACAGC GCCAGGTGCG ACTGCGATAA TAAAGATTAT
CAGGGCGTTA ACTATTTCAC CGCCACGACC GGCGATTTAA CACAAAAAGG AACGTACAGC
GAAGCGGGTA GCAATGGGCA ACAGATGGAT TTTTATGTTC TGGTCGCGGG TAAGCTGGAG
ATTGGTACGG AAACCTACAT CGTCGGTAAC CTGAAACAGT ATATCCCCGT TCCCTTTTCA
GCGATCAGTA ATCAGGCCCC CACCGCAGGC GGGTGTACGG GCGCGGACAT AAACAAAATG
TCCGCAGGGA ATAAGGGTAA CGTGCGTATT TATATTACTC ACCCACTGGT AGGTGAAATC
ACCATTCCTG AGACGACGAT TATGAATCTC TATTTGTCAA AAACGCCGGG CAGCAGCGGA
GATAATATTC CCCCTTCCGT TCCACCGATG GCGCACGTCA CCATGTCCGG GACCATTACC
GTGCCGCAGT CCTGCTCCAT CAACGCCGGG CAGGTTATCG AGGTCAGGCT ACCGGATATT
GAGGGCAAAG ATATTCGTCA CCTCGGCGAC AGTCCGCAGA ACTCGCACGT CACCACTCAG
GTAAACTTTA CCTGTAGTAA CGTGGCGGAC GGCACCAACC TGTCGATGTC ATTAAATGGC
GCAACCGATC CGCACAACCC GGACTACCTG AAAACTGACA ATGAGAATTT GGGGATACGG
ATTTCCGATA AATACGATAA TACCATCGTT CCCGGCGGCA GCGCCGAATT GCCGATTGAA
GATTACGCCG ACGGTAAAGG CAGCACCGAG TTCACCGCCG CGCCGGTCAA TACCACCGGA
CATGTTCCCC ACACCGGAGA ATACCAGGCT ACCGCCACGC TGGAGATTCA GATTCGCTGA
 
Protein sequence
MKNLWMLLAL SLFSGHALAD GTMGNGSGWC QPTSGTHNFF FPLDQTITDT DENQAGKIVK 
ESWSVGGEYS ARCDCDNKDY QGVNYFTATT GDLTQKGTYS EAGSNGQQMD FYVLVAGKLE
IGTETYIVGN LKQYIPVPFS AISNQAPTAG GCTGADINKM SAGNKGNVRI YITHPLVGEI
TIPETTIMNL YLSKTPGSSG DNIPPSVPPM AHVTMSGTIT VPQSCSINAG QVIEVRLPDI
EGKDIRHLGD SPQNSHVTTQ VNFTCSNVAD GTNLSMSLNG ATDPHNPDYL KTDNENLGIR
ISDKYDNTIV PGGSAELPIE DYADGKGSTE FTAAPVNTTG HVPHTGEYQA TATLEIQIR