Gene SeD_A5004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A5004 
Symbol 
ID6874402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4833287 
End bp4834372 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content52% 
IMG OID642787871 
Productputative major fimbrial subunit 
Protein accessionYP_002218461 
Protein GI198244198 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3539] P pilus assembly protein, pilin FimA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.170406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones91 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAGGT TATATCTGGC GCTTATTCTG CTGTTCGCGT ATTCCGGTCA TGGTTACGCC 
TCCTGTAAAC GTTCCGGAAA TGAAGGCGCG ATCACTATCA CCCCGCCGTC GCAGCTTGTG
GTGGATAGCC ACGCCTATAC CGCAGGCGAA GTGTTGTGGC AATCGGGCTG GGTTTCCACC
TCCGAAGTCA CAATGGATGG CTGTAGTCGC GATTATAAGG TCGGTTTTTT ATATGAACCC
GGTAGCGCGC AGTCAAATAC ATCAGCGACA ATCAATGCGA ATGACGGGAA CAACACACCG
GTATTTAGCA CCGGCATTTC CGGCGTGGGC ATTGCGATTA AAACCCAAAC GAATGCTGGC
CCTTACGATA ATGTCATGCC AATCGATAAT ACCTACCATA ATGGCGATGG CAATAAAACA
CACCATGCGA TGGCTCCCGC CTACAATGTT GAACTGGTCG CCTTAGGCGG TCCGATCACC
TCCGGTACCG CGACATTCCA AAGCCCACTG GCGCGCGTAT CTTTTCGCGA TAGCGCAACG
GAAGACTCCG GCGGCGATAT CCTGACCCAT CTGTATTTGG GGAATACGCA ATTGATTATG
AAAGCGATGG GATGTCGGGT AGAAACACCT GCCATCACCG TTGATTTAGG CAGCGTCAAT
TTAGGCAGTT TCGCTAACAG TCAAACTGCG GGCACAGGCG AGCAGGATAT CTTATTGACC
TGCGAACAAG GCACCGCTAT TTCCGCATCG TTAAGCGCCC AGCCTGCCAG CGGAAATAAC
CCTGATAATT CAGTCATCCA GTTGAGCAAT GCGAGCGCGC CAACCAGCGC AACCGGCGTT
GGCGTACAGT TGGGTATTCA GGCGCCGGAC GCCGGTTTCT TTACCGACAG TTTGCCAATC
AATCAAAAAA TTGATCTCTT TACTCACACG ATTACCACCA ATGCCGATGG CAGCCAGACG
GTTAGCGGCG GAACCATGAA TATGTCCACC ACCCTGAAAA TTAGCGCGCG CTACTATAAA
ACCGCCGCCA CGGTAACGGC CGGGCAAGCG AATGCCACAG CAACATTAAA CCTGACCTAT
AACTAA
 
Protein sequence
MRRLYLALIL LFAYSGHGYA SCKRSGNEGA ITITPPSQLV VDSHAYTAGE VLWQSGWVST 
SEVTMDGCSR DYKVGFLYEP GSAQSNTSAT INANDGNNTP VFSTGISGVG IAIKTQTNAG
PYDNVMPIDN TYHNGDGNKT HHAMAPAYNV ELVALGGPIT SGTATFQSPL ARVSFRDSAT
EDSGGDILTH LYLGNTQLIM KAMGCRVETP AITVDLGSVN LGSFANSQTA GTGEQDILLT
CEQGTAISAS LSAQPASGNN PDNSVIQLSN ASAPTSATGV GVQLGIQAPD AGFFTDSLPI
NQKIDLFTHT ITTNADGSQT VSGGTMNMST TLKISARYYK TAATVTAGQA NATATLNLTY
N