Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A5004 |
Symbol | |
ID | 6874402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4833287 |
End bp | 4834372 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642787871 |
Product | putative major fimbrial subunit |
Protein accession | YP_002218461 |
Protein GI | 198244198 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.170406 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 91 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAGGT TATATCTGGC GCTTATTCTG CTGTTCGCGT ATTCCGGTCA TGGTTACGCC TCCTGTAAAC GTTCCGGAAA TGAAGGCGCG ATCACTATCA CCCCGCCGTC GCAGCTTGTG GTGGATAGCC ACGCCTATAC CGCAGGCGAA GTGTTGTGGC AATCGGGCTG GGTTTCCACC TCCGAAGTCA CAATGGATGG CTGTAGTCGC GATTATAAGG TCGGTTTTTT ATATGAACCC GGTAGCGCGC AGTCAAATAC ATCAGCGACA ATCAATGCGA ATGACGGGAA CAACACACCG GTATTTAGCA CCGGCATTTC CGGCGTGGGC ATTGCGATTA AAACCCAAAC GAATGCTGGC CCTTACGATA ATGTCATGCC AATCGATAAT ACCTACCATA ATGGCGATGG CAATAAAACA CACCATGCGA TGGCTCCCGC CTACAATGTT GAACTGGTCG CCTTAGGCGG TCCGATCACC TCCGGTACCG CGACATTCCA AAGCCCACTG GCGCGCGTAT CTTTTCGCGA TAGCGCAACG GAAGACTCCG GCGGCGATAT CCTGACCCAT CTGTATTTGG GGAATACGCA ATTGATTATG AAAGCGATGG GATGTCGGGT AGAAACACCT GCCATCACCG TTGATTTAGG CAGCGTCAAT TTAGGCAGTT TCGCTAACAG TCAAACTGCG GGCACAGGCG AGCAGGATAT CTTATTGACC TGCGAACAAG GCACCGCTAT TTCCGCATCG TTAAGCGCCC AGCCTGCCAG CGGAAATAAC CCTGATAATT CAGTCATCCA GTTGAGCAAT GCGAGCGCGC CAACCAGCGC AACCGGCGTT GGCGTACAGT TGGGTATTCA GGCGCCGGAC GCCGGTTTCT TTACCGACAG TTTGCCAATC AATCAAAAAA TTGATCTCTT TACTCACACG ATTACCACCA ATGCCGATGG CAGCCAGACG GTTAGCGGCG GAACCATGAA TATGTCCACC ACCCTGAAAA TTAGCGCGCG CTACTATAAA ACCGCCGCCA CGGTAACGGC CGGGCAAGCG AATGCCACAG CAACATTAAA CCTGACCTAT AACTAA
|
Protein sequence | MRRLYLALIL LFAYSGHGYA SCKRSGNEGA ITITPPSQLV VDSHAYTAGE VLWQSGWVST SEVTMDGCSR DYKVGFLYEP GSAQSNTSAT INANDGNNTP VFSTGISGVG IAIKTQTNAG PYDNVMPIDN TYHNGDGNKT HHAMAPAYNV ELVALGGPIT SGTATFQSPL ARVSFRDSAT EDSGGDILTH LYLGNTQLIM KAMGCRVETP AITVDLGSVN LGSFANSQTA GTGEQDILLT CEQGTAISAS LSAQPASGNN PDNSVIQLSN ASAPTSATGV GVQLGIQAPD AGFFTDSLPI NQKIDLFTHT ITTNADGSQT VSGGTMNMST TLKISARYYK TAATVTAGQA NATATLNLTY N
|
| |