Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C5001 |
Symbol | |
ID | 6488465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 4879147 |
End bp | 4880232 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642745043 |
Product | putative major fimbrial subunit |
Protein accession | YP_002048612 |
Protein GI | 194448428 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAGGT TATATCTGGC GCTTATTCTG CTGTTCGCGT ATTCCGGTCA TGGTTACGCC TCCTGTAAAC GTTCCGGAAA TGAAGGCGCG ATCACTATCA CCCCGCCGTC GCAACTTGTG GTGGATAGCC ACGCCTATAC CGCAGGCGAA GTGTTGTGGC AATCGGGCTG GGTTTCCACC TCCGAAGTCA CAATGGATGG CTGTAGTCGC GATTATAAGG TCGGTTTTTT ATATGAACCC GGTAGCGCGC AGTCAAATGC ATCAGCGACA ATCAATGCGA ATGACGGGAA CAACACGCCG GTATTTAGCA CCGGGATTTC CGGCGTGGGC ATTGCGATTA AAACCCAAAC GAATGCTGGC CCTTACGATA ATGTCATGCC AATCGATAAT ACCTACCATA ATGGCGATGG CAATAAAACA CACCATGCGA TGGCTCCCGC CTACAATGTT GAACTGGTTG CCTTAGGCGG TCCGATCACC TCCGGTACCG CGACATTCCA AAGTCCACTG GCGCGCGTAT CTTTTCGCGA TAGCGCAACG GAAGACTCCG GCGGCGATGT TCTGACCCAT CTGTATTTAG GGAATACGCA ATTGATTATG AAAGCGATGG GATGTCGGGT AGAAACACCT GCCATCACCG TTGATTTAGG CAGCGTCAAT TTAGGCAGTT TCGCTAACAG TCAAACCGCG GGCACAGGCG AGCAGGATAT CTTATTGACC TGCGAACAAG GCACCGCTAT TTCCGCATCG TTAAGCGCCC AGCCTGCCAG CGGAAATAAC CCTGATAATT CAGTCATCCA GTTGAGCAAT GCAAGCGCGC CAACCAGCGC AACCGGCGTT GGCGTACAGT TGGGTATTCA GGCGCCGGAC GCCGGTTTCT TTACCGACAG TTTGCCAATC AATCAAAAAA TTGATCTCTT TACTCACACG ATTACCACCA ATGCCGATGG CAGCCAGACG GTTAACGGCG GAACCATGAA TATGTCGACC ACCCTGAAAA TTAGCGCGCG CTACTATAAA ACCGCCGCCA CGGTAACGGC CGGGCAAGCG AATGCCACGG CAACATTAAA CCTGACCTAT AACTAA
|
Protein sequence | MRRLYLALIL LFAYSGHGYA SCKRSGNEGA ITITPPSQLV VDSHAYTAGE VLWQSGWVST SEVTMDGCSR DYKVGFLYEP GSAQSNASAT INANDGNNTP VFSTGISGVG IAIKTQTNAG PYDNVMPIDN TYHNGDGNKT HHAMAPAYNV ELVALGGPIT SGTATFQSPL ARVSFRDSAT EDSGGDVLTH LYLGNTQLIM KAMGCRVETP AITVDLGSVN LGSFANSQTA GTGEQDILLT CEQGTAISAS LSAQPASGNN PDNSVIQLSN ASAPTSATGV GVQLGIQAPD AGFFTDSLPI NQKIDLFTHT ITTNADGSQT VNGGTMNMST TLKISARYYK TAATVTAGQA NATATLNLTY N
|
| |