Gene Information |
Locus tag | SeD_A3201 |
Symbol | epaO |
ID | 6874446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3078732 |
End bp | 3079643 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642786219 |
Product | surface presentation of antigens protein SpaO |
Protein accession | YP_002216860 |
Protein GI | 198245837 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1886] Flagellar motor switch/type III secretory pathway protein |
TIGRFAM ID | [TIGR02551] type III secretion system apparatus protein YscQ/HrcQ |
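The length fields in the table above can be checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the record states neither convention explicitly, and the variable names are chosen here for illustration.

```python
# Values copied from the Gene Information table.
start_bp = 3078732
end_bp = 3079643

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 912 303
```

Under these assumptions the result agrees with the Gene Length (912 bp) and Protein Length (303 aa) fields.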
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATTGC GTGTGAGACA GATTGATCGT CGCGAATGGC TATTGGCGCA AACCGCGACA GAATGCCAGC GCCATGGCCA GGAAGCTACG CTGGAATATC CGACGCGACA AGGAATGTGG GTTCGGTTGA GCGATGCAGA AAAACGGTGG TCGGCGTGGA TTCAACCTGG GGACTGGCTT GAGCATGTCT CTCCCGCTCT GGCGGGGGCG GCGGTTTCTG CTGGCGCTGA GCACCTGGTC GTTCCCTGGC TTGCTGCGAC AGAGCGACCG TTTGAGTTGC CCGTGCCGCA TTTGTCCTGT CGGCGTTTAT GCGTAGAGAA CCCCGTGCCG GGAAGCGCGC TGCCGGAAGG GAAATTGTTG CACATTATGA GCGATCGGGG CGGCCTGTGG TTTGAATATC TTCCTGAACT GCCTGCAGTC GGGGGCGGCA GGCCGAAAAT GCTGCGTTGG CCGTTGCGCT TTGTAATCGG TAGCAGTGAT ACGCAGCGTT CGTTGCTGGG CCGAATCGGG ATCGGAGATG TACTCCTGAT TCGTACTTCC CGTGCGGAAG TTTATTGCTA CGCGAAAAAG TTAGGTCATT TCAACCGTGT TGAAGGGGGA ATTATTGTGG AAACGTTAGA TATTCAACAT ATCGAAGAAG AAAATAATAC AACTGAAACT GCAGAAACTC TGCCTGGCTT GAATCAATTG CCCGTCAAAC TGGAATTTGT TTTGTATCGT AAGAACGTTA CCCTCGCCGA ACTCGAAGCC ATGGGGCAGC AACAGCTATT ATCACTGCCG ACCAATGCTG AACTTAACGT TGAAATTATG GCGAATGGTG TTTTGCTGGG TAATGGCGAA CTGGTACAGA TGAATGACAC CTTAGGCGTT GAGATCCATG AATGGCTGAG CGAGTCTGGT AATGGGGAAT GA
Protein sequence | MSLRVRQIDR REWLLAQTAT ECQRHGQEAT LEYPTRQGMW VRLSDAEKRW SAWIQPGDWL EHVSPALAGA AVSAGAEHLV VPWLAATERP FELPVPHLSC RRLCVENPVP GSALPEGKLL HIMSDRGGLW FEYLPELPAV GGGRPKMLRW PLRFVIGSSD TQRSLLGRIG IGDVLLIRTS RAEVYCYAKK LGHFNRVEGG IIVETLDIQH IEEENNTTET AETLPGLNQL PVKLEFVLYR KNVTLAELEA MGQQQLLSLP TNAELNVEIM ANGVLLGNGE LVQMNDTLGV EIHEWLSESG NGE
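The gene sequence is printed in space-separated blocks of ten bases and already reads in the coding direction (it starts with ATG even though the gene lies on the minus strand). A minimal Python sketch for stripping the spacing, computing GC fraction, and translating to protein is shown below. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons; special handling of alternative start codons is not implemented.

```python
# Standard codon-to-amino-acid assignments, ordered by base T, C, A, G.
# Assumption: translation table 11 uses these same assignments for
# internal codons; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        # Codons with ambiguous bases are reported as "X".
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
fragment = "ATGTCATTGC GTGTGAGACA GATTGATCGT"
print(translate(fragment))  # MSLRVRQIDR
```

Applied to the full Gene sequence field, `translate` can be compared against the Protein sequence field after passing the latter through `clean`, and `gc_fraction` against the GC content field.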