Gene Information |
Locus tag | SNSL254_A2805 |
Symbol | |
ID | 6482390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2748210 |
End bp | 2749460 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642738129 |
Product | Peyer's patch-specific virulence factor GipA |
Protein accession | YP_002041863 |
Protein GI | 194442869 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.537832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTGCTC ATCAGAATGC CTCCTTTCCT CCCCGGCCTG AAGGCCGGGG AGGAAAGGAG GCGGTTTTCC GGCTAACTGT CTTTTGCATA ATCACATTTT CCTCTTTAAC ATGTGAAGCC ATGAAACGCG CATATAAATA CCGGTTTTAC CCCACGACTG AGCAGGCTGA GCTTTTAGCT CAGACGTTCG GTTGTGTGCG TTTCGTCTAC AACTCCATCC TCCGCTGGCG TACCGATGCG TACTACGAGC GAAAGGAAAA GATCGGTTAC CTACAGGCCA ACGCTCGCCT TACGGCGCTG AAAAAGGAGC CAGAATTTGC CTGGCTTAAC GACGTTTCCT GCGTTCCCCT CCAGCAGTCT TTGCGCCACC AACAAACCGC CTTTGCTAAC TTCTTCGCCG GACGGGCTGC ATATCCGGCT TTCAAAAGCA AACGGCACAA GCAGGCGGCT GAGTTCACTG CGAGCGCGTT TAAATACCGC GACGGCAAGC TGTACATGGC AAAGAACAAA ATCCCCTTAG ACGTGCGCTG GAGTCGTCCG CTGCCGTCCG TGCCGTCTAC CGTCACCATT TCCAAAGATG CCGCAGGGCG GTACTTTGTT TCGTGCCTTT GCGAATTTGA ACCCGCATCA CTGCCGATCA CCTCTTCAAT GGTCGGCATT GATGTTGGTT TAAAAGATTT GTTCGTCACC GATACCGGAT TCAGGTCCGG CAATCCCCGC CATACCGCTA AATACGCGGC TCGCCTGGCA CTACTCCAGC GCCGGTTAAG CAAAAAGGCC AAAGGCTCAA AGAACCGCGC CAAAGCCCAC TTAAAGGTAG CCCGACTCCA CGCGAAAATT GCTGATTGCC GACTGGATGC CCTGCACAAG GCCACCCGCA AACTGATTAA CGATAACCAA GTTGTATGCG TCGAATCCCT GAAAGTGAGG AACATGATCC GCAACCCGTC GCTATCCAAA GCAATAGCAG ACGCGAGCTG GGGCGAACTT GTGCGCCAGC TCCGGTACAA AGGCGAATGG GCGGGGCGGT CAGTGGTAGC CATTGACCAG TTTTTCCCGT CCTCAAAACG CTGTAGCTGT TGCGGTTTCA TCATGAAAAA AATGCCTCTT GATGTTCGTA AATGGCAGTG CCCTGAGTGC GGAACTGACC ACGACCGGGA CGTTAACGCG GCACGTAATA TCAAAGCTGC CGGGCTGGCA GTGTTAGCCC ACGGAGAGCC TGTAAATCCT GAATCGCTCA AAGCGGCTTA G
|
Protein sequence | MFAHQNASFP PRPEGRGGKE AVFRLTVFCI ITFSSLTCEA MKRAYKYRFY PTTEQAELLA QTFGCVRFVY NSILRWRTDA YYERKEKIGY LQANARLTAL KKEPEFAWLN DVSCVPLQQS LRHQQTAFAN FFAGRAAYPA FKSKRHKQAA EFTASAFKYR DGKLYMAKNK IPLDVRWSRP LPSVPSTVTI SKDAAGRYFV SCLCEFEPAS LPITSSMVGI DVGLKDLFVT DTGFRSGNPR HTAKYAARLA LLQRRLSKKA KGSKNRAKAH LKVARLHAKI ADCRLDALHK ATRKLINDNQ VVCVESLKVR NMIRNPSLSK AIADASWGEL VRQLRYKGEW AGRSVVAIDQ FFPSSKRCSC CGFIMKKMPL DVRKWQCPEC GTDHDRDVNA ARNIKAAGLA VLAHGEPVNP ESLKAA
|
| |
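The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the annotated CDS length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for locus tag SNSL254_A2805.
length_bp = gene_length(2748210, 2749460)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1251 416
```

Both results agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields listed in the record.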