Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_pSN254_0156 |
Symbol | insG |
ID | 4929552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_009140 |
Strand | + |
Start bp | 138381 |
End bp | 139925 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642572454 |
Product | transposase InsG for insertion sequence IS1353 |
Protein accession | YP_001102029 |
Protein GI | 134047184 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2801] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.298801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 85 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATTCGT ATGAAGATCG CCTTCGAGCC GTGAGGTTGT ACCTGAAGCT TGGGCGCCGG ATGAGCGCCA CACTACGGCA GCTGGGATAC CCCACCAAGA ACTCGCTGAA GGCCTGGTTG GCAGAATTCG AACGGAATCA GGATCTTCGC CGAGGCTATC AACGGATAAA ACGGCAGTAC ACCGATGAGC AAAAGCAACG GGCAGTAGAT CACTATATCG AACAAGGCTA CTGCCTGAGT CACACAATCC GAAGCCTGGG CTACCCAAGC CGCGAGGCCT TGCGTGCCTG GATCCGTGAT TTACGCCCTG AATTCGCTAG GACGGTCGTC GGCAGCAGCG CTCCCACAGT CGCCCGCTCT CGCCTCGAGA AGCAGCAAGC CGTCATTGCA CTGAACCTGC GCGTAGGTTC GGCAAAGGAT GTGGCCGACA CTGTCGGTGT ATCGCGACCA ACGTTGTATA ACTGGCAGCA TCGATTACTT GGCAAAGTGC CCCTAAAACC CATGACAAAG AAGAAAGGTG ACACCTCGCT CGAGCAGCGG CATGAGGCAC TACTCAGGGA ACTGGCCGAA CTGGAGAGCC AGAACCAGCG GCTTCGCATG GAGAATGCAA TTCTGGAGAA GGCGAGTGAA TTGATAAAAA AAGACATGGG CATCAACCCC CTCGAACTGA CAAGCCGAGA AAAAACGAAG GTGGTTGATG CCCTCAGAGT CACGTTTCCA TTAGCCAATC TGTTGTGCGG CCTGAAGCTG GCGCGCAGCA CATACTTCTA TCAACGCCTG CGGCAGACGC GGCCCGACAA GTACACGCAG GTGCGTGAGG TCATTCGGAC TATCTTCGAG GACAACTACC GCTGCTATGG CTATCGACGC ATTGATAGTG CCTTGCGCCT TGGTGGCATG CGTGTGTCCG AGAAGGTCGT GCGTCGCTTG ATGGCGCAAG AGCGTCTGGT CGTGAGAACA CCGCGCCGCC GGCGCTTCTC GGCGTATGCT GGCGACCCGA CACCAGCGGT CCCGAATCTG CTGAATCGCG ACTTTCACGC GTCGGCGCCG AATACGAAAT GGTTGACCGA TCTGACGGAA ATACACATTC CGGCAGGGAA GGTCTACGTC TCGCCGATCG TCGATTGCTT CGATGGGCTG GTGGTGGCCT GGAATATCGG CACCAGCCCG GATGCGAACC TGGTCAATAC CATGCTGGAT CACGCGGTAC GGACACTGCG ACCCGGTGAG CATCCGGTTA TCCATTCGGA CAGGGGCTCG CATTATCGCT GGCCTGCGTG GATCCGCCGC ACTGAAAATG CCCAATTAAC GCGGTCGATG TCCAAAAAGG GCTGCTCGCC AGACAATGCT GCATGCGAGG GCTTTTTCGG ACGATTGAAG ACCGAACTAA TCTACCCGAG GAATTGGCAG CACGTGACGC TGAAAGACCT CATGACGCGA ATCGATGCCT ATATCCACTG GTACAACGAG CGCCGCATCA AAGTGTCGCT TGGCGGGCGT AGTCCCATCG AGTATCGTCA TGCGGTCGGA TTGATGTCCG TATAA
|
Protein sequence | MYSYEDRLRA VRLYLKLGRR MSATLRQLGY PTKNSLKAWL AEFERNQDLR RGYQRIKRQY TDEQKQRAVD HYIEQGYCLS HTIRSLGYPS REALRAWIRD LRPEFARTVV GSSAPTVARS RLEKQQAVIA LNLRVGSAKD VADTVGVSRP TLYNWQHRLL GKVPLKPMTK KKGDTSLEQR HEALLRELAE LESQNQRLRM ENAILEKASE LIKKDMGINP LELTSREKTK VVDALRVTFP LANLLCGLKL ARSTYFYQRL RQTRPDKYTQ VREVIRTIFE DNYRCYGYRR IDSALRLGGM RVSEKVVRRL MAQERLVVRT PRRRRFSAYA GDPTPAVPNL LNRDFHASAP NTKWLTDLTE IHIPAGKVYV SPIVDCFDGL VVAWNIGTSP DANLVNTMLD HAVRTLRPGE HPVIHSDRGS HYRWPAWIRR TENAQLTRSM SKKGCSPDNA ACEGFFGRLK TELIYPRNWQ HVTLKDLMTR IDAYIHWYNE RRIKVSLGGR SPIEYRHAVG LMSV
|
| |