Gene Information |
Locus tag | SNSL254_pSN254_0155 |
Symbol | istA |
ID | 4929498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_009140 |
Strand | - |
Start bp | 136735 |
End bp | 138258 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642572453 |
Product | transposase IstA for insertion sequence IS1326 |
Protein accession | YP_001102028 |
Protein GI | 134047140 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
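The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon while Protein Length does not. Neither convention is stated in the record, and the function names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


gene_len = feature_length(136735, 138258)
print(gene_len)                  # 1524
print(protein_length(gene_len))  # 507
```

Both values agree with the Gene Length (1524 bp) and Protein Length (507 aa) fields.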
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.791462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 87 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATAAACG TGGCGATATT GAGCGCAATT CGACGCTGGC ATTTTCGCGA TGGTGCGTCG ATTCGGGAAA TAGCCCGACG AAGCGGCCTG TCCAGGAACA CCGTTCGCAA GTATTTGCAA AGCAAGGTGG TTGAACCGCA GTACCCAGCG CGAGACAGCG TTGGCAAGTT AAGTCCTTTT GAGCCCAAGT TAAGGCAGTG GCTCTCCACC GAGCACAAAA AGACAAAGAA GCTGCGCAGA AACCTGCGCA GCATGTACCG GGATTTGGTC GCTTTGGGCT TTACCGGGTC TTATGACCGA GTGTGTGCCT TTGCCCGACA GTGGAAAGAT TCCGAACAGT TCAAGGCGCA AACCTCGGGC AAGGGTTGTT TCATCCCCTT GCGCTTTGCT TGTGGCGAAG CCTTCCAATT CGATTGGAGT GAGGACTTTG CCCGCATAGC GGGCAAACAG GTCAAACTTC AGATTGCCCA GTTTAAGTTG GCCCACAGCC GGGCCTTTGT GCTTCGGGCT TACTACCAGC AAAAACATGA AATGCTGTTT GATGCCCACT GGCATGCCTT TCAAATCTTC GGTGGCATTC CCAAGCGCGG CATCTACGAC AACATGAAGA CCGCTGTGGA TTCGGTGGGG CGTGGCAAAG AGCGCAGGGT CAATCAGCGG TTCACTGCCA TGGTCAGCCA CTACCTGTTT GATGCGCAGT TCTGTAATCC AGCATCGGGT TGGGAGAAAG GCCAGATTGA GAAGAACGTG CAGGATTCCC GCCAACGCCT GTGGCAAGGG GCACCAGACT TTCAAAGCCT TGCTGATTTG AATGTGTGGC TTGAGCATCG CTGCAAAGCG CTGTGGTCTG AGCTGCGCCA CCCCGAATTG GACCAAACCG TGCAAGAGGC CTTTGCCGAT GAACAAGGCG AGTTGATGGC GCTACCCAAT GCCTTTGATG CATTCGTGGA GCAAACCAAG CGAGTCACTT CAACCTGCCT TGTTCACCAC GAGGGCAATC GCTACAGCGT TCCTGCCAGT TACGCCAACA GGGCCATCAG CCTTCGGATT TATGCAGACA AGCTGGTGAT GGCTGCCGAA GGCCAACACA TTGCCGAGCA TCCAAGATTG TTTGGCAGTG GCCACGCTCG GCGTGGCCAC ACACAATACG ACTGGCACCA TTACTTGTCT GTGCTTCAGA AGAAACCTGG GGCGTTGCGC AATGGTGCGC CATTTGCTGA ATTGCCACCC GCGTTCAAGA AGCTTCAATC CATCTTGCTG CAACGCCCCG GCGGTGACCG TGACATGGTG GAAATTCTGG CCCTTGTATT GCACCACGAT GAAGGTGCGG TACTCAGTGC TGTGGAATTG GCATTGGAGT GTGGCAAGCC ATCGAAGGAG CATGTGCTTA ATCTGTTGGG ACGTTTGACC GAAGAACCTC CACCCAAACC GATTCCAATT CCCAAGGGGT TAAGGCTGAC ATTGGAACCA CAGGCCAACG TGAACCGCTA TGACAGTTTA AGGAGAGCCC ATGATGCAGC ATGA
|
Protein sequence | MINVAILSAI RRWHFRDGAS IREIARRSGL SRNTVRKYLQ SKVVEPQYPA RDSVGKLSPF EPKLRQWLST EHKKTKKLRR NLRSMYRDLV ALGFTGSYDR VCAFARQWKD SEQFKAQTSG KGCFIPLRFA CGEAFQFDWS EDFARIAGKQ VKLQIAQFKL AHSRAFVLRA YYQQKHEMLF DAHWHAFQIF GGIPKRGIYD NMKTAVDSVG RGKERRVNQR FTAMVSHYLF DAQFCNPASG WEKGQIEKNV QDSRQRLWQG APDFQSLADL NVWLEHRCKA LWSELRHPEL DQTVQEAFAD EQGELMALPN AFDAFVEQTK RVTSTCLVHH EGNRYSVPAS YANRAISLRI YADKLVMAAE GQHIAEHPRL FGSGHARRGH TQYDWHHYLS VLQKKPGALR NGAPFAELPP AFKKLQSILL QRPGGDRDMV EILALVLHHD EGAVLSAVEL ALECGKPSKE HVLNLLGRLT EEPPPKPIPI PKGLRLTLEP QANVNRYDSL RRAHDAA
|
| |
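The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is an accepted start codon and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 and last 24 nucleotides of the gene sequence. It assumes the displayed sequence is the coding strand in 5' to 3' orientation; the helper `translate` and its `at_start` parameter are hypothetical names introduced for this example and are not part of any database tooling.

```python
from itertools import product

# Codon table in TCAG order. Table 11 uses the same amino acid
# assignments as the standard code and differs in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, at_start: bool = True) -> str:
    """Translate a coding-strand fragment, stopping at the first stop codon.

    If at_start is True and the first codon is a table 11 start codon,
    it is translated as M regardless of its usual meaning.
    """
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and at_start and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "GTGATAAACG TGGCGATATT GAGCGCAATT"  # first 30 nt of the gene sequence
tail = "AGGAGAGCCC ATGATGCAGC ATGA"        # last 24 nt, ending in the TGA stop

print(translate(head))                  # MINVAILSAI
print(translate(tail, at_start=False))  # RRAHDAA
```

The two outputs match the first ten and last seven residues of the protein sequence above.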