Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_pSN254_0141 |
Symbol | tnpA |
ID | 4929451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_009140 |
Strand | - |
Start bp | 121107 |
End bp | 124079 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642572440 |
Product | transposon Tn21 transposase TnpA |
Protein accession | YP_001102015 |
Protein GI | 134047160 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 83 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACGCC GCTCAATCCT GTCCGCCACC GAGCGCGAAA GCCTGCTGGC ACTGCCAGAT GCCAAAGACG AACTGATACG GCACTACACG TTCAACGAAA CCGACCTGTC GGTGATCCGT CAGCGTCGCG GCGCCGCGAA TCGATTGGGC TTCGCTGTGC AGCTTTGCTA CTTGCGATTC CCTGGCACCT TTTTGGGCGT CGATGAGCCT CCGTTTCCGC CCCTGTTGCG CATGGTGGCC GCGCAACTCA AGATGCCAGT GGAAAGTTGG AGCGAGTACG GCCAGCGCGA ACAGACACGG CGGGAGCACT TGGTCGAGCT GCAAACGGTT TTTGGGTTCA AGCCCTTCAC CATGAGCCAC TATCGGCAAG CCGTGCATAC ATTGACCGAG CTGGCCTTGC AGACCGACAA AGGCATCGTG CTGGCGAGCG CACTTGTCGA GAATCTGCGG CGGCAGAGCA TTATCCTGCC CGCCATGAAT GCCATCGAGC GCGCAAGCGC CGAGGCCATC ACCCGTGCCA ACCGACGCAT TTACGCGGCG CTGACCGATT CTTTGTTATC ACCCCACCGT CAGCGCCTGG ACGAACTTCT CAAGCGCAAG GACGGCAGTA AAGTGACGTG GCTGGCATGG CTGCGCCAGT CGCCTGCCAA ACCGAACTCT CGCCACATGC TCGAACATAT TGAGCGCCTG AAATCCTGGC AAGCACTTGA TCTGCCCGCA GGCATCGAGC GGCAGGTTCA CCAGAACCGC CTGCTCAAAA TCGCTCGTGA AGGTGGCCAG ATGACGCCTG CTGATCTGGC AAAGTTCGAG GTGCAACGAC GCTATGCCAC GCTGGTAGCG CTGGCCATCG AAGGCATGGC CACCGTCACC GATGAAATCA TCGACCTTCA CGATCGCATC ATCGGCAAGC TGTTCAACGC GGCCAAGAAC AAGCATCAGC AGCAGTTCCA GGCTTCCGGC AAGGCGATCA ACGACAAGGT GCGGATGTAT GGGCGCATCG GTCAAGCGTT GATTGAGGCC AAGCAAAGCG GCAGCGATCC GTTCGCCGCC ATCGAGGCCG TTATGCCCTG GGACACCTTC GCCGCCAGCG TCACCGAAGC GCAAACATTG GCGCGGCCTG CCGACTTTGA TTTCCTGCAC CACATCGGTG AAAGCTATGC CACGCTACGC CGCTACGCGC CGCAGTTCCT GGGCGTGCTC AAATTGCGGG CTGCGCCCGC CGCCAAGGGT GTGCTCGATG CCATCGACAT GCTGCGCGGC ATGAACAGCG ACAGCGCGCG CAAGGTGCCC GCCGATGCGC CAACCGCATT CATCAAGCCG CGCTGGGCAA AGCTGGTTCT GACCGACGAC GGCATCGACC GGCGTTACTA CGAGTTATGC GCCCTGTCGG AGCTGAAGAA CGCGCTGCGC TCCGGTGATG TCTGGGTGCA GGGTTCTCGC CAGTTCAAGG ACTTCGACGA ATACCTGGTG CCGGTCGAGA AGTTCGCCAC TTTGAAGCTG GCCAGCGAAT TGCCGCTGGC AGTGGCCACC GACTGCGACC AATACCTGCA TGACCGGTTG GAATTGTTGG AGGCGCAACT CGCCACAGTC AACCGCATGG CTGCGGCCAA CGACTTACCG GATGCCATCA TCACCACCGC GTCAGGCCTG AAGATCACGC CGCTGGACGC GGCAGTACCA GACGCCGCGC AAGCCATGAT CGACCAGACA GCTATGCTGC TGCCGCACCT CAAAATCACC GAGTTGCTGA TGGAGGTCGA TGAATGGACG GGCTTCACCC GCCACTTCAC ACACCTGAAG ACCAGCGACA CGGCCAAGGA CAAAACCTTG CTGTTGACGA CGATCCTGGC CGACGCGATC AACCTGGGTC TGACCAAAAT GGCCGAGTCC TGCCCTGGCA CCACCTACGC CAAGCTGTCT TGGCTGCAAG CCTGGCACAT CCGCGATGAA ACCTATTCGA CGGCGCTGGC CGAGCTGGTG AATGCGCAGT TTCGGCAACC CTTCGCCGGC AACTGGGGTG ACGGCACCAC GTCATCGTCG GACGGCCAGA ACTTCAGAAC CGGCAGCAAA GCAGAAAGCA CTGGTCATAT CAACCCGAAG TATGGAAGCA GTCCAGGACG GACTTTCTAC ACCCATATCT CCGACCAGTA CGCGCCCTTC AGTGCCAAGG TGGTCAACGT GGGCATTCGT GATTCAACTT ACGTGCTTGA TGGCCTGCTG TACCACGAGT CGGACTTGCG CATCGAGGAA CACTACACCG ACACGGCAGG CTTCACCGAT CACGTGTTTG GCTTGATGCA TTTGCTGGGA TTTCGCTTCG CGCCGCGTAT CCGTGACTTG GGCGAAACCA AGCTATTCAT CCCCAAGGGC GATGCCGCCT ATGACGCGCT CAAGCCGATG ATTAGCAGCG ACAGGCTGAA CATCAAGCAA ATACGCGCCC ATTGGGATGA AATTCTGCGG CTGGCCACCT CCATCAAGCA AGGCACGGTA ACGGCTTCGC TGATGCTGCG CAAACTCGGC AGCTACCCGC GCCAGAACGG CTTGGCCGTG GCGTTGCGCG AGCTGGGGCG CATCGAGCGC ACGCTGTTCA TTTTGGATTG GCTGCAAAGC GTGGAGCTGC GCCGCCGCGT CCATGCGGGG CTGAATAAGG GCGAGGCGCG CAACGCGCTG GCCAGGGCGG TCTTCTTCTA CCGATTGGGT GAAATCCGCG ACCGCAGTTT TGAGCAGCAG CGCTACCGGG CCAGCGGCCT CAATCTGGTG ACGGCGGCCA TCGTGTTGTG GAACACGGTA TATCTGGAGC GTGCCACCAG TGCTTTGCGT GGCAACGGCA CGGCGCTGGA CGACACATTG TTGCAATATC TGTCGCCGCT GGGGTGGGAG CACATCAACC TGACCGGCGA TTACCTATGG CGCAGCAGCG CCAAGGTCGG TGCGGGGAAG TTTAGGCCAT TGCGACCGCT GCCACCGGCT TAG
|
Protein sequence | MPRRSILSAT ERESLLALPD AKDELIRHYT FNETDLSVIR QRRGAANRLG FAVQLCYLRF PGTFLGVDEP PFPPLLRMVA AQLKMPVESW SEYGQREQTR REHLVELQTV FGFKPFTMSH YRQAVHTLTE LALQTDKGIV LASALVENLR RQSIILPAMN AIERASAEAI TRANRRIYAA LTDSLLSPHR QRLDELLKRK DGSKVTWLAW LRQSPAKPNS RHMLEHIERL KSWQALDLPA GIERQVHQNR LLKIAREGGQ MTPADLAKFE VQRRYATLVA LAIEGMATVT DEIIDLHDRI IGKLFNAAKN KHQQQFQASG KAINDKVRMY GRIGQALIEA KQSGSDPFAA IEAVMPWDTF AASVTEAQTL ARPADFDFLH HIGESYATLR RYAPQFLGVL KLRAAPAAKG VLDAIDMLRG MNSDSARKVP ADAPTAFIKP RWAKLVLTDD GIDRRYYELC ALSELKNALR SGDVWVQGSR QFKDFDEYLV PVEKFATLKL ASELPLAVAT DCDQYLHDRL ELLEAQLATV NRMAAANDLP DAIITTASGL KITPLDAAVP DAAQAMIDQT AMLLPHLKIT ELLMEVDEWT GFTRHFTHLK TSDTAKDKTL LLTTILADAI NLGLTKMAES CPGTTYAKLS WLQAWHIRDE TYSTALAELV NAQFRQPFAG NWGDGTTSSS DGQNFRTGSK AESTGHINPK YGSSPGRTFY THISDQYAPF SAKVVNVGIR DSTYVLDGLL YHESDLRIEE HYTDTAGFTD HVFGLMHLLG FRFAPRIRDL GETKLFIPKG DAAYDALKPM ISSDRLNIKQ IRAHWDEILR LATSIKQGTV TASLMLRKLG SYPRQNGLAV ALRELGRIER TLFILDWLQS VELRRRVHAG LNKGEARNAL ARAVFFYRLG EIRDRSFEQQ RYRASGLNLV TAAIVLWNTV YLERATSALR GNGTALDDTL LQYLSPLGWE HINLTGDYLW RSSAKVGAGK FRPLRPLPPA
|
| |