Gene SNSL254_pSN254_0141 details


Gene Information

Locus tag: SNSL254_pSN254_0141
Symbol: tnpA
ID: 4929451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_009140
Strand:
Start bp: 121107
End bp: 124079
Gene Length: 2973 bp
Protein Length: 990 aa
Translation table: 11
GC content: 60%
IMG OID: 642572440
Product: transposon Tn21 transposase TnpA
Protein accession: YP_001102015
Protein GI: 134047160
COG category: [L] Replication, recombination and repair
COG ID: [COG4644] Transposase and inactivated derivatives, TnpA family
TIGRFAM ID:
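
The gene length and protein length listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon; the record does not state either point, but both are consistent with the listed values. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(121107, 124079)
print(length)                     # 2973
print(protein_length_aa(length))  # 990
```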


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 83
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACGCC GCTCAATCCT GTCCGCCACC GAGCGCGAAA GCCTGCTGGC ACTGCCAGAT 
GCCAAAGACG AACTGATACG GCACTACACG TTCAACGAAA CCGACCTGTC GGTGATCCGT
CAGCGTCGCG GCGCCGCGAA TCGATTGGGC TTCGCTGTGC AGCTTTGCTA CTTGCGATTC
CCTGGCACCT TTTTGGGCGT CGATGAGCCT CCGTTTCCGC CCCTGTTGCG CATGGTGGCC
GCGCAACTCA AGATGCCAGT GGAAAGTTGG AGCGAGTACG GCCAGCGCGA ACAGACACGG
CGGGAGCACT TGGTCGAGCT GCAAACGGTT TTTGGGTTCA AGCCCTTCAC CATGAGCCAC
TATCGGCAAG CCGTGCATAC ATTGACCGAG CTGGCCTTGC AGACCGACAA AGGCATCGTG
CTGGCGAGCG CACTTGTCGA GAATCTGCGG CGGCAGAGCA TTATCCTGCC CGCCATGAAT
GCCATCGAGC GCGCAAGCGC CGAGGCCATC ACCCGTGCCA ACCGACGCAT TTACGCGGCG
CTGACCGATT CTTTGTTATC ACCCCACCGT CAGCGCCTGG ACGAACTTCT CAAGCGCAAG
GACGGCAGTA AAGTGACGTG GCTGGCATGG CTGCGCCAGT CGCCTGCCAA ACCGAACTCT
CGCCACATGC TCGAACATAT TGAGCGCCTG AAATCCTGGC AAGCACTTGA TCTGCCCGCA
GGCATCGAGC GGCAGGTTCA CCAGAACCGC CTGCTCAAAA TCGCTCGTGA AGGTGGCCAG
ATGACGCCTG CTGATCTGGC AAAGTTCGAG GTGCAACGAC GCTATGCCAC GCTGGTAGCG
CTGGCCATCG AAGGCATGGC CACCGTCACC GATGAAATCA TCGACCTTCA CGATCGCATC
ATCGGCAAGC TGTTCAACGC GGCCAAGAAC AAGCATCAGC AGCAGTTCCA GGCTTCCGGC
AAGGCGATCA ACGACAAGGT GCGGATGTAT GGGCGCATCG GTCAAGCGTT GATTGAGGCC
AAGCAAAGCG GCAGCGATCC GTTCGCCGCC ATCGAGGCCG TTATGCCCTG GGACACCTTC
GCCGCCAGCG TCACCGAAGC GCAAACATTG GCGCGGCCTG CCGACTTTGA TTTCCTGCAC
CACATCGGTG AAAGCTATGC CACGCTACGC CGCTACGCGC CGCAGTTCCT GGGCGTGCTC
AAATTGCGGG CTGCGCCCGC CGCCAAGGGT GTGCTCGATG CCATCGACAT GCTGCGCGGC
ATGAACAGCG ACAGCGCGCG CAAGGTGCCC GCCGATGCGC CAACCGCATT CATCAAGCCG
CGCTGGGCAA AGCTGGTTCT GACCGACGAC GGCATCGACC GGCGTTACTA CGAGTTATGC
GCCCTGTCGG AGCTGAAGAA CGCGCTGCGC TCCGGTGATG TCTGGGTGCA GGGTTCTCGC
CAGTTCAAGG ACTTCGACGA ATACCTGGTG CCGGTCGAGA AGTTCGCCAC TTTGAAGCTG
GCCAGCGAAT TGCCGCTGGC AGTGGCCACC GACTGCGACC AATACCTGCA TGACCGGTTG
GAATTGTTGG AGGCGCAACT CGCCACAGTC AACCGCATGG CTGCGGCCAA CGACTTACCG
GATGCCATCA TCACCACCGC GTCAGGCCTG AAGATCACGC CGCTGGACGC GGCAGTACCA
GACGCCGCGC AAGCCATGAT CGACCAGACA GCTATGCTGC TGCCGCACCT CAAAATCACC
GAGTTGCTGA TGGAGGTCGA TGAATGGACG GGCTTCACCC GCCACTTCAC ACACCTGAAG
ACCAGCGACA CGGCCAAGGA CAAAACCTTG CTGTTGACGA CGATCCTGGC CGACGCGATC
AACCTGGGTC TGACCAAAAT GGCCGAGTCC TGCCCTGGCA CCACCTACGC CAAGCTGTCT
TGGCTGCAAG CCTGGCACAT CCGCGATGAA ACCTATTCGA CGGCGCTGGC CGAGCTGGTG
AATGCGCAGT TTCGGCAACC CTTCGCCGGC AACTGGGGTG ACGGCACCAC GTCATCGTCG
GACGGCCAGA ACTTCAGAAC CGGCAGCAAA GCAGAAAGCA CTGGTCATAT CAACCCGAAG
TATGGAAGCA GTCCAGGACG GACTTTCTAC ACCCATATCT CCGACCAGTA CGCGCCCTTC
AGTGCCAAGG TGGTCAACGT GGGCATTCGT GATTCAACTT ACGTGCTTGA TGGCCTGCTG
TACCACGAGT CGGACTTGCG CATCGAGGAA CACTACACCG ACACGGCAGG CTTCACCGAT
CACGTGTTTG GCTTGATGCA TTTGCTGGGA TTTCGCTTCG CGCCGCGTAT CCGTGACTTG
GGCGAAACCA AGCTATTCAT CCCCAAGGGC GATGCCGCCT ATGACGCGCT CAAGCCGATG
ATTAGCAGCG ACAGGCTGAA CATCAAGCAA ATACGCGCCC ATTGGGATGA AATTCTGCGG
CTGGCCACCT CCATCAAGCA AGGCACGGTA ACGGCTTCGC TGATGCTGCG CAAACTCGGC
AGCTACCCGC GCCAGAACGG CTTGGCCGTG GCGTTGCGCG AGCTGGGGCG CATCGAGCGC
ACGCTGTTCA TTTTGGATTG GCTGCAAAGC GTGGAGCTGC GCCGCCGCGT CCATGCGGGG
CTGAATAAGG GCGAGGCGCG CAACGCGCTG GCCAGGGCGG TCTTCTTCTA CCGATTGGGT
GAAATCCGCG ACCGCAGTTT TGAGCAGCAG CGCTACCGGG CCAGCGGCCT CAATCTGGTG
ACGGCGGCCA TCGTGTTGTG GAACACGGTA TATCTGGAGC GTGCCACCAG TGCTTTGCGT
GGCAACGGCA CGGCGCTGGA CGACACATTG TTGCAATATC TGTCGCCGCT GGGGTGGGAG
CACATCAACC TGACCGGCGA TTACCTATGG CGCAGCAGCG CCAAGGTCGG TGCGGGGAAG
TTTAGGCCAT TGCGACCGCT GCCACCGGCT TAG
 
Protein sequence
MPRRSILSAT ERESLLALPD AKDELIRHYT FNETDLSVIR QRRGAANRLG FAVQLCYLRF 
PGTFLGVDEP PFPPLLRMVA AQLKMPVESW SEYGQREQTR REHLVELQTV FGFKPFTMSH
YRQAVHTLTE LALQTDKGIV LASALVENLR RQSIILPAMN AIERASAEAI TRANRRIYAA
LTDSLLSPHR QRLDELLKRK DGSKVTWLAW LRQSPAKPNS RHMLEHIERL KSWQALDLPA
GIERQVHQNR LLKIAREGGQ MTPADLAKFE VQRRYATLVA LAIEGMATVT DEIIDLHDRI
IGKLFNAAKN KHQQQFQASG KAINDKVRMY GRIGQALIEA KQSGSDPFAA IEAVMPWDTF
AASVTEAQTL ARPADFDFLH HIGESYATLR RYAPQFLGVL KLRAAPAAKG VLDAIDMLRG
MNSDSARKVP ADAPTAFIKP RWAKLVLTDD GIDRRYYELC ALSELKNALR SGDVWVQGSR
QFKDFDEYLV PVEKFATLKL ASELPLAVAT DCDQYLHDRL ELLEAQLATV NRMAAANDLP
DAIITTASGL KITPLDAAVP DAAQAMIDQT AMLLPHLKIT ELLMEVDEWT GFTRHFTHLK
TSDTAKDKTL LLTTILADAI NLGLTKMAES CPGTTYAKLS WLQAWHIRDE TYSTALAELV
NAQFRQPFAG NWGDGTTSSS DGQNFRTGSK AESTGHINPK YGSSPGRTFY THISDQYAPF
SAKVVNVGIR DSTYVLDGLL YHESDLRIEE HYTDTAGFTD HVFGLMHLLG FRFAPRIRDL
GETKLFIPKG DAAYDALKPM ISSDRLNIKQ IRAHWDEILR LATSIKQGTV TASLMLRKLG
SYPRQNGLAV ALRELGRIER TLFILDWLQS VELRRRVHAG LNKGEARNAL ARAVFFYRLG
EIRDRSFEQQ RYRASGLNLV TAAIVLWNTV YLERATSALR GNGTALDDTL LQYLSPLGWE
HINLTGDYLW RSSAKVGAGK FRPLRPLPPA
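
The sequences above are printed in space-separated blocks of ten, so they must be cleaned before any analysis. The Python sketch below shows one way to strip the block formatting, compute GC content, and translate a fragment of the gene. It is an illustration under stated assumptions, not the method used to generate this record: the codon table is built from the standard amino acid assignments, which translation table 11 shares, and the alternative start codons that table 11 allows are not handled (this gene starts with ATG). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned DNA sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unrecognized codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGCCACGCC GCTCAATCCT GTCCGCCACC GAGCGCGAAA GCCTGCTGGC ACTGCCAGAT"
last_line = "TTTAGGCCAT TGCGACCGCT GCCACCGGCT TAG"

print(translate(clean_sequence(first_line)))  # MPRRSILSATERESLLALPD
print(translate(clean_sequence(last_line)))   # FRPLRPLPPA
print(round(gc_percent(clean_sequence(first_line)), 1))
```

Both translated fragments match the corresponding ends of the protein sequence listed above. Running gc_percent on the full cleaned gene sequence would give the value to compare against the GC content reported in the Gene Information section.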