Gene SNSL254_pSN254_0065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_pSN254_0065 
SymboltraI 
ID4929521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_009140 
Strand
Start bp46409 
End bp49381 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content54% 
IMG OID642572369 
Producttype IV conjugative transfer system protein TraI 
Protein accessionYP_001101944 
Protein GI134047158 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAAG CCCTTAACAA GTTATTTGGT GGGCGAAGTG GAGTGATCGA GACCGCGCCG 
AGCGTCAGAG TGTTGCCGCT TAAAGACGTG GAAGATGAAG AAATCCCTCG ATACCCACCT
TTTGCCAAGG GCCTGCCAGT GGCCCCACTA GACAAGATAC TGGCAACCCA AGCTGAACTG
ATTGAGAAAG TGCGGAACTC TCTCGGTTTC ACTGTGGACG ACTTCAACCG GCTTGTTTTG
CCGGTGATCC AGCGGTATGC CGCGTTTGTT CACCTGTTGC CAGCTTCCGA ATCACACCAC
CACCGTGGCG CTGGTGGTCT GTTCCGACAT GGGCTTGAAG TGGCCTTCTG GGCAGCTCAG
GCATCTGAGT CAGTTATCTT TTCCATCGAG GGGACGCCTC GTGAACGCCG TGACAATGAG
CCGCGTTGGA GACTGGCGAG CTGTTTCTCC GGGCTGCTGC ATGATGTGGG TAAACCGCTC
TCGGATGTGT CCATTACGGA CAAAGACGGG TCAATCACAT GGAACCCGTA TTCGGAGTCA
CTTCATGACT GGGCACACCG TCACGAAATC GACCGTTACT TTATCCGGTG GCGCGACAAG
CGACACAAAA GACATGAGCA ATTCTCGCTG CTGGCGGTGG ATCGAATTAT TCCGGCTGAG
ACTCGGGAGT TTCTGTCCAA GTCTGGCCCG TCCATCATGG AAGCGATGCT GGAAGCTATC
TCAGGAACCA GCGTCAATCA GCCTGTGACC AAGCTGATGC TTCGCGCTGA CCAAGAGAGC
GTCTCACGGG ACCTTCGCCA GAGTCGTCTC GATGTGGACG AGTTCTCCTA TGGTGTGCCT
GTCGAGCGGT ACGTGTTCGA TGCCATCCGC CGCCTGGTTA AAACCGGAAA ATGGAAGGTC
AATGAGCCAG GCGCGAAAGT CTGGCACCTC AACCAAGGTG TATTCATTGC CTGGAAACAG
CTTGGGGACC TTTATGACTT GATCAGCCAC GACAAGATCC CCGGTATTCC ACGAGACCCT
GACACACTGG CCGACATTCT CATCGAACGT GGTTTTGCTG TACCAAACAC GGTGCAGGAG
AAGGGTGAAC GTGCGTACTA CCGCTACTGG GAAGTTTTGC CTGAGATGCT CCAGGAGGCG
GCGGGATCGG TGAAGATCTT GATGCTCCGA CTCGAATCAA ACGACCTGGT GTTTACGACT
GAGCCTCCTG CGGCTGTTGC TGCGGAAGTT GTTGGTGATG TTGAGGACGC TGAGATTGAG
TTCGTTGATC CTGAGGAAGT CGATGACGAC CAAGAGGAAG ATGTGTCAGC TCTGAACGAT
GACATGTTGG CCGCAGAGCA GGAAGCAGAG AAAGCTCTAG CTGGTCTTGG CTTTGGTGAT
GCGATGGAGA TGCTGAAAAG CACCTCAGAT GCTGTCGAGG AGAAACCAGA GCAAAAAGAT
GCTGGATCAA CGGAATCATC TAAGCCTGAC GCTGGCAAGA AGGGTAAGCC GCAGAGCAAA
CCGGGCAAAG CAAAACCGAA GAGTGATACA GAGAAACAAC CCCACAAACC AGAGGCAAAA
GAGGATTTGT CCCCTCAGGA CATTGCCAAA AACGCACCAC CTTTGGCAAA CGACAATCCG
TTACAAGCAC TCAAGGATGT TGGGGGTGGA CTGGGGGACA TCGACTTTCC GTTTGACGCA
TTCAGCGCAT CGGCAGAGAC AGCCAGCACT GACGCAACAA ACTCAGAAAT CCCAGATGTG
GCAATGCCCG GAAAGCAAGA GAAGCAGCCA AAACAGGACT TCGTTCCACA AGAACAAAAC
TCCCTGCAGG GCGATGACTT TCCAATGTTC GGTAGTTCTG ATGAACCGCC ATCATGGGCG
ATTGAGCCGC TCCCTATGCT GACTGACGCA CCAGAACAAA CAACACCAGC GCCAGCAATG
CCACCTACGG ACAAACCTAA TCTGCATGAG AAAGACGCAA AGACCTTACT CGTTGAGATG
TTAGCCGGGT ACGGAGAAGC ATCGGCGTTG CTTGAACAAG CGATCATGCC TGTTTTGGAA
GGTAAAACGA CGCTGGGCGA AGTCCTATGC CTGATGAAGG GTCAAGCCGT CATTTTGTAC
CCGGATGGCG CTCGGTCGCT GGGTGCGCCG TCAGAGGTTC TCTCGAAGCT GTCCCACGCC
AACGCGATTG TTCCAGACCC GATTATGCCA GGTCGCAAAG TTCGTGATTT CAGCGGAGTG
AAGGCAATTG TACTGGCGGA GCAGCTTTCA GATGCAGTCG TAGCGGCCAT TAAGGATGCC
GAGGCGTCAA TGGGGGGATA CCAGGATGCC TTTGAGCTCG TCTCTCCCCC TGGCTTGGAT
GCAAGCAAGA ATAAGTCTGC ACCGAAACAA CAAAGCCGAA AAAAGGCGCA GCAGCAGAAG
CCTGAGGTTA ACGCCGGTAA ACCCTCGCCT GAACAAAAGG CGAAAGGTAA GGACTCCCAG
CCACAGCAGA AGGAGAAGAA GGTCGATGTT ACTTCTCCGG TTGAAGAGCC GCAGCGCCAG
CCGGTCCAAG AAAAACAGAA CGTGGCTCGC CTTCCTAAGC GGGAGGTTCA ACCAGTGGCT
CCTGAGCCCA AAGTTGAGCG TGAGAAGGAA TTGGGACACG TCGAGGTGCG AGAAAGGGAA
GAGCCAGAGG TTAGGGAGTT TGAGCCGCCT AAGGCGAAAA CAAACCCGAA AGACATCAAC
GCGGAAGATT TCTTGCCGTC TGGTGTTACG CCTCAGAAAG CACTCCAGAT GCTCAAGGAC
ATGATCCAAA AACGCTCGGG CAGATGGCTC GTGACACCTG TCCTGGAAGA GGATGGCTGC
TTAGTAACCA GTGACAAAGC CTTCGACATG ATTGCCGGTG AAAACATCGG CATCAGCAAA
CACATCCTCT GCGGGATGCT GAGCCGGGCA CAGAGACGGC CTTTGCTCAA GAAACGTCAG
GGAAAACTGT ATTTAGAGGT AAATGAAACA TGA
 
Protein sequence
MLKALNKLFG GRSGVIETAP SVRVLPLKDV EDEEIPRYPP FAKGLPVAPL DKILATQAEL 
IEKVRNSLGF TVDDFNRLVL PVIQRYAAFV HLLPASESHH HRGAGGLFRH GLEVAFWAAQ
ASESVIFSIE GTPRERRDNE PRWRLASCFS GLLHDVGKPL SDVSITDKDG SITWNPYSES
LHDWAHRHEI DRYFIRWRDK RHKRHEQFSL LAVDRIIPAE TREFLSKSGP SIMEAMLEAI
SGTSVNQPVT KLMLRADQES VSRDLRQSRL DVDEFSYGVP VERYVFDAIR RLVKTGKWKV
NEPGAKVWHL NQGVFIAWKQ LGDLYDLISH DKIPGIPRDP DTLADILIER GFAVPNTVQE
KGERAYYRYW EVLPEMLQEA AGSVKILMLR LESNDLVFTT EPPAAVAAEV VGDVEDAEIE
FVDPEEVDDD QEEDVSALND DMLAAEQEAE KALAGLGFGD AMEMLKSTSD AVEEKPEQKD
AGSTESSKPD AGKKGKPQSK PGKAKPKSDT EKQPHKPEAK EDLSPQDIAK NAPPLANDNP
LQALKDVGGG LGDIDFPFDA FSASAETAST DATNSEIPDV AMPGKQEKQP KQDFVPQEQN
SLQGDDFPMF GSSDEPPSWA IEPLPMLTDA PEQTTPAPAM PPTDKPNLHE KDAKTLLVEM
LAGYGEASAL LEQAIMPVLE GKTTLGEVLC LMKGQAVILY PDGARSLGAP SEVLSKLSHA
NAIVPDPIMP GRKVRDFSGV KAIVLAEQLS DAVVAAIKDA EASMGGYQDA FELVSPPGLD
ASKNKSAPKQ QSRKKAQQQK PEVNAGKPSP EQKAKGKDSQ PQQKEKKVDV TSPVEEPQRQ
PVQEKQNVAR LPKREVQPVA PEPKVEREKE LGHVEVRERE EPEVREFEPP KAKTNPKDIN
AEDFLPSGVT PQKALQMLKD MIQKRSGRWL VTPVLEEDGC LVTSDKAFDM IAGENIGISK
HILCGMLSRA QRRPLLKKRQ GKLYLEVNET