Gene SNSL254_A2845 details


Gene Information

Locus tag: SNSL254_A2845
Symbol:
ID: 6484249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2777919
End bp: 2779148
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 50%
IMG OID: 642738168
Product: phage integrase family protein
Protein accession: YP_002041901
Protein GI: 194442657
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
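
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; both are assumptions, not statements from the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2777919, 2779148)
print(length)                  # 1230
print(protein_length(length))  # 409
```

Both values match the Gene Length and Protein Length fields listed above.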


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.809863
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value: 0.121401
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATTT CAGATAGTTA TCTAAAGTCG TGCCTCGGGC GCGAACGCGA TAAAGTTGAG 
GAGAAGGCAG ACCGGGACGG TCTGTGGGTG CGCATTTCCA AAAAGGGCGC CGTCACTTTT
TTCTACCGAT TCCGTTTCCT GGGTAAGCAG GACAAGATGA CGATCGGCAA TTACCCGGAA
TTCGGGTTGA AGGCCGCGCG CGAAGAGGTA ACTAAGTGGG CCGCCATTCT TGCCCGGGGA
GAAAATCCGC GGATCAGGCA AAGCCTCGAT AAAGCTAAAA TCAACAGCCA GTACACATTC
GAGGAACTTT TCAGAGAATG GCATGCGATG GTATGCGTTC AGAAAGAAAC ATCCGATCAA
ATACTGCGTT CGTTTGAGCT GCATGTATTC CCTAAGCTGG GTAAGTATCC GGCGCATCAG
TTGACGCTGC ACAACTGGCT TACAGTTCTG GACCGACTGG CGCAGGGATA CACTGAGATC
ACCCGGCGAG TAATTAGTAA CGGTCGGCAG TGCTACTCAT GGGCAGTGAA GCGCCAGTTG
CTTGAGGTTA ACCCACTTTC TGAAATGTCC GGTCGTGATT TTGGTATTCA GAAGAAAATG
GGGGAGAGAA CACTGGATCG CAAAGAAATT GCGATTGTCT GGCGAGCTAT TGAGGATTCC
CGCCTTATTG AGCGAAACAA GATCCTTTAT AAATTGTCCC TGATATGGGC GTGCAGGGTC
GGTGAACTCC GTCAGGCTGA AGTTTCGCAT TTCGATTTTG AGGAGGGCGT CTGGACCGTA
CCGTGGGAAA ATCATAAAAC GGGACGGAAA AGTAAGAAGC CGATAATCCG CCCGATCATC
CCTGAAATGC TCCCGCTGAT ACAACGAGCC ATTGAGCTGG CGCCAGGCCG TTTTGTTTTC
TCAAAATATG CAGACAAGCC GATGAGCGAA GGCTTTCATA TGAGCATCAG CAGCAACCTT
GTTAAGTTCA TGCTGAAGGC TTATAACGAG CAGGTTCCGC ACTTTACGAT CCATGATCTA
CGCAGAACTG CGCGAACGAA TTTCTCCGAG CTGACTGAAC CACATATTGC TGAGATGATG
CTCGGGCACA AACTGCCTGG AGTGTGGTCG GTGTACGACA AATACACCTA TATCGAAGAA
ATGAGAGAAG CTTATAGTAA ATGGTGGGCC CGACTGATGA GCATCATCGA GCCCGATGTT
CTGGAGTTCA CGCCACGTCA GACCGGATGA
 
Protein sequence
MAISDSYLKS CLGRERDKVE EKADRDGLWV RISKKGAVTF FYRFRFLGKQ DKMTIGNYPE 
FGLKAAREEV TKWAAILARG ENPRIRQSLD KAKINSQYTF EELFREWHAM VCVQKETSDQ
ILRSFELHVF PKLGKYPAHQ LTLHNWLTVL DRLAQGYTEI TRRVISNGRQ CYSWAVKRQL
LEVNPLSEMS GRDFGIQKKM GERTLDRKEI AIVWRAIEDS RLIERNKILY KLSLIWACRV
GELRQAEVSH FDFEEGVWTV PWENHKTGRK SKKPIIRPII PEMLPLIQRA IELAPGRFVF
SKYADKPMSE GFHMSISSNL VKFMLKAYNE QVPHFTIHDL RRTARTNFSE LTEPHIAEMM
LGHKLPGVWS VYDKYTYIEE MREAYSKWWA RLMSIIEPDV LEFTPRQTG
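
The gene and protein sequences above are related by translation table 11. A minimal sketch of that translation, together with a GC-content calculation, is given here using only the Python standard library. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and the example runs only on the first 30 bases of the gene sequence.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
prefix = "ATGGCTATTT CAGATAGTTA TCTAAAGTCG"
print(translate(prefix))  # MAISDSYLKS
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would be the natural way to check the record's Protein Length and GC content fields.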