Gene SNSL254_A2951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2951 
Symbol 
ID6486463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2876970 
End bp2877980 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content45% 
IMG OID642738268 
Productphage integrase 
Protein accessionYP_002041997 
Protein GI194445766 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.339524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value3.4545800000000002e-18 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGTTC GTAAAATAGA CTCTGGCGAA TGGCTATGCG ACCTGCGGCC TACTGGCGTC 
AAGGGAAAAC GCATTCGCAA AAAATTTGCC ACTAAAGGCG AAGCGCTGGC TTATGAAAAA
TACATTGCCA GCGAAATGGA AGAAAAGCCA TGGTTAGGTG AAAAGCAAGA TAATCGACGA
CTATCAGAAC TGATTGAACA GTGGCACGAC CTTTACGGCC GTACACTCTC TGATGCGGAT
CGGATGATGT CAAAATTGAA AGGTATCTGT GCGGGCATGG GCGATCCCAT AGCGGCACAA
ATCACATCCG CAGATTTTAG CCAATATCGT GAGGGCCGAT TAAAAGGTGA AATTCCCGAT
GTTAACGGTC GACTAATGCC GATACAGCCC CAGACGGTAA ATCATGAGCA GCGCAACCTC
TCAGCTGTAT TTGGTACGCT AAAAAAACTG GGGCACTGGT CATTACCTAA TCCTCTGGCA
GGTATTCCAA CATTCAAAGT TGATGAAAAA ATGGTTTCTT TTTTGTACCC AGAAGAGATC
AAAAGCCTGC TGCAATACCT ATCAGAATCA AGCAGTGATA GCGTACTTAT AATCACCAAA
ATCTGCTTGG CTACAGGGGC CAGATGGAGT GAGGCCGAAA ATTTAGAAGG TGCGCAGGTC
ACGCCGTATC GGATAACCTA CAAGAACACC AAAAATGGAA GAGTCAGATC GATTCCTATC
TCGAAAGAAC TGTATGACGA AATTCCGAAA AAACGTGGGC GTTTGTTCAC GCCATGCCGT
AAGACTTTTG AACGAGTAGT GGCTAAAGCG GGCATTGAGT TACCTGACGG GCAATGCACA
CACGTACTGC GTCATACATT TGCCAGTCAT TTTATGATGA ACGGTGGAAA CATCCTTGTC
CTCAAAGAAA TACTTGGGCA TTCAGATATA AAAATGACAA TGATTTACGC ACATTTCGCG
CCTACACATT TAGAAGATGC TGTACTTAAA AACCCTTTGG CTAACCTTTA A
 
Protein sequence
MAVRKIDSGE WLCDLRPTGV KGKRIRKKFA TKGEALAYEK YIASEMEEKP WLGEKQDNRR 
LSELIEQWHD LYGRTLSDAD RMMSKLKGIC AGMGDPIAAQ ITSADFSQYR EGRLKGEIPD
VNGRLMPIQP QTVNHEQRNL SAVFGTLKKL GHWSLPNPLA GIPTFKVDEK MVSFLYPEEI
KSLLQYLSES SSDSVLIITK ICLATGARWS EAENLEGAQV TPYRITYKNT KNGRVRSIPI
SKELYDEIPK KRGRLFTPCR KTFERVVAKA GIELPDGQCT HVLRHTFASH FMMNGGNILV
LKEILGHSDI KMTMIYAHFA PTHLEDAVLK NPLANL