Gene SNSL254_pSN254_0110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_pSN254_0110 
Symbol 
ID4929526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_009140 
Strand
Start bp97249 
End bp98259 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content52% 
IMG OID642572409 
Productputative phage-type endonuclease 
Protein accessionYP_001101984 
Protein GI134047100 
COG category[L] Replication, recombination and repair 
COG ID[COG5377] Phage-related protein, predicted endonuclease 
TIGRFAM ID[TIGR03033] putative phage-type endonuclease 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value0.300181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAG TCAACCTATC GCAACGCGAG GAAGATTGGC TTGATTGGCG GCGTCAAGGT 
GTAACAGCCA CTGACGCCGC TATCCTGCTC AATCGGTCTC CGTACAAAAC ACGATGGAGA
CTGTGGGCCG AGAAGACTGG GTATGCGCGT GAAGTCGATC TGAGTCTTAA TCCGCTGGTT
CGCCGGGGGA TAGAAAACGA AGATGCTGCA AGACGCGCTT TCGAGGAGAA GTATGATGAC
ATGCTGCTCC CCGCCTGTGT CGAATCGGTT CAATACCCGC TCATGAGGGC CTCCCTGGAT
GGCCTGAGAG ATAACGGGGA GCCCGTCGAG CTGAAAAGCC CGAGTGCGAC TGTCTGGGAA
GATGTTTGTG CTGAGAAAGC AAACAGCAAG GCATACCAGC TTTATTACCC GCAGGTGCAA
CACCAGCTCC TGGTAACGGG GGCCAAGCAA GGCTGGTTAG TCTTCTACTT TGAAGGTCAG
ATTCAGGAGT TTCCAATACT CCGAGACGAA GCCATGATTC AAGAAATCTT GGCCGAGGCT
AAAAAGTTCT GGCAACAGGT AGTAGACAAG AAGGAGCCCG ACAAAGATCC AGAGAGAGAC
CTGTACATAC CGCAAGGTGA AGAGGTCAAC CGTTGGATTG CTGCTGCTGA GGAATACCGC
CTCTATGATG CAGAGATTCA GGAGCTGAAA CAGCGACTGT CTGAGCTTCA AGAAAGGCAA
AAGCCTCATC TCGACACCAT GAAGTCCCTC ATGGGGGAAT ACTTCCATGC CGACTACTGC
GGTGTGATGG TAACGAGATA CAAAGCGGCT GGCCGGGTAG ACTACAAAAA GCTGTTGGCT
GATAAGGCGT CAGGCGTGAA GCCTGAGGAT GTTGACCAGT ACAGAGAGAA GTCATCAGAG
CGGTGCCGTG TAACGGTTAC TGGCTCTGTG AAGCCACGGT ACATTGTTGA TGAGGACGTG
CTTGCTCCTC TTGATGATTT GCCGGAAGAA GTAGAGACGT TCTACTGGTG A
 
Protein sequence
MKIVNLSQRE EDWLDWRRQG VTATDAAILL NRSPYKTRWR LWAEKTGYAR EVDLSLNPLV 
RRGIENEDAA RRAFEEKYDD MLLPACVESV QYPLMRASLD GLRDNGEPVE LKSPSATVWE
DVCAEKANSK AYQLYYPQVQ HQLLVTGAKQ GWLVFYFEGQ IQEFPILRDE AMIQEILAEA
KKFWQQVVDK KEPDKDPERD LYIPQGEEVN RWIAAAEEYR LYDAEIQELK QRLSELQERQ
KPHLDTMKSL MGEYFHADYC GVMVTRYKAA GRVDYKKLLA DKASGVKPED VDQYREKSSE
RCRVTVTGSV KPRYIVDEDV LAPLDDLPEE VETFYW