Gene SNSL254_A1443 details


Gene Information

Locus tag: SNSL254_A1443
Symbol:
ID: 6485613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1410190
End bp: 1411356
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 46%
IMG OID: 642736835
Product: DNA/RNA non-specific endonuclease
Protein accession: YP_002040589
Protein GI: 194443189
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1864] DNA/RNA endonuclease G, NUC1
TIGRFAM ID:
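
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. Both assumptions are consistent with the numbers listed, but the record does not state them. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the Gene Information fields.
    length = gene_length(1410190, 1411356)
    print(length, "bp")
    print(protein_length(length), "aa")
```

Under these assumptions, the computed values agree with the listed Gene Length (1167 bp) and Protein Length (388 aa).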


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000963372
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAAAA ATGGTGTTAA AAAACATGCG CCAGATATTA TTGAATATCC ATTTTTCATT 
CGCTATCTGA GTGCGAGAAA TTATTGGCTT CACGATTATG CATATAATAC GATGTTTTTT
GGTATCAATA TGAATATCAC GTTGTATTCT TTTGAGCTCA TTTTCTATGA TGGCTTCGAT
GTTTATCTGT TATTAATTTT TACCGTGATA GTGTTGTCTT TAATGATGAG CGCATCTAAC
GGCTGGCAGG GTAATATAAC CAAATTATTG CTACCTGAAT TATCAGGGCA GTTATTATTA
AGAAAGAAAA AAATGAATAA AACCATTAAT CTGCTAAAAT TACTGCCCGT AGTATTATTA
AGCGCATGTA CTACATCGTA TCCTCCCCAG GATACAACAT CGGCACCCGA GTTACCCCAT
CGTAACGTAC TCGTTCAGCA ACCTGATAAC TGTAGCGTTG GCTGTCCTCA AGGAGGAAGT
CAACAAACAA TCTATCGCCA TGTCTATACG CTCAATAATA ATAGCGCCAC GAAATTTGCC
AACTGGGTTG CCTATAGCGT GACAAAGACC AGCCAGGCAA GCGGTCGCCC GCGGAACTGG
GCGCAGGACC CCGATTTACC GCCCTCGGAT ACGTTGGCCC CTTCGGCCTA TAAAAATGCC
CATACGCTAT TAAAAGTCGA CAGGGGACAC CAGGCGCCGT TGGCAGGATT GGGCGGCGTT
TCGGACTGGC CGTCGTTAAA TTATTTATCG AATATTACGC CGCAGAAATC CGCCCTGAAC
CAGGGAGCAT GGGCTGCACT GGAAAACCGG GTGCGCGAAC TTGCCAAACA GGCTGATGTA
TCTGTAGTGC ACGTAGTGAC CGGCCCCCTT TTTGAGCGGC ATATCGCCAC ATTGCCAGAA
GATGCGACGG TAGAAATTCC CAGCGGGTAC TGGAAGGTTT TATTCACCGG AACGGCGCCG
TCAAAAAGTG AAGGAAATTA CGCTGCGTTT ATTATGGATC AGAATACACC CCGTTCGGCG
AATTTTTGCG ACTATCAGGT TACCGTGGAG GCTATCGAAC ATAAAACGAA GCCAGTGCTG
ACGCTGTGGT CTGCCTTGCC TGAAGCGGTA GCCAGCGAGG TGAAAACGAC AAAGGGGAGT
CTGGCGCAGA AGTTAGGTTG TCGATGA
 
Protein sequence
MLKNGVKKHA PDIIEYPFFI RYLSARNYWL HDYAYNTMFF GINMNITLYS FELIFYDGFD 
VYLLLIFTVI VLSLMMSASN GWQGNITKLL LPELSGQLLL RKKKMNKTIN LLKLLPVVLL
SACTTSYPPQ DTTSAPELPH RNVLVQQPDN CSVGCPQGGS QQTIYRHVYT LNNNSATKFA
NWVAYSVTKT SQASGRPRNW AQDPDLPPSD TLAPSAYKNA HTLLKVDRGH QAPLAGLGGV
SDWPSLNYLS NITPQKSALN QGAWAALENR VRELAKQADV SVVHVVTGPL FERHIATLPE
DATVEIPSGY WKVLFTGTAP SKSEGNYAAF IMDQNTPRSA NFCDYQVTVE AIEHKTKPVL
TLWSALPEAV ASEVKTTKGS LAQKLGCR
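
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, not part of the original record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the GC calculation counts only G and C over the full cleaned length.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence, used here only as a short sample.
    sample = "ATGTTGAAAA ATGGTGTTAA AAAACATGCG CCAGATATTA TTGAATATCC ATTTTTCATT"
    dna = clean_sequence(sample)
    print(len(dna), "bases in sample")
    print(round(gc_percent(dna), 1), "% GC in sample")
```

Pasting the full gene sequence into `clean_sequence` and taking its length gives a value that can be compared with the Gene Length field. The same helper works for the protein sequence and the Protein Length field.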