Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1443 |
Symbol | |
ID | 6485613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1410190 |
End bp | 1411356 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642736835 |
Product | DNA/RNA non-specific endonuclease |
Protein accession | YP_002040589 |
Protein GI | 194443189 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1864] DNA/RNA endonuclease G, NUC1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000963372 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAAAA ATGGTGTTAA AAAACATGCG CCAGATATTA TTGAATATCC ATTTTTCATT CGCTATCTGA GTGCGAGAAA TTATTGGCTT CACGATTATG CATATAATAC GATGTTTTTT GGTATCAATA TGAATATCAC GTTGTATTCT TTTGAGCTCA TTTTCTATGA TGGCTTCGAT GTTTATCTGT TATTAATTTT TACCGTGATA GTGTTGTCTT TAATGATGAG CGCATCTAAC GGCTGGCAGG GTAATATAAC CAAATTATTG CTACCTGAAT TATCAGGGCA GTTATTATTA AGAAAGAAAA AAATGAATAA AACCATTAAT CTGCTAAAAT TACTGCCCGT AGTATTATTA AGCGCATGTA CTACATCGTA TCCTCCCCAG GATACAACAT CGGCACCCGA GTTACCCCAT CGTAACGTAC TCGTTCAGCA ACCTGATAAC TGTAGCGTTG GCTGTCCTCA AGGAGGAAGT CAACAAACAA TCTATCGCCA TGTCTATACG CTCAATAATA ATAGCGCCAC GAAATTTGCC AACTGGGTTG CCTATAGCGT GACAAAGACC AGCCAGGCAA GCGGTCGCCC GCGGAACTGG GCGCAGGACC CCGATTTACC GCCCTCGGAT ACGTTGGCCC CTTCGGCCTA TAAAAATGCC CATACGCTAT TAAAAGTCGA CAGGGGACAC CAGGCGCCGT TGGCAGGATT GGGCGGCGTT TCGGACTGGC CGTCGTTAAA TTATTTATCG AATATTACGC CGCAGAAATC CGCCCTGAAC CAGGGAGCAT GGGCTGCACT GGAAAACCGG GTGCGCGAAC TTGCCAAACA GGCTGATGTA TCTGTAGTGC ACGTAGTGAC CGGCCCCCTT TTTGAGCGGC ATATCGCCAC ATTGCCAGAA GATGCGACGG TAGAAATTCC CAGCGGGTAC TGGAAGGTTT TATTCACCGG AACGGCGCCG TCAAAAAGTG AAGGAAATTA CGCTGCGTTT ATTATGGATC AGAATACACC CCGTTCGGCG AATTTTTGCG ACTATCAGGT TACCGTGGAG GCTATCGAAC ATAAAACGAA GCCAGTGCTG ACGCTGTGGT CTGCCTTGCC TGAAGCGGTA GCCAGCGAGG TGAAAACGAC AAAGGGGAGT CTGGCGCAGA AGTTAGGTTG TCGATGA
|
Protein sequence | MLKNGVKKHA PDIIEYPFFI RYLSARNYWL HDYAYNTMFF GINMNITLYS FELIFYDGFD VYLLLIFTVI VLSLMMSASN GWQGNITKLL LPELSGQLLL RKKKMNKTIN LLKLLPVVLL SACTTSYPPQ DTTSAPELPH RNVLVQQPDN CSVGCPQGGS QQTIYRHVYT LNNNSATKFA NWVAYSVTKT SQASGRPRNW AQDPDLPPSD TLAPSAYKNA HTLLKVDRGH QAPLAGLGGV SDWPSLNYLS NITPQKSALN QGAWAALENR VRELAKQADV SVVHVVTGPL FERHIATLPE DATVEIPSGY WKVLFTGTAP SKSEGNYAAF IMDQNTPRSA NFCDYQVTVE AIEHKTKPVL TLWSALPEAV ASEVKTTKGS LAQKLGCR
|
| |