Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2049 |
Symbol | |
ID | 6483800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 1990704 |
End bp | 1992023 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642737405 |
Product | hypothetical protein |
Protein accession | YP_002041155 |
Protein GI | 194446438 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.245619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 0.249878 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAACAGA TAGCCCGCTC TGTCGCCCTG GCATTTAATA ATCTGCCCCG ACCCCACCGC GTTATGCTGG GGTCACTTAC CGTTCTGACA CTGGCCGTCG CCGTATGGCG GCCCTATGTT TACCACCCAG AATCCGCACC AACCGTTAAA ACTATTGAAC TGGAGAAAAG CGAGATTCGT TCCCTCTTAC CGGAGGCCAG CGAACCCATC GATCAGGCCG CGCAGGAAGA TGAAGCTATT CCTCAGGATG AGCTGGACGA TAAAACCGCA GGCGAAGTCG GCGTCCATGA ATACGTCGTC TCCACAGGCG ATACGTTAAG CAGCATTCTG AATCAGTACG GCATCGATAT GAGCGATATT AGCCGACTTG CCGCTTCTGA TAAGGAGCTG CGCCATCTGA AAATTGGCCA ACAGCTTTCC TGGACACTGA CCGCCGATGG CGATTTACAG CGTCTGACAT GGGAAGTCTC CCGCCGTGAA ACGCGTACCT ACGATCGCAC TGCCAACGGT TTTAAAATGA GCAGTGAAAT GCAGCAGGGG GACTGGGTTA ACAGTCTGCT GAAAGGTACG GTAGGGGGTA GCTTTGTCGC CAGCGCGAAA GAGGCCGGTT TAACCAGCAG CGAAATCAGC GCAGTGATAA AAGCAATGCA GTGGCAGATG GATTTTCGCA AGCTGAAAAA GGGCGATGAA TTTTCGGTTC TGATGTCGCG CGAGATGCTG GATGGCAAGC GTGAACAGAG TCAGTTGTTG GGCGTGCGGA TGCGTTCCGA TGGTAAAGAT TACTACGCCA TTCGCGCCGC TGACGGTAAA TTCTATGACC GTAACGGTGT TGGCCTGGCG AAAGGCTTTT TACGCTTCCC GACCGCCAAA CAGTTCCGCA TCTCCTCCAA CTTCAATCCG CGTCGTCTGA ACCCGGTTAC CGGACGCGTT GCGCCGCATC GTGGCGTTGA CTTTGCGATG CCGCAGGGTA CGCCGGTGCT GTCGGTGGGG GATGGCGAGG TCGTGGTCGC TAAACGTAGC GGCGCTGCTG GTTACTACAT TGCGATTCGT CATGGACGCA CCTACACCAC ACGTTACATG CACTTGCGTA AGCTGCTGGT GAAACCGGGG CAAAAAGTGA AACGTGGCGA TCGTATTGCG CTTTCTGGTA ACACCGGGCG TTCCACAGGG CCGCATCTGC ATTATGAGGT ATGGATCAAC CAGCAAGCCG TTAACCCTCT AACAGCAAAA TTGCCGCGCA CGGAAGGTCT GACGGGGTCA GATCGTCGTG AATACCTGGC ACAGGTGAAA GAGGTTCTGC CGCAACTGCG CTTCGATTAA
|
Protein sequence | MQQIARSVAL AFNNLPRPHR VMLGSLTVLT LAVAVWRPYV YHPESAPTVK TIELEKSEIR SLLPEASEPI DQAAQEDEAI PQDELDDKTA GEVGVHEYVV STGDTLSSIL NQYGIDMSDI SRLAASDKEL RHLKIGQQLS WTLTADGDLQ RLTWEVSRRE TRTYDRTANG FKMSSEMQQG DWVNSLLKGT VGGSFVASAK EAGLTSSEIS AVIKAMQWQM DFRKLKKGDE FSVLMSREML DGKREQSQLL GVRMRSDGKD YYAIRAADGK FYDRNGVGLA KGFLRFPTAK QFRISSNFNP RRLNPVTGRV APHRGVDFAM PQGTPVLSVG DGEVVVAKRS GAAGYYIAIR HGRTYTTRYM HLRKLLVKPG QKVKRGDRIA LSGNTGRSTG PHLHYEVWIN QQAVNPLTAK LPRTEGLTGS DRREYLAQVK EVLPQLRFD
|
| |