Gene Information |
Locus tag | SNSL254_A3757 |
Symbol | |
ID | 6482531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3620988 |
End bp | 3622265 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642739024 |
Product | hypothetical protein |
Protein accession | YP_002042735 |
Protein GI | 194446812 |
COG category | [S] Function unknown |
COG ID | [COG3266] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
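The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are inclusive (an assumption that is consistent with the listed values, not something the record states) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3620988
end_bp = 3622265
gene_length_bp = 1278
protein_length_aa = 425

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Because the strand is `-`, the coding sequence corresponds to the reverse complement of the replicon between these coordinates; the length check itself does not depend on strand.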
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGAAT TCAAACCAGA AGACGAGCTG AAACCCGATC CCAGCGATCG TCGTACTGGT CGTTCTCGTC AATCTTCAGA ACGCGATAAT GAGCCGCAGA TCAACTTTGA TGACGTTGAT CTGGATGCCG ACGATCGCCG TCCGACGCGT ACGCGTAAAG CGCGTAGTGA AGAACCTGAA GTTGAAGAAG AGTACGAATC CGATGAAGAC GATACGGTGG ACGAAGAGCG TGTTGAACGC CGCCCACGTA AGCGTAAAAA AGCGACCCAT AAGCCAGCCT CTCGTCAGTA CATGATGATG GGCGTTGGCG TACTGGTGCT GCTGCTGTTG ATTATCGGTA TCGGCTCCGC GCTGAAAGCC CCCTCAACGT CTTCCAGCGA GCCGTCGGCC TCTGGCGAAA AGAGTATCGA TCTTTCCGGT AACGCCGCCG ACCAGGCGAA TGCGACCCAG CCTGCGCCGG GCGCCACCTC CGCAGAACAA ACCGCGGGCA ATACGTCGCA GGATATTTCG TTGCCGCCGA TTTCTTCAAC GCCGACGCAG GGACAGTCGC CTGTGGTCGC TGACGGTCAG CAGCGCGTGG AAGTGCAGGG CGATCTGAAT AATGCGCTGA CGCAGAATCC AGAGCAGATG AACAATGTTG CGGTGAACTC TACGTTGCCG ACAGAGCCTG CAACCGTCGC GCCAGTTCGC AATGGCAGCA CGACGCGTCA GGCGGCGGTT AGCGAACCTG CCGAGCGTCA TACCACGCGT CCGGAACGTA AACAGGCCGT CATTGAACCT AAGAAGCCGC AGACCACGGC GAAAACCACC ACTGCGGAAC CGAAGAAACC GGTCGCGCCA GTGAAACGCA CGGAACCGGC AGCGCCAGCC GCGACGCCGA AAGCGACCAC CACGACGGCT GCGCCGACAG CGACGGCAAG CGCTGCGCCG GTACAAACCG CGAAGCCAGC GCAAGCCTCG ACGACGCCTG TCGCAGGCGG CGGGAAAAGC GCCGGCAACG TTGGCGCATT AAAGAGCGCG CCATCCAGCC ACTACACATT GCAGCTCAGT AGTTCTTCAA ATTACGACAA CCTGAACGGT TGGGCGAAGA AAGAGAACCT GAAAAATTAT GTGGTATACG AGACGACGCG TAATGGACAA CCGTGGTATG TGCTGGTAAC GGGGATGTAT GCTTCGAAAG AAGATGCTAA ACGTGCGGTG TCCACCTTAC CTGCCGATGT GCAGGCGAAA AACCCGTGGG CAAAACCGTT GCATCAGGTT CAGGCCGATC TGAAATAA
|
Protein sequence | MDEFKPEDEL KPDPSDRRTG RSRQSSERDN EPQINFDDVD LDADDRRPTR TRKARSEEPE VEEEYESDED DTVDEERVER RPRKRKKATH KPASRQYMMM GVGVLVLLLL IIGIGSALKA PSTSSSEPSA SGEKSIDLSG NAADQANATQ PAPGATSAEQ TAGNTSQDIS LPPISSTPTQ GQSPVVADGQ QRVEVQGDLN NALTQNPEQM NNVAVNSTLP TEPATVAPVR NGSTTRQAAV SEPAERHTTR PERKQAVIEP KKPQTTAKTT TAEPKKPVAP VKRTEPAAPA ATPKATTTTA APTATASAAP VQTAKPAQAS TTPVAGGGKS AGNVGALKSA PSSHYTLQLS SSSNYDNLNG WAKKENLKNY VVYETTRNGQ PWYVLVTGMY ASKEDAKRAV STLPADVQAK NPWAKPLHQV QADLK
|
| |
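The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAA, which fits this) and uses only the first three and last two 10-base blocks of the record as input. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts; since this gene starts with ATG, that difference does not matter here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, not part of any library.

```python
from itertools import product

# Codon -> amino acid map built in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Fragments copied from the Gene sequence field (start and end of the CDS).
head = "ATGGATGAAT TCAAACCAGA AGACGAGCTG"
tail = "CAGGCCGATC TGAAATAA"

print(translate(head))  # compare with the first block of the protein sequence
print(translate(tail))  # compare with the last residues of the protein sequence
```

Running `gc_fraction` on the full gene sequence, rather than on these short fragments, is what would be comparable to the GC content field in the record.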