Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3957 |
Symbol | |
ID | 6484435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3838250 |
End bp | 3840205 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642739217 |
Product | hypothetical protein |
Protein accession | YP_002042927 |
Protein GI | 194444786 |
COG category | [S] Function unknown |
COG ID | [COG3533] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.565903 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 0.86497 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTAC TGGAAGTCGA TCTGCATAAA CTGACGGTCA GCGATCCGTT CCTCGGACAG TATCAACAAC TGGTTCGCGA TGTGGTTATT CCTTACCAGT GGGATGCGTT AAACGATCGT ATTCCAGAGG CTGAACCCAG CCATGCCATT GAAAATTTCC GCATTGCCGC AGGACAGCAG ACGGGCGACT TTTACGGCAT GGTCTTTCAG GACAGCGACG TGGCGAAATG GCTGGAAGCG GTTGCCTGGT CACTGTGCCA GAAGCCCGAT CCCGCGCTTG AGAAAACCGC CGATGAGGTG ATTGAACTGG TGGCCGCCGC GCAGTGTGAC GATGGCTATC TCAATACGTA CTTTACGGCA AAAGCCCCGC AAGAACGCTG GAGCAACCTG GCGGAGTGCC ACGAGCTTTA TTGCGCCGGG CACCTGATTG AAGCGGGCGT CGCCTTCTTT CAGGCCACCG GCAAGCGTCG GCTGCTAGAC GTCGTTTGTC GCCTGGCCGA TCATATCGAC AGCACTTTCG GCCCTGGCGA AAATCAGCTG CACGGCTATC CGGGCCACCC GGAAATTGAG CTGGCGTTGA TGCGTCTGTA TGAGGTAACA GAGCAGCCGC GCTATATGAC GCTGGCAAGC TACTTTATCG GGCAGCGCGG CGCCCAACCG CACTTCTACG ACGAAGAGTA CGAAAAACGC GGCCAAACCT CTTACTGGCA TACCTACGGC CCGGCGTGGA TGGTCAAAGA CAAAGCCTAC AGCCAGGCGC ATCTGCCAAT TTCGCAGCAG CAGACGGCCA TTGGCCACGC GGTACGTTTT GTCTATCTGA TGACTGGCGT GGCGCATCTC GCTCGCCTGA GCAACGATGA AGGCAAACGC CAGGACTGCC TGCGCCTATG GAAAAATATG GCGCAGCGTC AGCTGTATAT CACCGGAGGC ATCGGTTCAC AGAGCAGTGG CGAAGCCTTT AGCAGCGATT ACGATTTACC GAATGATTCG GTCTATGCGG AAAGTTGCGC TTCAATCGGC CTGATGATGT TCGCCCGCCG GATGCTGGAA ATGGAAGCCG ATAGCCAGTA CGCCGACGTG ATGGAGCGCG CGCTGTACAA CACCGTCCTC GGCGGTATGG CGCTGGATGG CAAGCATTTC TTCTACGTCA ACCCACTGGA AGTGCATCCA AAATCGTTAA ACTTCAACCA TATTTACGAT CACGTTAAGC CCATCCGCCA GCGCTGGTTT GGCTGCGCCT GCTGCCCGCC GAACATCGCC CGCGTACTCA CCTCCCTTGG TCACTACATC TACACGCCGC GTGCGGATGC GCTGTACATC AATATGTACG TGGGTAACAG CATGGAAATA CCGGTTGAAA ATGGCGCGCT CAAACTGCGA ATCAGCGGGA ACTACCCGTG GCATGAGCAG GTGAAGATTG CCATCGACTC TGTGCAGCCG GTACGTCACA CGCTGGCGCT ACGTCTGCCG GACTGGTGCC CTGAGGCAAA AGTGACGCTC AACGGGCTGG AAGTGGAGCA GGATATTCGC AAAGGTTATC TGCATATCCG TCGAACCTGG CAGGAGGGCG ATACGATAAC CCTGACGCTG CCGATGCCGG TTCGCCGCGT GTATGGCAAT CCGCTGGCGC GTCACGTCGC CGGTAAGGTC GCCATTCAGC GCGGGCCGCT GGTCTATTGC CTTGAGCAGG CCGATAACGG CGAAGAACTG CATAATCTGT GGTTACCGAA AGAGAGTGAG TTCCGGGTCT TTGAGGGCAA AGGGATTTTT GCGCATAAGA TGCTGATTCA GGCTGAAGGC GAGAAGCAAA GCGCCCCAGA TGCGCAGCAT CAGGCGTTGT GGCACTACGA TAACGCGCCG TCATCGCGCC AGCCGCAGAC GCTAACGTTC ATTCCGTGGT TTAGCTGGGC CAACCGTGGC GAGGGCGAAA TGCGGATTTG GGTTAACGAG CGGTAA
|
Protein sequence | MNVLEVDLHK LTVSDPFLGQ YQQLVRDVVI PYQWDALNDR IPEAEPSHAI ENFRIAAGQQ TGDFYGMVFQ DSDVAKWLEA VAWSLCQKPD PALEKTADEV IELVAAAQCD DGYLNTYFTA KAPQERWSNL AECHELYCAG HLIEAGVAFF QATGKRRLLD VVCRLADHID STFGPGENQL HGYPGHPEIE LALMRLYEVT EQPRYMTLAS YFIGQRGAQP HFYDEEYEKR GQTSYWHTYG PAWMVKDKAY SQAHLPISQQ QTAIGHAVRF VYLMTGVAHL ARLSNDEGKR QDCLRLWKNM AQRQLYITGG IGSQSSGEAF SSDYDLPNDS VYAESCASIG LMMFARRMLE MEADSQYADV MERALYNTVL GGMALDGKHF FYVNPLEVHP KSLNFNHIYD HVKPIRQRWF GCACCPPNIA RVLTSLGHYI YTPRADALYI NMYVGNSMEI PVENGALKLR ISGNYPWHEQ VKIAIDSVQP VRHTLALRLP DWCPEAKVTL NGLEVEQDIR KGYLHIRRTW QEGDTITLTL PMPVRRVYGN PLARHVAGKV AIQRGPLVYC LEQADNGEEL HNLWLPKESE FRVFEGKGIF AHKMLIQAEG EKQSAPDAQH QALWHYDNAP SSRQPQTLTF IPWFSWANRG EGEMRIWVNE R
|
| |