Gene Information |
Locus tag | SeAg_B3893 |
Symbol | |
ID | 6794137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 3783714 |
End bp | 3785669 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642778013 |
Product | hypothetical protein |
Protein accession | YP_002148608 |
Protein GI | 197247483 |
COG category | [S] Function unknown |
COG ID | [COG3533] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
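
The length fields above follow from the coordinates. The sketch below is only a consistency check and rests on two assumptions not stated in the record: that the coordinates are 1-based and inclusive, and that the CDS span includes a single terminal stop codon. Both assumptions agree with the listed values.

```python
# Values copied from the record above.
start_bp = 3783714
end_bp = 3785669

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1956 651
```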
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTAC TGGAAGTCGA TCTGCATAAA CTGACGGTCA GCGATCCGTT CCTCGGACAG TATCAACAAC TGGTTCGCGA TGTGGTTATT CCTTACCAGT GGGATGCGTT AAACGATCGT ATTCCAGAGG CTGAACCCAG CCATGCCATT GAAAATTTCC GCATTGCCGC AGGCCAGCAG ACGGGCGACT TTTACGGCAT GGTCTTTCAG GACAGCGACG TGGCGAAATG GCTGGAAGCG GTTGCCTGGT CACTGTGCCA GAAGCCCGAT CCCGCGCTTG AGAAAACCGC CGATGAGGTG ATTGAACTGG TGGCCGCCGC GCAGTGTGAC GATGGCTATC TCAATACGTA CTTTACGGCA AAAGCCCCGC AAGAACGCTG GAGCAACCTG GCGGAGTGCC ATGAGCTTTA TTGCGCCGGG CATCTGATTG AAGCGGGCGT CGCCTTCTTT CAGGCCACCG GCAAGCGTCG GCTGCTAGAC GTCGTTTGTC GCCTGGCCGA TCATATCGAC AGCACTTTCG GCCCTGGCGA AAATCAGCTG CACGGCTATC CGGGCCACCC GGAAATTGAG CTGGCGCTGA TGCGTCTGTA TGAGGTAACA GAGCAGCCGC GCTATATGAC GCTGGCAAGC TACTTTATCG GGCAGCGCGG CACCCAACCG CACTTCTACG ACGAAGAGTA CGAAAAACGC GGCCAAACCT CTTACTGGCA TACCTACGGC CCGGCGTGGA TGGTCAAAGA CAAAGCCTAC AGCCAGGCGC ATCTGCCAAT TTCACAGCAG CAGACGGCAA TCGGCCACGC GGTACGTTTT GTCTATCTGA TGACTGGCGT GGCGCATCTC GCTCGCCTGA GCAACGATGA AGGCAAACGC CAGGACTGCC TGCGTCTGTG GAAAAATATG GCGCAGCGTC AGCTGTATAT CACCGGAGGC ATCGGTTCAC AGAGCAGTGG CGAAGCCTTT AGCAGCGATT ACGATTTACC GAATGATTCG GTCTATGCGG AAAGCTGCGC TTCAATCGGC CTGATGATGT TCGCCCGCCG GATGCTGGAA ATGGAAGCCG ATAGCCAGTA CGCCGACGTG ATGGAGCGCG CGCTGTACAA TACCGTCCTC GGCGGTATGG CGCTGGATGG CAAGCATTTC TTCTACGTCA ACCCACTGGA AGTGCATCCA AAATCGTTAA AATTCAACCA TATTTACGAT CACGTTAAGC CCATCCGCCA GCGCTGGTTT GGCTGCGCCT GCTGCCCGCC GAACATCGCC CGCGTGCTCA CCTCCCTTGG TCACTACATC TACACGCCGC GTGCGGATGC GCTGTACATC AATATGTACG TGGGTAACAG CATGGAAATA CCGGTTGAAA ATGGCGCGCT CAAACTGCGA ATCAGCGGGA ACTACCCGTG GCATGAGCAG GTGAAGATCG CCATCGACTC TGTGCAGCCG GTACGTCACA CGCTGGCGCT ACGTCTGCCG GACTGGTGCC CTGAGGCAAA AGTGACGCTC AACGGGCTGG AAGTGGAGCA GGATATTCGC AAAGGTTATC TGCATATCCG TCGGACCTGG CAGGAGGGCG ATACGATAAC CCTGACGCTG CCGATGCCGG TTCGCCGCGT GTATGGCAAT CCGCTGGCGC GTCACGTCGC CGGTAAGGTC GCCATTCAGC GCGGGCCGCT GGTCTATTGC CTTGAGCAGG CCGATAACGG CGAAGAACTG CATAATCTGT GGTTACCGAA AGAGAGTGAG TTCCGGGTCT TTGAGGGCAA AGGGCTTTTT GCGCATAAGA TGCTGATTCA GGCTGAAGGC 
GAGAAGCAAA GCGCCCCAGA TGCGCAGCAT CAGGCGTTGT GGCACTACGA TAACGCGCCG TCATCGCGCC AGCCGCAGAC GCTAACGTTC ATTCCGTGGT TTAGCTGGGC CAACCGTGGC GAGGGCGAAA TGCGGATTTG GGTTAACGAG CGGTAA
|
Protein sequence | MNVLEVDLHK LTVSDPFLGQ YQQLVRDVVI PYQWDALNDR IPEAEPSHAI ENFRIAAGQQ TGDFYGMVFQ DSDVAKWLEA VAWSLCQKPD PALEKTADEV IELVAAAQCD DGYLNTYFTA KAPQERWSNL AECHELYCAG HLIEAGVAFF QATGKRRLLD VVCRLADHID STFGPGENQL HGYPGHPEIE LALMRLYEVT EQPRYMTLAS YFIGQRGTQP HFYDEEYEKR GQTSYWHTYG PAWMVKDKAY SQAHLPISQQ QTAIGHAVRF VYLMTGVAHL ARLSNDEGKR QDCLRLWKNM AQRQLYITGG IGSQSSGEAF SSDYDLPNDS VYAESCASIG LMMFARRMLE MEADSQYADV MERALYNTVL GGMALDGKHF FYVNPLEVHP KSLKFNHIYD HVKPIRQRWF GCACCPPNIA RVLTSLGHYI YTPRADALYI NMYVGNSMEI PVENGALKLR ISGNYPWHEQ VKIAIDSVQP VRHTLALRLP DWCPEAKVTL NGLEVEQDIR KGYLHIRRTW QEGDTITLTL PMPVRRVYGN PLARHVAGKV AIQRGPLVYC LEQADNGEEL HNLWLPKESE FRVFEGKGLF AHKMLIQAEG EKQSAPDAQH QALWHYDNAP SSRQPQTLTF IPWFSWANRG EGEMRIWVNE R
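
The two sequences can be spot-checked against each other by translating the first and last few codons of the gene. This is a minimal sketch, not a full translator: the `CODONS` map and the `translate` helper are hypothetical names introduced here, and the map lists only the codons needed for these two fragments (all of which have the same meaning in translation table 11 as in the standard code).

```python
# Partial codon map: only the codons used in the two fragments below.
CODONS = {
    "ATG": "M", "AAC": "N", "GTA": "V", "CTG": "L", "GAA": "E",
    "GTC": "V", "GTT": "V", "GAG": "E", "CGG": "R", "TAA": "*",
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment; spaces are ignored."""
    dna = dna.replace(" ", "")
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

first = translate("ATGAACGTAC TGGAAGTC")  # first six codons of the gene
last = translate("GTTAACGAG CGGTAA")      # last five codons, including the stop

print(first, last)  # MNVLEV VNER*
```

The output matches the start (`MNVLEV...`) and end (`...VNER`) of the listed protein sequence, with `*` marking the stop codon.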