Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2845 |
Symbol | |
ID | 6484249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2777919 |
End bp | 2779148 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642738168 |
Product | phage integrase family protein |
Protein accession | YP_002041901 |
Protein GI | 194442657 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.809863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.121401 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATTT CAGATAGTTA TCTAAAGTCG TGCCTCGGGC GCGAACGCGA TAAAGTTGAG GAGAAGGCAG ACCGGGACGG TCTGTGGGTG CGCATTTCCA AAAAGGGCGC CGTCACTTTT TTCTACCGAT TCCGTTTCCT GGGTAAGCAG GACAAGATGA CGATCGGCAA TTACCCGGAA TTCGGGTTGA AGGCCGCGCG CGAAGAGGTA ACTAAGTGGG CCGCCATTCT TGCCCGGGGA GAAAATCCGC GGATCAGGCA AAGCCTCGAT AAAGCTAAAA TCAACAGCCA GTACACATTC GAGGAACTTT TCAGAGAATG GCATGCGATG GTATGCGTTC AGAAAGAAAC ATCCGATCAA ATACTGCGTT CGTTTGAGCT GCATGTATTC CCTAAGCTGG GTAAGTATCC GGCGCATCAG TTGACGCTGC ACAACTGGCT TACAGTTCTG GACCGACTGG CGCAGGGATA CACTGAGATC ACCCGGCGAG TAATTAGTAA CGGTCGGCAG TGCTACTCAT GGGCAGTGAA GCGCCAGTTG CTTGAGGTTA ACCCACTTTC TGAAATGTCC GGTCGTGATT TTGGTATTCA GAAGAAAATG GGGGAGAGAA CACTGGATCG CAAAGAAATT GCGATTGTCT GGCGAGCTAT TGAGGATTCC CGCCTTATTG AGCGAAACAA GATCCTTTAT AAATTGTCCC TGATATGGGC GTGCAGGGTC GGTGAACTCC GTCAGGCTGA AGTTTCGCAT TTCGATTTTG AGGAGGGCGT CTGGACCGTA CCGTGGGAAA ATCATAAAAC GGGACGGAAA AGTAAGAAGC CGATAATCCG CCCGATCATC CCTGAAATGC TCCCGCTGAT ACAACGAGCC ATTGAGCTGG CGCCAGGCCG TTTTGTTTTC TCAAAATATG CAGACAAGCC GATGAGCGAA GGCTTTCATA TGAGCATCAG CAGCAACCTT GTTAAGTTCA TGCTGAAGGC TTATAACGAG CAGGTTCCGC ACTTTACGAT CCATGATCTA CGCAGAACTG CGCGAACGAA TTTCTCCGAG CTGACTGAAC CACATATTGC TGAGATGATG CTCGGGCACA AACTGCCTGG AGTGTGGTCG GTGTACGACA AATACACCTA TATCGAAGAA ATGAGAGAAG CTTATAGTAA ATGGTGGGCC CGACTGATGA GCATCATCGA GCCCGATGTT CTGGAGTTCA CGCCACGTCA GACCGGATGA
|
Protein sequence | MAISDSYLKS CLGRERDKVE EKADRDGLWV RISKKGAVTF FYRFRFLGKQ DKMTIGNYPE FGLKAAREEV TKWAAILARG ENPRIRQSLD KAKINSQYTF EELFREWHAM VCVQKETSDQ ILRSFELHVF PKLGKYPAHQ LTLHNWLTVL DRLAQGYTEI TRRVISNGRQ CYSWAVKRQL LEVNPLSEMS GRDFGIQKKM GERTLDRKEI AIVWRAIEDS RLIERNKILY KLSLIWACRV GELRQAEVSH FDFEEGVWTV PWENHKTGRK SKKPIIRPII PEMLPLIQRA IELAPGRFVF SKYADKPMSE GFHMSISSNL VKFMLKAYNE QVPHFTIHDL RRTARTNFSE LTEPHIAEMM LGHKLPGVWS VYDKYTYIEE MREAYSKWWA RLMSIIEPDV LEFTPRQTG
|
| |