Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0975 |
Symbol | |
ID | 6486573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 979124 |
End bp | 980074 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642736381 |
Product | VirK-like protein |
Protein accession | YP_002040141 |
Protein GI | 194443072 |
COG category | [S] Function unknown |
COG ID | [COG2990] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.587318 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 92 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAGC TTACAGATAA TACCTGGTAC ACTTCAGATT ACATCTCTCC TTTACAATTA TTTATTCGTC TGACGCGCGG GCAATTACAG CCAGGAAAAT TCTGGCGTAA AGCCAGCTTT CGTCGCAAGT TTTTAATCCG CTCATTAGTC ATGCCGCGTG CGACCAGCCA ATTACTGACC AATCTCACCC AATGGCCGGA GTTAAATACG TTACTTGCTC GTCAGCCGCG TTTGCCTATT CGGCTACATC GTCCTTATAT GGCTGTAAAT ATTAAACGCG ATTTCGCCTT AGATGCGCTG TGTTTTCATT ATCAACAGAT GCGCCAACTT TTATCGCGGG AACAACAAGT TAGCTATTTA AGTCAGTATG GCCTGAATCT TGCTAAATTT GAAACTAAAA CCGGCGAGTT GTTTCAACTT GATTTAGTCA GTCTGGTCTC ACTGGATAAA GAAGGTGAAA GCACTATTGT TGTTCGCGAC GCACAGTTAC GTATTCTGGC AGAGATCACA TTTACCCTGT GTCGCTTTAA CCAACAACGC ACACTATTTA TCGGCGGATT ACAGGGCGCG GCAAACGACG TCCCTCACGA AATTATCCAG CAGGCGACCA AAGCCTGCCA TGGTCTTTTT CCCAAACGCA TCGTGATGGA GGCTCTCTGT CAGTTTGCGC AAGTCTTTCA GGCAGAAAAG ATTATAGCTG TGAGTAACGA TGCGCACGTT TACCGTAGCT GGCGATACAT GGATAAAAAG ACGCAAATGC ATGCCGACTA TGACGCGTTT TGGGAATCGT TAGGCGGTGA AAGAATTAAA GGGAATTACT ATACGCTGCC GCTGGCGATC GCCCGAAAAA GCGAAGCGGA GATCGCCAGT AAAAAACGGG CCGAGTATCG CCGCCGCTAT GCATTACTCG ATAGCGTCGT CGAACAGGTT CCGGCCACAT TCAAACATTA A
|
Protein sequence | MTQLTDNTWY TSDYISPLQL FIRLTRGQLQ PGKFWRKASF RRKFLIRSLV MPRATSQLLT NLTQWPELNT LLARQPRLPI RLHRPYMAVN IKRDFALDAL CFHYQQMRQL LSREQQVSYL SQYGLNLAKF ETKTGELFQL DLVSLVSLDK EGESTIVVRD AQLRILAEIT FTLCRFNQQR TLFIGGLQGA ANDVPHEIIQ QATKACHGLF PKRIVMEALC QFAQVFQAEK IIAVSNDAHV YRSWRYMDKK TQMHADYDAF WESLGGERIK GNYYTLPLAI ARKSEAEIAS KKRAEYRRRY ALLDSVVEQV PATFKH
|
| |