Gene Information |
Locus tag | SNSL254_A2303 |
Symbol | |
ID | 6485593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2216401 |
End bp | 2218257 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642737648 |
Product | putative assembly protein |
Protein accession | YP_002041390 |
Protein GI | 194442823 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2982] Uncharacterized protein involved in outer membrane biogenesis |
TIGRFAM ID | |
|
|
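The coordinate, gene-length, and protein-length fields above agree with one another under the usual conventions: 1-based inclusive coordinates, and one terminal stop codon that is not counted in the protein length. A minimal sketch of that arithmetic, assuming those conventions (the function names are illustrative and not part of the record):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(2216401, 2218257)
print(length, protein_length(length))  # prints: 1857 618
```

Because the gene is on the minus strand, Start bp and End bp are still reported in ascending order on the replicon; strand does not change the length calculation.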
Plasmid Coverage Information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGAT TTCTGACGAC GCTGATGATT CTCCTGGTCG TGCTGGTGGC CGGATTCTCT GCGTTGGTAT TGTTGGTCAA TCCGAATGAT TTCCGCGCCT ATATGGTGCA GCAGGTTGCC GCGCGTAGTG AATATCAGTT GCAACTGGAC GGGCCGCTGC GCTGGCACGT ATGGCCGCAA CTCAGCATCC TTTCCGGGCG AATGACGCTA ACGGCGCGAG GCGCCAGCGA ACCGCTGGTG CGGGCTGACA ATATGCGCCT GGATGTCGCT TTATGGCCGC TGTTGAGTCA TCAGCTTCAT GTTAAGCAGG TGATGCTGAA AGGCGGGGTG ATTCAACTGA CGCCGCAAAC GGAAGCGGTA CGCAGCGATG ATGCGCCGGT CGCGCCGAAA GATAATACCT TGCCGGACGT AGCGGAAGAT CGCGGCTGGT CGTTTGATAT TCGTAGCCTG CGCGTTGCCG ACAGCGTGCT GGTTTTCCAG CATGAAAATG ATGATCAGGT CACGGTGCGC GATATTCGTC TGAATATGGA ACAAGATGCC GAGCATCGCG GCACGTTTGA TTTTTCGGGA CGCGTAAACC GCGATCAGCG GGATCTTGCT TTATCGTTCA GCGGAACGGT GGATGCTTCC GATTACCCGC ATAATTTAAC CGCGGGTATT GAGCAACTGC GCTGGCAATT GCAGGGCGCG GATCTACCAG CGCAGGGGAT TGAGGGGCAG GGACAATTGC AGGCGCAGTG GCAGGAGGCG CAAAAACGTC TTTCATTTAA CCACTTAAAT TTAACGGCGA ATGACAGTTC TCTGACCGGG CAAGTTCAGG TAACGCTGGC GGAGCAGCCG GAATGGCAAA TTGACCTCCA GTCCAGCAAG CTTAATCTGG ACAACCTGTT GCCGCATCAT AGCGCCGTGG CTCAGACAGG CGGCGCGGTG TCGCAGGGGC AAAATACGCT CCCCCTGACC AGGCCAGTTA TCGCGTCGCG TGTTGGTGCG CCGCCTTATA AGGGGCTGCA AAGCTTTACG GCGGAGATTG CTTTACAGGC GGATTCAGTT CGCTGGCGGG GAATGGACTT TACCCAGGTT TCGACGAAGA TGTCTAATCA GGCCGGATTG CTGGATATTA CTGAGTTGCA GGGGAAACTG GCCGACGGAG AGATGTCGCT GCCCGGCACG CTGGACGCCC GCACCGCCAG TCCGCGCATC GAATTCCACC CTCGCCTCAA CCATGTTGAG ATTGGGACTA TCCTGAAAGC CTTCAATTAT CCGATTAGCC TGACCGGTAA GATGTCGCTG GTTGGCGATT TCTCCGGGGC GGATATTGAC GCGGAGGCAT TTCGCCACAG CTGGAAAGGA AAAGCGCATG TTGACATGAG CAATACGCGC CTCGAAGGAA TGAATTTCCA GCAACTGGTG CAGCAGGCCG TAGAGCGAAG CGGCGGCGAC GCGCAGCAGT CGCAGGAAAA TATGGACAAC GCGACCCGAC TGGATCGCTT CACGACCGAT CTGACGCTAA ATAAGGGCAC GCTGACGCTC GACGACATGG TCGGGCAATC TTCTATGCTG GCGTTAACCG GCAGCGGAAC GCTGGATCTG GTTAAGCAGA GCTGTGATAC ACAATTTAAT CTGCGTGTGC TGGGCGGCTG GAACGGCGAT AGCAACCTGA TCACCTTCCT GAAAGAGACG CCGGTGCCGC TGCGCGTCTA TGGCAAGTGG CAGGAGCTGA ACTATACCCT GCAAGTGGAT CAGTTATTGC GCAAGCATTT ACAGGATGAG GCGAAGCGTC GGCTCAACGA CTGGGCGGAT 
CGCAATAAAG ATACCCGCAA CGGTAAAGAT GTGAAGAAAC TGCTGAATAA GCTGTAG
|
Protein sequence | MRRFLTTLMI LLVVLVAGFS ALVLLVNPND FRAYMVQQVA ARSEYQLQLD GPLRWHVWPQ LSILSGRMTL TARGASEPLV RADNMRLDVA LWPLLSHQLH VKQVMLKGGV IQLTPQTEAV RSDDAPVAPK DNTLPDVAED RGWSFDIRSL RVADSVLVFQ HENDDQVTVR DIRLNMEQDA EHRGTFDFSG RVNRDQRDLA LSFSGTVDAS DYPHNLTAGI EQLRWQLQGA DLPAQGIEGQ GQLQAQWQEA QKRLSFNHLN LTANDSSLTG QVQVTLAEQP EWQIDLQSSK LNLDNLLPHH SAVAQTGGAV SQGQNTLPLT RPVIASRVGA PPYKGLQSFT AEIALQADSV RWRGMDFTQV STKMSNQAGL LDITELQGKL ADGEMSLPGT LDARTASPRI EFHPRLNHVE IGTILKAFNY PISLTGKMSL VGDFSGADID AEAFRHSWKG KAHVDMSNTR LEGMNFQQLV QQAVERSGGD AQQSQENMDN ATRLDRFTTD LTLNKGTLTL DDMVGQSSML ALTGSGTLDL VKQSCDTQFN LRVLGGWNGD SNLITFLKET PVPLRVYGKW QELNYTLQVD QLLRKHLQDE AKRRLNDWAD RNKDTRNGKD VKKLLNKL
|
| |
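The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the spaces are only 10-base display blocks. Table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG. The helper names `translate` and `gc_percent` are illustrative.

```python
from itertools import product

# NCBI compact codon table: codons enumerated with bases in T, C, A, G order.
_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), _AAS)}


def clean(seq: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAGACGAT TTCTGACGAC GCTGATGATT"))  # prints: MRRFLTTLMI
```

Running `translate` on the full gene sequence and comparing the result with the protein sequence (after removing its display spaces) is a quick consistency check; `gc_percent` on the same input can be compared against the GC content field.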