Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4539 |
Symbol | |
ID | 6482395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4413986 |
End bp | 4415101 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642739765 |
Product | hypothetical protein |
Protein accession | YP_002043447 |
Protein GI | 194446311 |
COG category | [R] General function prediction only |
COG ID | [COG3948] Phage-related baseplate assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATAG CCGAACCCGA CTTTATTGAC CGCGATCCCG CGCAAATCAC CAGCGAGATG ATTGCGCAAT ATGAAGAAGC CAGCGGTAAA AAACTCTATC CGGCGCAGGC TGAGCGGCTG CTCATTGACC TGTTTGCTTA TCGTGAAAAC CTTGTCCGCA TCGCCATCCA GGAGGCAGCG AAGCAAAACC TGGTCGCGTA TTCCCGTGCG CCGATGCTGG ATTATTTAGG CGAGCTGGTT GGCGTTCACC GTCTGCCCGC TCAGGCGGCA AAAACCACGC TGCAGTTTTC TGTTACTCAA GCGGCTAAAA GTAACCTGGT GATTCCACAG GGTACCCGCG CCAGCGCGTC GGATAGCGTG ATGTTCGCCA CCGACGAAGA TGTTCTGTTG CCTGCGGGCA GCCTGAGCGT TGCGGTAACT GCAACCTGTG TAGCAACCGG TGAATCCGGC AATAACTGGC AACCTGCGCA AATCAGCGCG CTGGTGGATC GGGTAGGCAA TTACGATCTC AGCGTCACCA ATCTGACGGC CTCAAGTGGC GGCTGCGGCG AAGAGAACGA CGACGCGCTA CGTAAACGCA TCCAGCTAGC GCCGGAAAGT TTCAGCAACG CGGGCAGCTA TGGCGCCTAT CGCTTCCATA CGCTCTCGGT CAGCCAGTCG ATTATCGACG TGGCGGTGCT GGGGCCGGAT GAAGGGCTGG CGGAAGGCTG CGTGGAACTC TATCCGCTGA CCCTGAACGG TCTGCCGGGG CCAGAGCTTC TTGCCCAGAT CGAACGGGAG GTGAGCAAAG AGAAAAAGCG CCCGCTAACC GATAAGGTGA GCGCTAAATG TTCTCCGCGC GTGGCTTATC AGATCCGCGC CCGGTTGACG CTGTTTACCA CCGCCGATCA GGAGACGACG CTTGCCGCCG CGCGTGAAGC GATTAATACA TGGACGCGCT CGCGCCAGAC CCGGCTGGGC CAGGACATTG TGCCAAACCA GATAATTAAA GTATTACAGG TTGACGGCGT TTACGACGTC GCGCTGGATA TGCCTGCGAA AAAGGTACTA CAGGCGCACG AATGGGCGGA ATGCACGGCC ATTGACGTGA CGATTGCCGG AGTCAGCGAT GGATAA
|
Protein sequence | MAIAEPDFID RDPAQITSEM IAQYEEASGK KLYPAQAERL LIDLFAYREN LVRIAIQEAA KQNLVAYSRA PMLDYLGELV GVHRLPAQAA KTTLQFSVTQ AAKSNLVIPQ GTRASASDSV MFATDEDVLL PAGSLSVAVT ATCVATGESG NNWQPAQISA LVDRVGNYDL SVTNLTASSG GCGEENDDAL RKRIQLAPES FSNAGSYGAY RFHTLSVSQS IIDVAVLGPD EGLAEGCVEL YPLTLNGLPG PELLAQIERE VSKEKKRPLT DKVSAKCSPR VAYQIRARLT LFTTADQETT LAAAREAINT WTRSRQTRLG QDIVPNQIIK VLQVDGVYDV ALDMPAKKVL QAHEWAECTA IDVTIAGVSD G
|
| |