Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2023 |
Symbol | |
ID | 6482172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 1968482 |
End bp | 1969363 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642737382 |
Product | tail fibre assembly protein |
Protein accession | YP_002041132 |
Protein GI | 194443424 |
COG category | [R] General function prediction only |
COG ID | [COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.0278369 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATATCG GTCCACACGG ACACGTCGTT ATTGTGGACG CAGACGGTAA TGCGGAAACC TTTGGTCTTA TGGACGGCGG TGTGGATGCT GCTATTACGG CATATTTCGG GTCGCAATTA CAGGAACGGG TACAGCAAAA TATCATCCGT GAATACCTGG GGGAACAGCC CGTCGGCACC GCCTTTGTTA TTGAAACGGG TAACAGTAAA CATCCGTGGC TGGTTCCCGC CCCGACGATG CGCGTTCCGC TGATTATTGA CGGCACCGAC GCGGTTTATA ATGCAACACG GGCTGCGTTA CTGGCAATTT TTCAGCACAA TAAAAGCGCC GGAGAAGACC GGAAAATTAC ATCTGTTGCA TTACCTGCAA TGGGGGCCGG ATGTGGTCAG GTCCCCCCCG GACAGCGTCG CCCGGCAAAT TGTACTGATA TAGCCCCTCC TGATATTCCC TCCAGTCATA TTGCTGTTTT TGACGCTGAA ACCCAAACGT GGAGTCTGCA GGAGGATCAC CGCGGCGAGA CGGTTTACGA CACAACAGCC GGCAATCAGG TTTATATCTC CGATCTTGGT CCGCTACCTG AAAACGTCAC ATCAGTTTCA CCAGGTGGTG GATACAAAAA ATGGGATAGT AAGGCTCAGG TCTGGATGAA TGATGAAGCT GCGGAGGCCG CAGCCAGACT TCGTGAAGCT GAAGGAACGA AAAACAGACT CCTGCAAATA ACGTCTGAAA AAATCGCGCC GTTACAGGAT GCAGTGGATC TGGACGAAGC AACCAATAAA GAAAAAGCTT CTCTTCTGGC ATGGAGAAAG TACCGGGTAC AGGTAAACCG TGTTGATACT TTAAAGCCTG TCTGGCCGGA GAAACCAGCC AGTAGTTTAT AA
|
Protein sequence | MYIGPHGHVV IVDADGNAET FGLMDGGVDA AITAYFGSQL QERVQQNIIR EYLGEQPVGT AFVIETGNSK HPWLVPAPTM RVPLIIDGTD AVYNATRAAL LAIFQHNKSA GEDRKITSVA LPAMGAGCGQ VPPGQRRPAN CTDIAPPDIP SSHIAVFDAE TQTWSLQEDH RGETVYDTTA GNQVYISDLG PLPENVTSVS PGGGYKKWDS KAQVWMNDEA AEAAARLREA EGTKNRLLQI TSEKIAPLQD AVDLDEATNK EKASLLAWRK YRVQVNRVDT LKPVWPEKPA SSL
|
| |