Gene Information |
Locus tag | SNSL254_A4379 |
Symbol | |
ID | 6483584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4251570 |
End bp | 4252478 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642739621 |
Product | baseplate assembly protein J |
Protein accession | YP_002043315 |
Protein GI | 194443982 |
COG category | [R] General function prediction only |
COG ID | [COG3948] Phage-related baseplate assembly protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.443477 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.00000851022 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAACTG TTGACCTGAG TCTGCTACCT GTTCCTGATG TGGTCGAGGA ACTGGACTAT GAAACTATCC TTGCGGAGCG CATTGCAACG CTGATTTCGC TCTATCCGGA AAACCAGCAG GAAGCCGTCG CCCGGACGCT CGCACTTGAG TCTGAGCCAA TTGTTAAATT GCTGCAGGAA AACGCCTACC GCGAGGTTAT CTGGCGTCAG CGTGTCAATG AAGCTGCACG CGCAGTGATG CTGGCTTATG CCATAGACAG TGACCTCGAT AATATCGGGG CGAATTTCAG TGTTGAGCGC CTTGTCGTCA CGCCTGCTGA TGACACCACC ATTCCACCCA CTCCGGCAGA AATGGAACTC GACGCAGATT ATCGTCTGCG TATACAGCAG GCTTTTGAGG GACTGAGCGT GGCGGGGTCT GTCGGATCGT ACCAGTATCA TGGCCGTAGT GCTGACGGGC GCGTCGGCGA TATTTCAGTT ATCAGCCCGT CGCCAGCCTG TGTGACGATT TCCGTGCTGT CTCGTGAAAA CAACGGCGTC GCATCTGAGG AACTGCTTGC AATTGTGCGC AATGCCCTGA ACGCAGAAGA TGTCAGGCCG GTAGCTGACC GGGTGACGGT ACAGTCAGCC GAAATCGTTA ACTACCAGAT TAACGCCACG CTTTATCTTT ATCCCGGCCC GGAAAGTGAA CCCATCAGGG CGGCGGCTGA GGCAAAGCTG AAAGCCTATA TCAGCGCGCA GCACCGCCTC GGGCGCGATA TCCGTAAATC AGCGATTTAT GCCGCCCTGC ATGTTGAGGG TGTTCAGCGG GTGGAGCTGG CGGCACCGGT CACGGATATT GTTCTCGATA ACACACAGGC GTCCTTTTGC ACTGACTACA GCCTTGTAAT CGGGGGCTCT GATGAATGA
|
Protein sequence | MATVDLSLLP VPDVVEELDY ETILAERIAT LISLYPENQQ EAVARTLALE SEPIVKLLQE NAYREVIWRQ RVNEAARAVM LAYAIDSDLD NIGANFSVER LVVTPADDTT IPPTPAEMEL DADYRLRIQQ AFEGLSVAGS VGSYQYHGRS ADGRVGDISV ISPSPACVTI SVLSRENNGV ASEELLAIVR NALNAEDVRP VADRVTVQSA EIVNYQINAT LYLYPGPESE PIRAAAEAKL KAYISAQHRL GRDIRKSAIY AALHVEGVQR VELAAPVTDI VLDNTQASFC TDYSLVIGGS DE
|
| |
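The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its terminal stop codon (the gene sequence above ends in TGA, which is consistent with that assumption).

```python
# Values copied from the Gene Information table.
START_BP = 4251570
END_BP = 4252478
REPORTED_GENE_LENGTH = 909      # bp
REPORTED_PROTEIN_LENGTH = 302   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a whole number of codons; the stop codon,
    if present, does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length matches:", length == REPORTED_GENE_LENGTH)
    print("protein length matches:",
          protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both checks agree with the reported values: 909 bp is 303 codons, and one of them is the stop codon.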