Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4554 |
Symbol | |
ID | 6486621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4425131 |
End bp | 4426558 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642739780 |
Product | phage tail sheath protein |
Protein accession | YP_002043462 |
Protein GI | 194442415 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00163736 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.433622 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCTA ATTACCTGCA CGGTGTAGAG ACCATTGAGA TCGAAACCGG CCCACGTCCG GTTAAGGCGG TTAAATCTGC GGTTATTGGT CTGATCGGCA CCGCGCCATG CGGCCCGGTT AACCAGCCAA CGCTGTGCCT TTCTGAAAGC GACGCGGCAC AGTTTGGCCC AGGTCTGGCA AATTTCACCA TCCCGCAGGC GCTGAAGGCG ATCTACGATC ACGGCGCAGG GACGGTCGTG GTGATTAACG TGCTGAATCC GGCGGTACAC AAAAGTACCA TTCCCAGTGA AACCGTGAAG GTTGATGACA ACGGTCAGAT TCAACTCAAG CACGGGGCCG TGCAAACGAT GAACATTGGC CGCAGCACGA ACGCCGGAAA CGCTTATATC AAAGGCACCG ATTACACCAT TGATATGCTG ACCGGTAAAA TCACCTGCAT GGGGACCAAC CTGAAACCCG GCGTTCAGGC CTATGTAAAT TATACTTACG CGGACCCCAC CAAAGTGACT GCCGCCGATA TTGTTGGCGC GGTAAACACG GCGGGTGACC GTACCGGCAT GAAGCTGTTA CAGGACACCT GGAACCAGTT TGGTTTTTAC GCAAAGATCC TGATTGCGCC GGTCTTTTGT ACGCAAAACT CGGTCGCCGT TAAGCTTATC GCTCAGGCAG AAGCGCTGGG AGCCATTACC TACATTGATG CGCCCATCGG CACGACTTTC CAGCAGGTTT TGGCAGGGCG CGGCCCGCAG GGGGCGATTA ACTTCAATAC CAGTTCCGAT CGCGCGCGTC TGTGCTATCC GCACGTTAAA GTGTACGACA GCGCGACGGA CAGCGAAGTC CTGGAGCCGC TCTCCTCCCG CGCCGCTGGC CTGCGTGCCA AAGTGGATCT GGAAAAAGGC TTCTGGTGGA GCAACTCCAA TCAGGAAATT CAGGGCATTA CCGGCGTAGA GCGCTCGCTG TCAGCGATGA TCGACGATCC GCAAAGCGAA GTGAATCAAC TGAATGAAAA CGGCATCACC ACCATCTTCA ACAGCTATGG CTCCGGTTTG CGCCTGTGGG GCAACCGTAC CGCCGCCTGG CCGACGGTTA CTCATATGCG TAACTTTGAG AACGTGCGCC GTACCGGCGA TGTAATCAAC GAATCGATTC GTTATTTCAG CCAGCAGTAT ATGGATATGC CGATAAACCA GGCGCTGATC GACGCGCTGA CCGAATCGGT GAACACCTGG GGCCGCAAGC TGATTGCCGA CGGCGCGTTG TTGGGCTTTG AATGCTGGTA CGACCCGGCG CGTAACGAAC AGACTGAACT GGCAGCCGGG CATCTGTTGC TGAGCTACAA ATTCACCCCG CCGCCGCCAC TGGAACGTCT GACGTTTGAA ACCGAAATTA CCTCTGAATA TTTAGTTTCT CTGGAGAGCA ATCGCTAA
|
Protein sequence | MAANYLHGVE TIEIETGPRP VKAVKSAVIG LIGTAPCGPV NQPTLCLSES DAAQFGPGLA NFTIPQALKA IYDHGAGTVV VINVLNPAVH KSTIPSETVK VDDNGQIQLK HGAVQTMNIG RSTNAGNAYI KGTDYTIDML TGKITCMGTN LKPGVQAYVN YTYADPTKVT AADIVGAVNT AGDRTGMKLL QDTWNQFGFY AKILIAPVFC TQNSVAVKLI AQAEALGAIT YIDAPIGTTF QQVLAGRGPQ GAINFNTSSD RARLCYPHVK VYDSATDSEV LEPLSSRAAG LRAKVDLEKG FWWSNSNQEI QGITGVERSL SAMIDDPQSE VNQLNENGIT TIFNSYGSGL RLWGNRTAAW PTVTHMRNFE NVRRTGDVIN ESIRYFSQQY MDMPINQALI DALTESVNTW GRKLIADGAL LGFECWYDPA RNEQTELAAG HLLLSYKFTP PPPLERLTFE TEITSEYLVS LESNR
|
| |