Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4383 |
Symbol | |
ID | 6484064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4255458 |
End bp | 4256645 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642739625 |
Product | phage tail sheath protein |
Protein accession | YP_002043319 |
Protein GI | 194444297 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.301789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.000355682 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGATT TTCACCACGG CACGCAGGTC ATCGAAATTA ATGACGGTAC GCGTGTTATT TCCACAGTAG CGACTGCGGT CGTCGGCATG GTTTGTACAG CCAGCGATGC AGATGCCACG CTATTTCCCC TCAATGAACC GGTACTGATT ACCAATGTGC AAAGCGCCAT TGCGAAAGCC GGTAAAAAAG GCACGCTGGC TGCATCACTG CAGGCCATTG CAGACCAGTC AAAACCCGTC ACTGTTGTTG TACGTGTTGA GGATGGAACC GGCGATGACG AGGAAGCTGC GCTCGCACAG ACTGTTTCCA ACATTATCGG AGGTACGGAT GAGAACGGTA AATACACCGG TATCAAGGCT CTCCTGACCG CTCAGGCCGT CACCGGCGTC AAGCCGCGTA TTCTTGGGGT GCCGGGGCTG GATACTAAAG AGGTCGCGGT CGCGCTTGCG TCGGCTGCCA TTAAGTTACG TGCATTTGCT TACGTCAGCG CGTGGGGATG TAAGACTATT TCCGAAGCGA TGGAATATCG TAAAAATTTC AGCCAGCGCG AGTTGATGGT TATCTGGCCT GATTTCCTCG CATGGGACAC CGTCAAAAAT ACCACCGCAA CGGCTTACGC CACTGCGCGT GCACTCGGCC TGCGTGCTTA CATCGACCAG ACTGTCGGCT GGCACAAAAC CCTGTCTAAC GTTGGTGTAC AGGGAGTTAC CGGCATCAGC GCCTCAGTGT TCTGGGATTT GCAGGCATCC GGCACCGATG CTGACCTGCT CAACGAGGCC GGGGTTACAA CGCTGGTACG CAAGGACGGT TTCCGTTTCT GGGGTAACCG CACCTGCTCA GATGACCCGC TTTTTCTGTT TGAGAACTAC ACCCGCACCG CGCAGGTACT GGCCGACACG ATGGCTGAGG CGCACATGTG GGCGGTCGAT AAGCCCATTA CCGCTACGCT CATTCGTGAC ATTGTTGACG GAATCAATGC CAAATTCCGC GAGCTGAAAT CAAACGGCTA CATCGTGGAG GGTAAATGCT GGTTCGATGA GGAATCGAAC GACAAGGAAA CCCTCAAGGC CGGGAAACTG TATATCGACT ACGACTATAC ACCGGTTCCG CCACTGGAAA GCCTGACCCT GCGCCAGCGT ATCACCGATA AATATCTGGT GAATCTGGCC GAATCGGTCA ACAGCTAA
|
Protein sequence | MSDFHHGTQV IEINDGTRVI STVATAVVGM VCTASDADAT LFPLNEPVLI TNVQSAIAKA GKKGTLAASL QAIADQSKPV TVVVRVEDGT GDDEEAALAQ TVSNIIGGTD ENGKYTGIKA LLTAQAVTGV KPRILGVPGL DTKEVAVALA SAAIKLRAFA YVSAWGCKTI SEAMEYRKNF SQRELMVIWP DFLAWDTVKN TTATAYATAR ALGLRAYIDQ TVGWHKTLSN VGVQGVTGIS ASVFWDLQAS GTDADLLNEA GVTTLVRKDG FRFWGNRTCS DDPLFLFENY TRTAQVLADT MAEAHMWAVD KPITATLIRD IVDGINAKFR ELKSNGYIVE GKCWFDEESN DKETLKAGKL YIDYDYTPVP PLESLTLRQR ITDKYLVNLA ESVNS
|
| |