Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4387 |
Symbol | |
ID | 6486784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4257722 |
End bp | 4260163 |
Gene Length | 2442 bp |
Protein Length | 813 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642739627 |
Product | gp24 |
Protein accession | YP_002043321 |
Protein GI | 194445727 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.0182 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAATA ACCTGAGGCT TGAGGTATTG CTGAAAGCGG TCGACCAGGC GACCCGACCG CTTAAATCCA TCCAGACCGC GAGTAAAACT CTGTCGGGTG ATATTCGCAA CACACAAAAG GGGCTGCGCG ACCTGAACGG TCAGGCGTCG AAAATCGACG GCTTTCGTAA GGCAAGCGCG CAACTGGCCG TAACCGGTCA GGCGCTTGAT AAGGCGAAGC GCGAAGCCGG TGAGCTGGCT GCGCAGTTTA AAAACACTAC CAGTCCGACC CGCGCGCAGG CGCAGGCGCT CGAAGCGGCA AAACGTGCCG CCTCTGAGCT GCAGATGAAA TATAACAGCC TGAGAACATC GGTACAGCGC CAGCGCTCCG AGCTGATGCA GGCCGGTATT AACATCCGCA CCCTGTCTGC CGATGAGCGT CGGCTCAAAA CCTCCATCAG CGAAACGACG GTGCAGCTTA ACCGCCAGCG TGAGGCACTG GCGCGTGTCA GTGCGCAGCA GGCGAAATTA AGCCGGGTGA AAGAACGATA CAAATCAGGT AAAGAGCTTG CCGGTAACAT GGCTGCAGCA GGTGCTGCCG GGGTCGGTAT TGCGACGGCG GGAACGATGG CCGGGGTTAA ATTACTGATG CCCGGTTATG ACTTTGCACA GAAAAACTCC GAGCTGCAGG CTGTGCTCGG GGTTGATAAA CAGTCGCCAG AAATGCAGGC GTTACGTAAA CAGGCGCGCC AGCTCGGTGA CAATACCGCC GCTTCTGCTG ATGATGCAGC CAGTGCGCAG ATTATCATTG CAAAAGGTGG TGGTGATGCG GCAGCTATAG CGGCTATGAC GCCTGTGACT CTCAACCTGT CACTTGCGAA CAGAAAAACA ATGGAGGAAA ACGCGCAACT GTTGATGGGG ACAAAAGCCG CCTTTCAGCT TTCTAATGAC GCGGCTGCAC ATATTGGTGA TGTTCTTTCA ACCACGATGA ACAAAACCAC CGCTGATTTT CAGGGACTAA GTGACTCATT AAGTTACCTT GCCCCTGTTG CGAAAAATGC CGGAGTGAGT CTTGAACAAG CGGCGGCGAT TACCGGCACA CTTCATGATA ATAACATCAG GGGGTCAATG GCTGGGACGG GCGGCGCGGC TGTAATAACG AGACTACAGG CACCAACAGG CAAAGCATAC GATGCCCTCA AAGAGTTGGG TGTTAAAACC TCGGACAGCA AAGGCAATAC GCGCCCGTTA TTTACCATCC TGAAAGAAAT GCAGGCCAGT TTTGAGCGCA ACAAGCTCGG AACTGGTCAG AAAGCTGAAT ATGTGAAAAC CATATTCGGC GAGGAGGCCA TGAAGTCTGC AAGTGTGCTG ATGGCCGCAG CGACAAGCGG AAAGCTCGAT AAACTCACCG CTACGATAAA GGCATCCGAC GGAAAAACCG AGGAACTGGT CAAGGTTATG CAGGACAACC TCGGCGGCGA CTTCAAAGAG TTCCAGTCTG CTTATGAGGC TGTCGGTACC GACCTATACG ACCAGCAAGA TAGCTCACTG CGCAAGCTCA CCCAAACCGC CACGCAGTAT GTGTTAAAAC TCGACAACTG GATCAAGGAT AACAAGGAAT TAGCGGAAAC TATCGGCATC ATCGCCGGTG GCGCACTTGC TCTGATTGGT ATTATCGGCG GCATTGGTCT CGTTGCGTGG CCGGTTGTCA TGGGGATTAA CGCCATTATT GCCGCTGCTG GCGTGCTGGG TACGGTCTTT ACTGTTGCTG GTAGTGCCAT TGTGACAGCG CTCGGTGCGA TTACCTGGCC GATTGTGGCT GTTGGTGCGG CGATTGTGGC CGGGGCGCTA CTTATTCGTA AATACTGGGA GCCCATCAGC GCATTTTTTA CGGGGGTGAT TGAGGGCATC ATGAGCGCCT TTGCGCCGGT CGGGGAAATG TTCGCACCAC TGGCTCCCAT CTTTGACGGA CTCGGTGAGA AGCTGCGCGG CGTCTGGCAG TGGTTTAAAG ACCTGATTGC ACCGGTCAAG GCCACGCAGG AGACGCTTGA TAGCTGCAAA AATGTCGGCG TCATATTTGG TCAGGCACTG GCCGATGCGC TGATGTTGCC TCTGAATATT TTCAATAAGC TGCGTGGTGG TCTCGATGTA ATTCTCGAAA AGCTCGGCCT TGTTAAAAAG GAATCGAGCA GTATTGATAC GGAAACGGCA AAAACGCCGC TGGTTGGTCA GGGTGGAGGG TATATTCCGA CAACCAGCTC GCTTGGTGGG TATCAGGCTT ATCAGCCTGT CACGGCCCCC GCCGGTCGTA CCTATATTGA CCAGAGCAGC CCGACCTATC AAATCAACCT GCCGGGTGGC GGCGCGCCGG GTGGTCAATT GGGTAACCAG TTGCAGGATG CGTTAGAAAA ATATGAACGC GACAAGCGAG CCAAAGCCCG CGCTAGCATG ATGCACGATT AA
|
Protein sequence | MSNNLRLEVL LKAVDQATRP LKSIQTASKT LSGDIRNTQK GLRDLNGQAS KIDGFRKASA QLAVTGQALD KAKREAGELA AQFKNTTSPT RAQAQALEAA KRAASELQMK YNSLRTSVQR QRSELMQAGI NIRTLSADER RLKTSISETT VQLNRQREAL ARVSAQQAKL SRVKERYKSG KELAGNMAAA GAAGVGIATA GTMAGVKLLM PGYDFAQKNS ELQAVLGVDK QSPEMQALRK QARQLGDNTA ASADDAASAQ IIIAKGGGDA AAIAAMTPVT LNLSLANRKT MEENAQLLMG TKAAFQLSND AAAHIGDVLS TTMNKTTADF QGLSDSLSYL APVAKNAGVS LEQAAAITGT LHDNNIRGSM AGTGGAAVIT RLQAPTGKAY DALKELGVKT SDSKGNTRPL FTILKEMQAS FERNKLGTGQ KAEYVKTIFG EEAMKSASVL MAAATSGKLD KLTATIKASD GKTEELVKVM QDNLGGDFKE FQSAYEAVGT DLYDQQDSSL RKLTQTATQY VLKLDNWIKD NKELAETIGI IAGGALALIG IIGGIGLVAW PVVMGINAII AAAGVLGTVF TVAGSAIVTA LGAITWPIVA VGAAIVAGAL LIRKYWEPIS AFFTGVIEGI MSAFAPVGEM FAPLAPIFDG LGEKLRGVWQ WFKDLIAPVK ATQETLDSCK NVGVIFGQAL ADALMLPLNI FNKLRGGLDV ILEKLGLVKK ESSSIDTETA KTPLVGQGGG YIPTTSSLGG YQAYQPVTAP AGRTYIDQSS PTYQINLPGG GAPGGQLGNQ LQDALEKYER DKRAKARASM MHD
|
| |