Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2801 |
Symbol | |
ID | 6486507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2743218 |
End bp | 2746313 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642738124 |
Product | gifsy-1 prophage VmtH |
Protein accession | YP_002041858 |
Protein GI | 194443589 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 0.617877 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCAGA TAGCGAACCT GGTCATTGAT TTAAGTATCG ACAGCGCAGA GTTCCGAAAC GAAGTTCCGC GCATTAAAAA ATTGCTGAAC GATGCGGCTG GTGACTCAGA ACGTTCAGCG GCCCGGATGC AGCGTTTTCT GGATAAGCAG ACGGAGGCGA CGCGCCGGAC GTCCGCCAGT CTGGAGCAAG TGACTGCCAG CAGTACCGCG TACAGTTCCG CTGTGGAGAA AAGCGCAGCG GCCAGTACGC GTCTGGCGGC GGATGTGGAT CAGACGCGAC AGCGGGTGGA GGCACTGGGA AGGAAACTGC GTGAGGAACA GGCGCAGTCA GCGGCTGTGG CGGCAGCACA GGACAGGACA AGTGCTGCTT TTTACCGCCA GATTGACAGT GTAAAACAGT TAAGCGGTGG TCTGCAGGAG CTGCAGCGTA TCCAGGCGCA GGTACGACAG GCGAAAGGAC GCGGAGATAT CTCACAGGGC GATTATCTGG CGCTGGTGTC TGAAACCGCC AGGAAGACCC GTGAGCTTAC CGATGCCGAA GCGCTGGCCA CGCAGAAAAA AGCACAGTTT ATACGCCGCC TGAAAGAGCA GACGACGGTA CAGGGCCTCT CCCGTACCGA GCTGCTGCGG GTGAAGGCGG CTGAACTGGG TGTCAGTAGC GCCGCAGATA TTTATATCCG TAAACTGGAG CGTACCGGAA CTGCCACCCA TACGCTAGGA CTGAAAAGCG CTGCTGCCCG TCGTGAACTG GGCGTGCTGG CTGGTGAGCT GGCCCGTGGG AATTTCGGGG CACTGCGGGG AAGTGGTATC ACGCTCGCTA ACCGCGCCGG GTGGATCGAG CAACTGATGT CTCCGAAGGG CATGATGCTC GGCGGGCTGG CTGGCGGCGT GGCTGCTGCT GTTTACGGGC TGGGCAAGGC CTACTATGAA GGCGCTAAGG AAAGCGAGAC GTTCAATAAA CAGCTTATTC TGACCGGGAG TTATGCCGGA AAAACCACAG GCCAGCTTAA TGCGATGGCG AAGTCGCTCG CCGGAAATGG CGTCACGCAG CACGATGCTG CAGGCGTGCT GGCACAGGTG GTCGGTAGCG GAGCGTTTAC CGGGCAGGCA ATGGCAATGG TATCCCGTAC CGCGACCAGA ATGCAGGAAA ACGTGGGACA ATCAGTGGAT GAAACCATCC GCCAGTTTAA ACGCCTGCGG GATGATCCGG TGAATGCGGC GAAAGAACTG GACAGGACAC TGCATTTTCT GACAGCCACC CAGCTTGAAC AAATCAGGGT ACTGGGCGAG CAGGGAAGAG TGGCTGATGC CGCGAAAATT GCCATGTCCG CGTATTCGGA AGAAATGAAT AAGCGGATGG GGGACGTACA CGACAATCTG GGCTGGATTG AAAGAGCATG GAATGCTGTC GGTGATGCGG CGAAGTGGGC ATGGGATCGG ATGCTGGATA TCGGGCGGGA AGACACGCTC GATGAAAAGA TCGCGACACT GCAGGAAAAA ATCGCGCGCG ACAGAAAAAC GCCCTGGACG GTGTCTTCTT CCCAGACTGA ATACGATCAG CAGCAGCTGA ACGAACTTCA GGAACAGAAA CGCCAGAAGG ACCTGCTGGA TGCGAAGGCG CAGGCAGAGC GTAATTATCA GGAAACGCAG AAACGTCGGA ACGAGCAGAA CGCCGCGCTG AACCGGGATA ATGAAACTGA ATCCCTGCGG CACCAACGGG AGGTGGCGCG CATTACCGCC ATGCAGTATG CCGATGCTGC TGTACGCAAT GCCGCACTGG AGCGCGAAAA TGAACGTCAT AAAAAGGCGT TGTCACAACA GGCGAAAAAG CCAAAGACTT ACCACAACGA CGAGGCCAGG CGACTGCTTT TGCAGTACAG CCAGCAACAG GCGCAGACTG AAGGGCAGAT TACCGCCGCG AAGCTTTCCA CGACCGAAAA AATGACGGAA GCGCATAAGC AGCTTTTGTC ATTTCAGCAG CGCATCGCTG ATTTGTCCGG TAAAAAACTG ACGGCGGATG AACAAAGCGT ACTGGCACAT AAGGATGAAA TTGCGCTTGC GCTACAGAAG CTGGATATCT CACAACAGGA TTTGCAACAC CAGAATGCCC TTAATGAACT GAAGAAAAAG ACGCTCACAT TGACCAGCCA GCTCGCTGAC GAAGAATCCC GCGTCAGGCA ACAGCACGCA ATGGCGCTGG CCACAATGGG TATGGGCGAT CAGCAACGTG GCCGATACGA AGAGCGTCTG AAAATTCAGC AGCACTACCA GGAACAACTG GAGCAGCTTA AACGCGACAG CAAGGCAAAA GGGACATACG GTTCTGACGA ATATCGTCAG GCGGAGCAGG CGCTGAAGGG CAGTCTCGAT CGCCGGCTGG CTGAGTGGGC GGATTACAAT GCGAAAGTTG ACGCTGCGCA GGGAGACTGG ACGCTGGGGG CGTCGCGGGC GCTGGATAAC TTTCTGGCGC AGGGCGGCAA TGTGGCAGGC ATGACGGAGA ACGTTTTCAC AAACGCATTT AACGGCATGG CGGACAGTAT CGCGAATTTT GCCGTGACCG GAAAGGGCAG TTTCCGGAGC CTGACGGTCT CCATCCTGGC TGACCTTGCA AAAATGGAGG CACGTATTGC GGCTTCTAAA CTGTTGGGTT CAGTGCTGGC AATGTTCGGC TTTGGCACAT CGGCAGGCGG CAGTACACCA TCAGGGGCAT ACAGTTCTGC GGCGCTGTCG GTTATTCCGA ATGCGGACGG CGGCGTGTAC CGTTCGGCAG GACTCAGCCA GTACAGCGGC AGCATTGTTA ATCGCCCGAC ATTTTTTGCT TTTGCCAAAG GTGCCGGGGT GATGGGCGAG GCAGGACCGG AGGCAATATT ACCACTTCGT CGTGGTGCTG ACGGTAAGCT GGGTGTCGTG GCAGCCGGTT CAGGAGGGAT GGCGATGTTT GCGCCTGAGT ACAACATTGA AATCCACAAC GACGCCGGCA ACGGACAGAT TGGTCCGCAG GCATTACAGG CCGTATATAA CATTGGAAAA AAAGCCGCCA TTGATTTCTG GCAACAGCAG TCGCGTGACG GGGGTATTGC TGGAGGAGGG CGATAA
|
Protein sequence | MDQIANLVID LSIDSAEFRN EVPRIKKLLN DAAGDSERSA ARMQRFLDKQ TEATRRTSAS LEQVTASSTA YSSAVEKSAA ASTRLAADVD QTRQRVEALG RKLREEQAQS AAVAAAQDRT SAAFYRQIDS VKQLSGGLQE LQRIQAQVRQ AKGRGDISQG DYLALVSETA RKTRELTDAE ALATQKKAQF IRRLKEQTTV QGLSRTELLR VKAAELGVSS AADIYIRKLE RTGTATHTLG LKSAAARREL GVLAGELARG NFGALRGSGI TLANRAGWIE QLMSPKGMML GGLAGGVAAA VYGLGKAYYE GAKESETFNK QLILTGSYAG KTTGQLNAMA KSLAGNGVTQ HDAAGVLAQV VGSGAFTGQA MAMVSRTATR MQENVGQSVD ETIRQFKRLR DDPVNAAKEL DRTLHFLTAT QLEQIRVLGE QGRVADAAKI AMSAYSEEMN KRMGDVHDNL GWIERAWNAV GDAAKWAWDR MLDIGREDTL DEKIATLQEK IARDRKTPWT VSSSQTEYDQ QQLNELQEQK RQKDLLDAKA QAERNYQETQ KRRNEQNAAL NRDNETESLR HQREVARITA MQYADAAVRN AALERENERH KKALSQQAKK PKTYHNDEAR RLLLQYSQQQ AQTEGQITAA KLSTTEKMTE AHKQLLSFQQ RIADLSGKKL TADEQSVLAH KDEIALALQK LDISQQDLQH QNALNELKKK TLTLTSQLAD EESRVRQQHA MALATMGMGD QQRGRYEERL KIQQHYQEQL EQLKRDSKAK GTYGSDEYRQ AEQALKGSLD RRLAEWADYN AKVDAAQGDW TLGASRALDN FLAQGGNVAG MTENVFTNAF NGMADSIANF AVTGKGSFRS LTVSILADLA KMEARIAASK LLGSVLAMFG FGTSAGGSTP SGAYSSAALS VIPNADGGVY RSAGLSQYSG SIVNRPTFFA FAKGAGVMGE AGPEAILPLR RGADGKLGVV AAGSGGMAMF APEYNIEIHN DAGNGQIGPQ ALQAVYNIGK KAAIDFWQQQ SRDGGIAGGG R
|
| |