Gene Information |
Locus tag | SeSA_A0703 |
Symbol | |
ID | 6518213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 676750 |
End bp | 679845 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642745844 |
Product | gifsy-1 prophage VmtH |
Protein accession | YP_002113667 |
Protein GI | 194735514 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
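The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene span includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the one stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length_bp(676750, 679845)
print(length)                      # 3096
print(protein_length_aa(length))   # 1031
```

Both results match the Gene Length (3096 bp) and Protein Length (1031 aa) fields of this record.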
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00431467 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACCAGA TAGCGAACCT GGTCATTGAT TTAAGTATCG ACAGCGCAGA GTTCCGAAAC GAAGTTCCGC GCATTAAAAA ATTGCTGAAC GATGCGGCTG GTGACTCAGA ACGTTCAGCG GCCCGGATGC AGCGTTTTCT GGATAAGCAG ACGGAGGCGA CGCGCCGGAC GTCCGCCGGT CTGGAGCAGG TGACTGCCAG CAGTACCGCG TACAGTTCCG CTGTGGAGAA AAGCGCAGCG GCCAGTACGC GTCTGGCGGC GGATGTGGAT CAGACGCGAC AGCGGGTGGA GGCACTGGGA AGGAAACTGC GTGAGGAACA GGCGCAGTCA GCGGCTGTGG CGGCAGCACA GGACAGGACA AGTGCTGCTT TTTACCGCCA GATTGACAGT GTAAAACAGT TAAGCGGTGG TCTGCAGGAG CTGCAGCGTA TCCAGGCGCA GGTACGACAG GCGAAAGGAC GCGGAGATAT CTCACAGGGC GATTATCTGG CGCTGGTGTC TGAAACCGCC AGGAAGACCC GTGAGCTTAC CGATGCCGAA GCGCTGGCCA CGCAGAAAAA AGCACAGTTT ATACGCCGCC TGAAAGAGCA GACGACGGTA CAGGGCCTCT CCCGTACTGA GCTGCTGCGG GTGAAGGCGG CTGAACTGGG TGTCAGCAGC GCCGCAGATA TTTATATCCG TAAACTGGAG CGTACCGGAA CTGCCACCCA TACGCTAGGA CTGAAAAGCG CCGCTGCCCG TCGTGAACTG GGCGTGCTGG CTGGTGAGCT GGCCCGTGGG AATTTCGGGG CACTGCGGGG AAGTGGTATC ACGCTCGCTA ACCGCGCCGG GTGGATCGAG CAACTGATGT CTCCGAAGGG CATGATGCTC GGCGGGCTGG CTGGCGGCGT GGCTGCTGCT GTTTACGGGC TGGGTAAGGC CTACTATGAA GGAGCTAAAG AAAGCGAGGC GTTCAATAAA CAGCTTATTC TGACCGGGAG TTATGCCGGA AAAACCACAG GCCAGCTTAA TGCGATGGCG AAGTCGCTCG CCGGAAATGG CGTCACGCAG CACGACGCTG CAGGCGTGCT GGCACAGGTG GTCGGTAGCG GAGCGTTTAC CGGGCAGGCA GTGGCAATGG TATCCCGTAC CGCGACCAGA ATGCAGGAAA ACGTTGGACA ATCAGTGGAT GAAACCATCC GCCAGTTTAA ACGCCTGCGG GATGATCCGG TGAATGCGGC GAAAGAACTG GACAGGACAC TGCATTTTCT GACCGCCACC CAGCTTGAAC AAATCAGGGT ACTGGGCGAG CAGGGAAGAG TGGCTGATGC CGCGAAAATT GCCATGTCCG CGTATTCGGA AGAAATGAAT AAGCGGATGG GGGACGTACA CGACAATCTG GGCTGGATTG AAAGAGCATG GAATGCTGTC GGTGATGCGG CGAAGTGGGC ATGGGATCGG ATGCTGGATA TCGGGCGGGA AGACACGCTC GATGAAAAGA TCGCGACACT GCAGGAAAAA ATCGCGCGCG GCAGAAAAAC GCCCTGGACG GTGTCTTCCT CCCAGACTGA ATACGATCAG CAGCAGCTGA ACGAACTTCA GGAACAGAAA CGCCAGAAGG ACCTGCTGGA TGCGAAGGCG CAGGCAGAGC GTAATTATCA GGAAACGCAG AAACGTCGGA ACGAGCAGAA CGCCGCGCTG AACCGGGATA ATGAAACTGA ATCCCTGCGG CACCAACGGG AGGTGGCGCG CATTACCGCC ATGCAGTATG CCGATGCTGC TGTACGCAAT GCCGCACTGG AGCGTGAAAA TGAACGTCAT 
AAAAAGGCGT TGTCACAACA GGCGAAAAAG CCAAAGACTT ACCACAACGA CGAGGCCAGG CGACTGCTTT TGCAGTACAG CCAGCAACAG GCGCAGACTG AAGGGCAGCT TGCCGCCGCG AAGCTTTCCA CGACCGAAAA AATGACGGAA GCGCATAAGC AGCTTTTGTC ATTTCAGCAG CGCATCGCTG ATTTGTCCGG TAAAAAACTG ACGGCGGATG AACAAAGCGT ACTGGCACAT AAGGATGAAA TCGCGCTTGC GCTACAGAAG CTGGATATCT CACAACAGGA TTTGCAACAC CAGAATGCCC TTAATGAACT GAAGAAAAAG ACGCTCACAT TGACCAGCCA GCTCGCTGAC GAAGAATCCC GCGTCAGGCA ACAGCACGCA ATGGCGCTGG CCACAATGGG TATGGGCGAT CAGCAACGTG GCCGATACGA AGAGCGTCTG AAAATTCAGC AGCACTACCA GGAACAACTG GAGCAGCTTA AACGCGACAG CAAGGCAAAA GGGACATACG GTTCTGACGA ATATCGTCAG GCGGAGCAGG CGCTGAAGGG CAGTCTCGAT CGCCGGCTGG CTGAGTGGGC GGATTACAAT GCGAAAGTTG ACGCTGCGCA GGGAGACTGG ACGCTGGGGG CGTCGCGGGC GCTGGATAAC TTTCTGGCGC AGGGCGGCAA TGTGGCAGGC ATGACGGAGA ACGTTTTCAC AAACGCATTT AACGGCATGG CGGACAGTAT CGCGAATTTT GCCGTGACCG GAAAGGGCAG TTTCCGGAGC CTGACGGTCT CCATCCTGGC TGACCTTGCA AAAATGGAGG CACGTATTGC GGCTTCTAAA CTGTTGGGTT CAGTGCTGGC AATGTTCGGC TTTGGCACAT CGGCAGGCGG CAGTACACCA TCAGGGGCAT ACAGTTCTGC GGCGCTGTCG GTTATTCCGA ATGCGGACGG CGGCGTGTAC CGTTCGGCAG GACTCAGCCA GTACAGCGGC AGCATTGTTA ATCGCCCGAC ATTTTTTGCT TTTGCCAAAG GTGCCGGGGT GATGGGCGAG GCAGGACCGG AGGCAATATT ACCACTTCGT CGTGGTGCTG ACGGTAAGCT GGGTGTCGTG GCAGCCGGTT CAGGAGGGAT GGCGATGTTT GCGCCTGAGT ACAACATTGA AATCCACAAC GACGCCGGCA ACGGACAGAT TGGTCCGCAG GCATTACAGG CCGTATATAA CATTGGAAAA AAAGCCGCCA TTGATTTCTG GCAACAGCAG TCGCGTGACG GGGGTATTGC CGGAGGAGGG CGATAA
|
Protein sequence | MDQIANLVID LSIDSAEFRN EVPRIKKLLN DAAGDSERSA ARMQRFLDKQ TEATRRTSAG LEQVTASSTA YSSAVEKSAA ASTRLAADVD QTRQRVEALG RKLREEQAQS AAVAAAQDRT SAAFYRQIDS VKQLSGGLQE LQRIQAQVRQ AKGRGDISQG DYLALVSETA RKTRELTDAE ALATQKKAQF IRRLKEQTTV QGLSRTELLR VKAAELGVSS AADIYIRKLE RTGTATHTLG LKSAAARREL GVLAGELARG NFGALRGSGI TLANRAGWIE QLMSPKGMML GGLAGGVAAA VYGLGKAYYE GAKESEAFNK QLILTGSYAG KTTGQLNAMA KSLAGNGVTQ HDAAGVLAQV VGSGAFTGQA VAMVSRTATR MQENVGQSVD ETIRQFKRLR DDPVNAAKEL DRTLHFLTAT QLEQIRVLGE QGRVADAAKI AMSAYSEEMN KRMGDVHDNL GWIERAWNAV GDAAKWAWDR MLDIGREDTL DEKIATLQEK IARGRKTPWT VSSSQTEYDQ QQLNELQEQK RQKDLLDAKA QAERNYQETQ KRRNEQNAAL NRDNETESLR HQREVARITA MQYADAAVRN AALERENERH KKALSQQAKK PKTYHNDEAR RLLLQYSQQQ AQTEGQLAAA KLSTTEKMTE AHKQLLSFQQ RIADLSGKKL TADEQSVLAH KDEIALALQK LDISQQDLQH QNALNELKKK TLTLTSQLAD EESRVRQQHA MALATMGMGD QQRGRYEERL KIQQHYQEQL EQLKRDSKAK GTYGSDEYRQ AEQALKGSLD RRLAEWADYN AKVDAAQGDW TLGASRALDN FLAQGGNVAG MTENVFTNAF NGMADSIANF AVTGKGSFRS LTVSILADLA KMEARIAASK LLGSVLAMFG FGTSAGGSTP SGAYSSAALS VIPNADGGVY RSAGLSQYSG SIVNRPTFFA FAKGAGVMGE AGPEAILPLR RGADGKLGVV AAGSGGMAMF APEYNIEIHN DAGNGQIGPQ ALQAVYNIGK KAAIDFWQQQ SRDGGIAGGG R
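The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before values such as GC content can be recomputed from it. The following is a minimal sketch of that step; the helper names are hypothetical, and the stop-codon set is the standard one used by translation table 11 (TAA, TAG, TGA). Only short fragments copied from the record are used here, so the printed values describe those fragments, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks and the final block of the gene sequence in this record.
head = clean_sequence("ATGGACCAGA TAGCGAACCT")
tail = clean_sequence("CGATAA")

print(len(head))             # 20
print(gc_fraction(head))     # 0.5
print(ends_with_stop(tail))  # True
```

Applying `clean_sequence` and `gc_fraction` to the complete Gene sequence field would allow the reported GC content of 57% to be checked against the sequence itself.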