Gene SeSA_A0703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A0703 
Symbol 
ID6518213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp676750 
End bp679845 
Gene Length3096 bp 
Protein Length1031 aa 
Translation table11 
GC content57% 
IMG OID642745844 
Productgifsy-1 prophage VmtH 
Protein accessionYP_002113667 
Protein GI194735514 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00431467 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACCAGA TAGCGAACCT GGTCATTGAT TTAAGTATCG ACAGCGCAGA GTTCCGAAAC 
GAAGTTCCGC GCATTAAAAA ATTGCTGAAC GATGCGGCTG GTGACTCAGA ACGTTCAGCG
GCCCGGATGC AGCGTTTTCT GGATAAGCAG ACGGAGGCGA CGCGCCGGAC GTCCGCCGGT
CTGGAGCAGG TGACTGCCAG CAGTACCGCG TACAGTTCCG CTGTGGAGAA AAGCGCAGCG
GCCAGTACGC GTCTGGCGGC GGATGTGGAT CAGACGCGAC AGCGGGTGGA GGCACTGGGA
AGGAAACTGC GTGAGGAACA GGCGCAGTCA GCGGCTGTGG CGGCAGCACA GGACAGGACA
AGTGCTGCTT TTTACCGCCA GATTGACAGT GTAAAACAGT TAAGCGGTGG TCTGCAGGAG
CTGCAGCGTA TCCAGGCGCA GGTACGACAG GCGAAAGGAC GCGGAGATAT CTCACAGGGC
GATTATCTGG CGCTGGTGTC TGAAACCGCC AGGAAGACCC GTGAGCTTAC CGATGCCGAA
GCGCTGGCCA CGCAGAAAAA AGCACAGTTT ATACGCCGCC TGAAAGAGCA GACGACGGTA
CAGGGCCTCT CCCGTACTGA GCTGCTGCGG GTGAAGGCGG CTGAACTGGG TGTCAGCAGC
GCCGCAGATA TTTATATCCG TAAACTGGAG CGTACCGGAA CTGCCACCCA TACGCTAGGA
CTGAAAAGCG CCGCTGCCCG TCGTGAACTG GGCGTGCTGG CTGGTGAGCT GGCCCGTGGG
AATTTCGGGG CACTGCGGGG AAGTGGTATC ACGCTCGCTA ACCGCGCCGG GTGGATCGAG
CAACTGATGT CTCCGAAGGG CATGATGCTC GGCGGGCTGG CTGGCGGCGT GGCTGCTGCT
GTTTACGGGC TGGGTAAGGC CTACTATGAA GGAGCTAAAG AAAGCGAGGC GTTCAATAAA
CAGCTTATTC TGACCGGGAG TTATGCCGGA AAAACCACAG GCCAGCTTAA TGCGATGGCG
AAGTCGCTCG CCGGAAATGG CGTCACGCAG CACGACGCTG CAGGCGTGCT GGCACAGGTG
GTCGGTAGCG GAGCGTTTAC CGGGCAGGCA GTGGCAATGG TATCCCGTAC CGCGACCAGA
ATGCAGGAAA ACGTTGGACA ATCAGTGGAT GAAACCATCC GCCAGTTTAA ACGCCTGCGG
GATGATCCGG TGAATGCGGC GAAAGAACTG GACAGGACAC TGCATTTTCT GACCGCCACC
CAGCTTGAAC AAATCAGGGT ACTGGGCGAG CAGGGAAGAG TGGCTGATGC CGCGAAAATT
GCCATGTCCG CGTATTCGGA AGAAATGAAT AAGCGGATGG GGGACGTACA CGACAATCTG
GGCTGGATTG AAAGAGCATG GAATGCTGTC GGTGATGCGG CGAAGTGGGC ATGGGATCGG
ATGCTGGATA TCGGGCGGGA AGACACGCTC GATGAAAAGA TCGCGACACT GCAGGAAAAA
ATCGCGCGCG GCAGAAAAAC GCCCTGGACG GTGTCTTCCT CCCAGACTGA ATACGATCAG
CAGCAGCTGA ACGAACTTCA GGAACAGAAA CGCCAGAAGG ACCTGCTGGA TGCGAAGGCG
CAGGCAGAGC GTAATTATCA GGAAACGCAG AAACGTCGGA ACGAGCAGAA CGCCGCGCTG
AACCGGGATA ATGAAACTGA ATCCCTGCGG CACCAACGGG AGGTGGCGCG CATTACCGCC
ATGCAGTATG CCGATGCTGC TGTACGCAAT GCCGCACTGG AGCGTGAAAA TGAACGTCAT
AAAAAGGCGT TGTCACAACA GGCGAAAAAG CCAAAGACTT ACCACAACGA CGAGGCCAGG
CGACTGCTTT TGCAGTACAG CCAGCAACAG GCGCAGACTG AAGGGCAGCT TGCCGCCGCG
AAGCTTTCCA CGACCGAAAA AATGACGGAA GCGCATAAGC AGCTTTTGTC ATTTCAGCAG
CGCATCGCTG ATTTGTCCGG TAAAAAACTG ACGGCGGATG AACAAAGCGT ACTGGCACAT
AAGGATGAAA TCGCGCTTGC GCTACAGAAG CTGGATATCT CACAACAGGA TTTGCAACAC
CAGAATGCCC TTAATGAACT GAAGAAAAAG ACGCTCACAT TGACCAGCCA GCTCGCTGAC
GAAGAATCCC GCGTCAGGCA ACAGCACGCA ATGGCGCTGG CCACAATGGG TATGGGCGAT
CAGCAACGTG GCCGATACGA AGAGCGTCTG AAAATTCAGC AGCACTACCA GGAACAACTG
GAGCAGCTTA AACGCGACAG CAAGGCAAAA GGGACATACG GTTCTGACGA ATATCGTCAG
GCGGAGCAGG CGCTGAAGGG CAGTCTCGAT CGCCGGCTGG CTGAGTGGGC GGATTACAAT
GCGAAAGTTG ACGCTGCGCA GGGAGACTGG ACGCTGGGGG CGTCGCGGGC GCTGGATAAC
TTTCTGGCGC AGGGCGGCAA TGTGGCAGGC ATGACGGAGA ACGTTTTCAC AAACGCATTT
AACGGCATGG CGGACAGTAT CGCGAATTTT GCCGTGACCG GAAAGGGCAG TTTCCGGAGC
CTGACGGTCT CCATCCTGGC TGACCTTGCA AAAATGGAGG CACGTATTGC GGCTTCTAAA
CTGTTGGGTT CAGTGCTGGC AATGTTCGGC TTTGGCACAT CGGCAGGCGG CAGTACACCA
TCAGGGGCAT ACAGTTCTGC GGCGCTGTCG GTTATTCCGA ATGCGGACGG CGGCGTGTAC
CGTTCGGCAG GACTCAGCCA GTACAGCGGC AGCATTGTTA ATCGCCCGAC ATTTTTTGCT
TTTGCCAAAG GTGCCGGGGT GATGGGCGAG GCAGGACCGG AGGCAATATT ACCACTTCGT
CGTGGTGCTG ACGGTAAGCT GGGTGTCGTG GCAGCCGGTT CAGGAGGGAT GGCGATGTTT
GCGCCTGAGT ACAACATTGA AATCCACAAC GACGCCGGCA ACGGACAGAT TGGTCCGCAG
GCATTACAGG CCGTATATAA CATTGGAAAA AAAGCCGCCA TTGATTTCTG GCAACAGCAG
TCGCGTGACG GGGGTATTGC CGGAGGAGGG CGATAA
 
Protein sequence
MDQIANLVID LSIDSAEFRN EVPRIKKLLN DAAGDSERSA ARMQRFLDKQ TEATRRTSAG 
LEQVTASSTA YSSAVEKSAA ASTRLAADVD QTRQRVEALG RKLREEQAQS AAVAAAQDRT
SAAFYRQIDS VKQLSGGLQE LQRIQAQVRQ AKGRGDISQG DYLALVSETA RKTRELTDAE
ALATQKKAQF IRRLKEQTTV QGLSRTELLR VKAAELGVSS AADIYIRKLE RTGTATHTLG
LKSAAARREL GVLAGELARG NFGALRGSGI TLANRAGWIE QLMSPKGMML GGLAGGVAAA
VYGLGKAYYE GAKESEAFNK QLILTGSYAG KTTGQLNAMA KSLAGNGVTQ HDAAGVLAQV
VGSGAFTGQA VAMVSRTATR MQENVGQSVD ETIRQFKRLR DDPVNAAKEL DRTLHFLTAT
QLEQIRVLGE QGRVADAAKI AMSAYSEEMN KRMGDVHDNL GWIERAWNAV GDAAKWAWDR
MLDIGREDTL DEKIATLQEK IARGRKTPWT VSSSQTEYDQ QQLNELQEQK RQKDLLDAKA
QAERNYQETQ KRRNEQNAAL NRDNETESLR HQREVARITA MQYADAAVRN AALERENERH
KKALSQQAKK PKTYHNDEAR RLLLQYSQQQ AQTEGQLAAA KLSTTEKMTE AHKQLLSFQQ
RIADLSGKKL TADEQSVLAH KDEIALALQK LDISQQDLQH QNALNELKKK TLTLTSQLAD
EESRVRQQHA MALATMGMGD QQRGRYEERL KIQQHYQEQL EQLKRDSKAK GTYGSDEYRQ
AEQALKGSLD RRLAEWADYN AKVDAAQGDW TLGASRALDN FLAQGGNVAG MTENVFTNAF
NGMADSIANF AVTGKGSFRS LTVSILADLA KMEARIAASK LLGSVLAMFG FGTSAGGSTP
SGAYSSAALS VIPNADGGVY RSAGLSQYSG SIVNRPTFFA FAKGAGVMGE AGPEAILPLR
RGADGKLGVV AAGSGGMAMF APEYNIEIHN DAGNGQIGPQ ALQAVYNIGK KAAIDFWQQQ
SRDGGIAGGG R