Gene SNSL254_A2801 details

Gene Information

Locus tag: SNSL254_A2801
Symbol:
ID: 6486507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2743218
End bp: 2746313
Gene Length: 3096 bp
Protein Length: 1031 aa
Translation table: 11
GC content: 56%
IMG OID: 642738124
Product: gifsy-1 prophage VmtH
Protein accession: YP_002041858
Protein GI: 194443589
COG category: [S] Function unknown
COG ID: [COG5281] Phage-related minor tail protein
TIGRFAM ID: [TIGR01541] phage tail tape measure protein, lambda family
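
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2743218
end_bp = 2746313

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon, which is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 3096 1031
```

Both results agree with the Gene Length (3096 bp) and Protein Length (1031 aa) fields.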


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value: 0.617877
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCAGA TAGCGAACCT GGTCATTGAT TTAAGTATCG ACAGCGCAGA GTTCCGAAAC 
GAAGTTCCGC GCATTAAAAA ATTGCTGAAC GATGCGGCTG GTGACTCAGA ACGTTCAGCG
GCCCGGATGC AGCGTTTTCT GGATAAGCAG ACGGAGGCGA CGCGCCGGAC GTCCGCCAGT
CTGGAGCAAG TGACTGCCAG CAGTACCGCG TACAGTTCCG CTGTGGAGAA AAGCGCAGCG
GCCAGTACGC GTCTGGCGGC GGATGTGGAT CAGACGCGAC AGCGGGTGGA GGCACTGGGA
AGGAAACTGC GTGAGGAACA GGCGCAGTCA GCGGCTGTGG CGGCAGCACA GGACAGGACA
AGTGCTGCTT TTTACCGCCA GATTGACAGT GTAAAACAGT TAAGCGGTGG TCTGCAGGAG
CTGCAGCGTA TCCAGGCGCA GGTACGACAG GCGAAAGGAC GCGGAGATAT CTCACAGGGC
GATTATCTGG CGCTGGTGTC TGAAACCGCC AGGAAGACCC GTGAGCTTAC CGATGCCGAA
GCGCTGGCCA CGCAGAAAAA AGCACAGTTT ATACGCCGCC TGAAAGAGCA GACGACGGTA
CAGGGCCTCT CCCGTACCGA GCTGCTGCGG GTGAAGGCGG CTGAACTGGG TGTCAGTAGC
GCCGCAGATA TTTATATCCG TAAACTGGAG CGTACCGGAA CTGCCACCCA TACGCTAGGA
CTGAAAAGCG CTGCTGCCCG TCGTGAACTG GGCGTGCTGG CTGGTGAGCT GGCCCGTGGG
AATTTCGGGG CACTGCGGGG AAGTGGTATC ACGCTCGCTA ACCGCGCCGG GTGGATCGAG
CAACTGATGT CTCCGAAGGG CATGATGCTC GGCGGGCTGG CTGGCGGCGT GGCTGCTGCT
GTTTACGGGC TGGGCAAGGC CTACTATGAA GGCGCTAAGG AAAGCGAGAC GTTCAATAAA
CAGCTTATTC TGACCGGGAG TTATGCCGGA AAAACCACAG GCCAGCTTAA TGCGATGGCG
AAGTCGCTCG CCGGAAATGG CGTCACGCAG CACGATGCTG CAGGCGTGCT GGCACAGGTG
GTCGGTAGCG GAGCGTTTAC CGGGCAGGCA ATGGCAATGG TATCCCGTAC CGCGACCAGA
ATGCAGGAAA ACGTGGGACA ATCAGTGGAT GAAACCATCC GCCAGTTTAA ACGCCTGCGG
GATGATCCGG TGAATGCGGC GAAAGAACTG GACAGGACAC TGCATTTTCT GACAGCCACC
CAGCTTGAAC AAATCAGGGT ACTGGGCGAG CAGGGAAGAG TGGCTGATGC CGCGAAAATT
GCCATGTCCG CGTATTCGGA AGAAATGAAT AAGCGGATGG GGGACGTACA CGACAATCTG
GGCTGGATTG AAAGAGCATG GAATGCTGTC GGTGATGCGG CGAAGTGGGC ATGGGATCGG
ATGCTGGATA TCGGGCGGGA AGACACGCTC GATGAAAAGA TCGCGACACT GCAGGAAAAA
ATCGCGCGCG ACAGAAAAAC GCCCTGGACG GTGTCTTCTT CCCAGACTGA ATACGATCAG
CAGCAGCTGA ACGAACTTCA GGAACAGAAA CGCCAGAAGG ACCTGCTGGA TGCGAAGGCG
CAGGCAGAGC GTAATTATCA GGAAACGCAG AAACGTCGGA ACGAGCAGAA CGCCGCGCTG
AACCGGGATA ATGAAACTGA ATCCCTGCGG CACCAACGGG AGGTGGCGCG CATTACCGCC
ATGCAGTATG CCGATGCTGC TGTACGCAAT GCCGCACTGG AGCGCGAAAA TGAACGTCAT
AAAAAGGCGT TGTCACAACA GGCGAAAAAG CCAAAGACTT ACCACAACGA CGAGGCCAGG
CGACTGCTTT TGCAGTACAG CCAGCAACAG GCGCAGACTG AAGGGCAGAT TACCGCCGCG
AAGCTTTCCA CGACCGAAAA AATGACGGAA GCGCATAAGC AGCTTTTGTC ATTTCAGCAG
CGCATCGCTG ATTTGTCCGG TAAAAAACTG ACGGCGGATG AACAAAGCGT ACTGGCACAT
AAGGATGAAA TTGCGCTTGC GCTACAGAAG CTGGATATCT CACAACAGGA TTTGCAACAC
CAGAATGCCC TTAATGAACT GAAGAAAAAG ACGCTCACAT TGACCAGCCA GCTCGCTGAC
GAAGAATCCC GCGTCAGGCA ACAGCACGCA ATGGCGCTGG CCACAATGGG TATGGGCGAT
CAGCAACGTG GCCGATACGA AGAGCGTCTG AAAATTCAGC AGCACTACCA GGAACAACTG
GAGCAGCTTA AACGCGACAG CAAGGCAAAA GGGACATACG GTTCTGACGA ATATCGTCAG
GCGGAGCAGG CGCTGAAGGG CAGTCTCGAT CGCCGGCTGG CTGAGTGGGC GGATTACAAT
GCGAAAGTTG ACGCTGCGCA GGGAGACTGG ACGCTGGGGG CGTCGCGGGC GCTGGATAAC
TTTCTGGCGC AGGGCGGCAA TGTGGCAGGC ATGACGGAGA ACGTTTTCAC AAACGCATTT
AACGGCATGG CGGACAGTAT CGCGAATTTT GCCGTGACCG GAAAGGGCAG TTTCCGGAGC
CTGACGGTCT CCATCCTGGC TGACCTTGCA AAAATGGAGG CACGTATTGC GGCTTCTAAA
CTGTTGGGTT CAGTGCTGGC AATGTTCGGC TTTGGCACAT CGGCAGGCGG CAGTACACCA
TCAGGGGCAT ACAGTTCTGC GGCGCTGTCG GTTATTCCGA ATGCGGACGG CGGCGTGTAC
CGTTCGGCAG GACTCAGCCA GTACAGCGGC AGCATTGTTA ATCGCCCGAC ATTTTTTGCT
TTTGCCAAAG GTGCCGGGGT GATGGGCGAG GCAGGACCGG AGGCAATATT ACCACTTCGT
CGTGGTGCTG ACGGTAAGCT GGGTGTCGTG GCAGCCGGTT CAGGAGGGAT GGCGATGTTT
GCGCCTGAGT ACAACATTGA AATCCACAAC GACGCCGGCA ACGGACAGAT TGGTCCGCAG
GCATTACAGG CCGTATATAA CATTGGAAAA AAAGCCGCCA TTGATTTCTG GCAACAGCAG
TCGCGTGACG GGGGTATTGC TGGAGGAGGG CGATAA
 
Protein sequence
MDQIANLVID LSIDSAEFRN EVPRIKKLLN DAAGDSERSA ARMQRFLDKQ TEATRRTSAS 
LEQVTASSTA YSSAVEKSAA ASTRLAADVD QTRQRVEALG RKLREEQAQS AAVAAAQDRT
SAAFYRQIDS VKQLSGGLQE LQRIQAQVRQ AKGRGDISQG DYLALVSETA RKTRELTDAE
ALATQKKAQF IRRLKEQTTV QGLSRTELLR VKAAELGVSS AADIYIRKLE RTGTATHTLG
LKSAAARREL GVLAGELARG NFGALRGSGI TLANRAGWIE QLMSPKGMML GGLAGGVAAA
VYGLGKAYYE GAKESETFNK QLILTGSYAG KTTGQLNAMA KSLAGNGVTQ HDAAGVLAQV
VGSGAFTGQA MAMVSRTATR MQENVGQSVD ETIRQFKRLR DDPVNAAKEL DRTLHFLTAT
QLEQIRVLGE QGRVADAAKI AMSAYSEEMN KRMGDVHDNL GWIERAWNAV GDAAKWAWDR
MLDIGREDTL DEKIATLQEK IARDRKTPWT VSSSQTEYDQ QQLNELQEQK RQKDLLDAKA
QAERNYQETQ KRRNEQNAAL NRDNETESLR HQREVARITA MQYADAAVRN AALERENERH
KKALSQQAKK PKTYHNDEAR RLLLQYSQQQ AQTEGQITAA KLSTTEKMTE AHKQLLSFQQ
RIADLSGKKL TADEQSVLAH KDEIALALQK LDISQQDLQH QNALNELKKK TLTLTSQLAD
EESRVRQQHA MALATMGMGD QQRGRYEERL KIQQHYQEQL EQLKRDSKAK GTYGSDEYRQ
AEQALKGSLD RRLAEWADYN AKVDAAQGDW TLGASRALDN FLAQGGNVAG MTENVFTNAF
NGMADSIANF AVTGKGSFRS LTVSILADLA KMEARIAASK LLGSVLAMFG FGTSAGGSTP
SGAYSSAALS VIPNADGGVY RSAGLSQYSG SIVNRPTFFA FAKGAGVMGE AGPEAILPLR
RGADGKLGVV AAGSGGMAMF APEYNIEIHN DAGNGQIGPQ ALQAVYNIGK KAAIDFWQQQ
SRDGGIAGGG R
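
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 bases by hand. This is a minimal sketch, not the pipeline that produced the record: it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ only in which codons may act as starts). The helper name `translate` and the constants `BASES` and `AMINO_ACIDS` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and newlines."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)

# First line of the gene sequence, copied from this record.
first_line = "ATGGACCAGA TAGCGAACCT GGTCATTGAT TTAAGTATCG ACAGCGCAGA GTTCCGAAAC"
print(translate(first_line))  # prints: MDQIANLVIDLSIDSAEFRN

# Last six bases of the gene sequence: the final arginine and the stop codon.
print(translate("CGATAA"))  # prints: R*
```

The first output matches the first 20 residues of the protein sequence, and the second shows that the CDS ends with the terminal R followed by a TAA stop codon.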