Gene SNSL254_A4363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4363 
Symbol 
ID6484199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4240381 
End bp4241418 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content56% 
IMG OID642739605 
Productphage portal protein pbsx family 
Protein accessionYP_002043299 
Protein GI194444567 
COG category[R] General function prediction only 
COG ID[COG5518] Bacteriophage capsid portal protein 
TIGRFAM ID[TIGR01540] phage portal protein, PBSX family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0000000000346537 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCAAGA AACGCAACAA GCGCCAGCAG CCGCCGCGCA CCCAAAACCA CACCGCCGCA 
CCAGCGCAAA GCATGGAGGC ATTCACCTTT GGTGAGCCAA CGCCGGTACT CGACCGCCGC
GATATTCTCG ATTATGTCGA GTGTATCAAC AACGGCCAGT GGTACGAGCC GCCGGTGAGC
TTCTCCGGGC TGGCGAAAAG TATGCGCGCC GCCGTGCACC ACAGCTCACC GATTTACGTA
AAGCGTAATA TTCTGGTGTC GACCTACATC CCGCACCCGT TGTTATCCCG TCAGGACTTC
ACCCGGTTTG CGCTCGACTA TCTGGTGTTT GGCAATGCTT TTATCGAAGA GCGTCGCAGC
CTGACCGGCA AGCCGTTAAA ACTGGAAACC TCACCGGCGA AATACACCCG CCGTGGCATC
GAGGAGGACG TGTACTGGTA TATTCAGTCC TACACGCAGC CGCACCAGTT CGCGCCCGGC
TCCGTCTTCC ACCTGCTCGA GCCCGATATT AATCAGGAGC TTTACGGGAT GCCGGAATAC
CTGAGCGCAC TCAATTCAGC CTGGCTGAAT GAATCAGCGA CCCTGTTCCG TCGCAAGTAT
TACCAGAACG GCGCGCATGC GGGTTACATC ATGTATGTGA CCGACGCCGC GCAAAGCAGC
ACCGACGTCG AGGCACTGCG AAAGGCGATG CGCGACTCGA AAGGGCTCGG CAATTTTAAG
AACCTGTTTT TCTACGCCCC TAATGGTAAA GCAGACGGGA TTAAAATTGT GCCACTGAGC
GAAGTCGCCA CGAAGGATGA TTTTTTTAAT ATCAAGAAAG TCAGCGCCGC TGACCTGCTC
GACGCGCACC GCATCCCATT CCAGCTTATG GGCGGTAAGC CCGATAACGT CGGCTCAGTG
GGTGACGTTG AGAAGGTGGC AAAGGTCTTT GTACGTAACG AACTGACCCC GCTACAGGCG
CGGTTTATGG AGTTGAACGA GTGGGCGGGT GAAGAGATTA TCCGCTTCGA AAAATATAGC
CTCGGCGACG ACGAGTAA
 
Protein sequence
MSKKRNKRQQ PPRTQNHTAA PAQSMEAFTF GEPTPVLDRR DILDYVECIN NGQWYEPPVS 
FSGLAKSMRA AVHHSSPIYV KRNILVSTYI PHPLLSRQDF TRFALDYLVF GNAFIEERRS
LTGKPLKLET SPAKYTRRGI EEDVYWYIQS YTQPHQFAPG SVFHLLEPDI NQELYGMPEY
LSALNSAWLN ESATLFRRKY YQNGAHAGYI MYVTDAAQSS TDVEALRKAM RDSKGLGNFK
NLFFYAPNGK ADGIKIVPLS EVATKDDFFN IKKVSAADLL DAHRIPFQLM GGKPDNVGSV
GDVEKVAKVF VRNELTPLQA RFMELNEWAG EEIIRFEKYS LGDDE