Gene SNSL254_A2938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2938 
Symbol 
ID6483359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2869316 
End bp2870377 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content49% 
IMG OID642738255 
Productphage portal protein pbsx family 
Protein accessionYP_002041984 
Protein GI194445286 
COG category[R] General function prediction only 
COG ID[COG5518] Bacteriophage capsid portal protein 
TIGRFAM ID[TIGR01540] phage portal protein, PBSX family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value3.19492e-28 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGTAAAA AGAAACACTT CATTAAGCGC GACCAGCGCG GCGATAAGTC AAAAAAAATG 
AGCATCATTA CGTTCGGCAA ACCGGAACCG GTTCTGACCA CTGGCACCGA CTACCGGGAT
ATCTGGTACG ACAATGCCGC CGATCATTTT ACTCAGCCAA TTGACCGGCT GGCACTGGCA
CAACTGATTA ACCTTAACGG TCAACATGGC GGTATCATCC ACGCCCGTAA AAACATGATT
GTGTCTGATT ATCTGTCTGG CGGCCTGACT TACGACCAAC TGGAAGCCGC AGCTTTTGAC
TACATCACAT TTGGGGATAT TGCGCTTGGA AAAATTCGTA ACGGATGGGG AGATGTGATC
GGACTGGAAC CCTTACCCGG CCTCTATATC CGACGCAGGA AAGACAGGAA CAACGCAACT
GATCAACCTG GTGATTACGT GGTGTTACAG GAAGGCGAAC CGCAGATATG GCCTGAAGAA
GATATTATCT TCATCAAAAT GTATGATCCG CAACAGCATA TTTACGGACT GCCGGACTAC
ATCGGCGGCG TACATTCTGC ATTACTCAAC AGTGAAGCGG TCATTTTCCG TCGCCGTTAC
TACCACAATG GCGCCCACAC TGGCGGCATT CTCTACACGC GCGATCCCAG CATGACGGAT
GAAATGGAAG AGGAAATTGA ACAGCAGCTG CGTGACAGCA AAGGGATCGG CAACTTCTCC
ACCATCCTGG TAAACATTCC CGGTGGAGAC GGTGACGCCA TCAAATTCAT TGAAATGGGG
GATATTTCCG CTAAGGATGA ATTTGCCAAC ATCAAAAATA TCAGCGCCCA GGATATTCTG
AACGCGCACC GTTTTCCTGC CGGGCTTGCC GGCATTGTCC CGCAAAATAC TGCCGGACTT
GGTGACGTAG AAAAGGCCGA ACGGATTTAT AAAAAAAGCG AAGTCGCCCC TGTTCAGCGC
CGGTTTATGA TGGCCGTAAA CAATGATCCA GAAATACCGG AAAGGCTACA CCTTAACTTT
GATTTAAGTT ACGCAGAATC AACGGATAAG GGTGCAGCAT GA
 
Protein sequence
MSKKKHFIKR DQRGDKSKKM SIITFGKPEP VLTTGTDYRD IWYDNAADHF TQPIDRLALA 
QLINLNGQHG GIIHARKNMI VSDYLSGGLT YDQLEAAAFD YITFGDIALG KIRNGWGDVI
GLEPLPGLYI RRRKDRNNAT DQPGDYVVLQ EGEPQIWPEE DIIFIKMYDP QQHIYGLPDY
IGGVHSALLN SEAVIFRRRY YHNGAHTGGI LYTRDPSMTD EMEEEIEQQL RDSKGIGNFS
TILVNIPGGD GDAIKFIEMG DISAKDEFAN IKNISAQDIL NAHRFPAGLA GIVPQNTAGL
GDVEKAERIY KKSEVAPVQR RFMMAVNNDP EIPERLHLNF DLSYAESTDK GAA