Gene SeHA_C3494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3494 
Symbol 
ID6489503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3391374 
End bp3392435 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content50% 
IMG OID642743623 
Productphage portal protein, pbsx family 
Protein accessionYP_002047237 
Protein GI194450777 
COG category[R] General function prediction only 
COG ID[COG5518] Bacteriophage capsid portal protein 
TIGRFAM ID[TIGR01540] phage portal protein, PBSX family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value4.19269e-27 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGTAAAA AGAAACACTT CGTTAAGCGC GACCAGCGCG GCGATAAGTC AAAAAAAATG 
AGCATCATTA CGTTCGGCAA ACCGGAACCT GTTCTGACCA CCGGTACCGA CTACCGGGAT
ATCTGGTACG ACAATGCAGC CGATCATTTT ACCCAGCCAA TTGACCGGCT GGCACTGGCA
CAACTGATTA ACCTTAACGG TCAACATGGC GGCATCATCC ATGCCCGTAA AAACATGATT
GTGTCTGATT ATCTGTCTGG CGGCCTGACT TACGACCAGC TGGAAGCCGC TGCTTTTGAC
TACATCACAT TTGGGGATAT TGCACTTGGA AAAATTCGTA ACGGATGGGG AGATGTGATC
GGACTGGAAC CCTTACCCGG TCTCTATATC CGACGCAGGA AAGACAGGAA CAACGCAGCT
GATCAACCTG GTGATTACGT GGTGCTACAG GAAGGCGAAC CGCAGATATG GCCGCAGGAA
GATATCATTT TTATCAAGAT GTACGACCCG CAGCAGCATA TTTACGGACT GCCGGACTAC
ATCGGCGGCG TACATTCTGC ATTGCTCAAC AGTGAAGCGG TCATTTTCCG TCGCCGCTAT
TACCACAATG GCGCACACAC GGGCGGTATT CTTTATACCC GCGACCCCAG CATGACGGAT
GAAATGGAAG AAGAAATTGA ACAGCAGCTG CGTGACAGCA AAGGGATCGG CAACTTCTCC
ACCATCCTGG TAAACATTCC CGGTGGAGAC GGTGACGCCA TCAAATTCAT TGAAATGGGG
GATATTTCCG CTAAGGATGA ATTTGCCAAC ATCAAGAATA TCAGCGCCCA GGACATTCTG
AACGCGCACC GTTTTCCTGC CGGGCTTGCC GGCATTGTCC CGCAAAATAC TGCCGGGCTG
GGTGACGTAG AAAAGGCCGA ACGGATTTAT AAAAAAAGCG AAGTCGCCCC TGTTCAGCGC
CGTTTTATGA TGGCCGTAAA CAATGATCCA GAAATACCGG GAAACCTGCA CCTGAACTTT
GATTTAAGTT ACACAGAATC AACGGATAAG GGTGCGGTAT GA
 
Protein sequence
MSKKKHFVKR DQRGDKSKKM SIITFGKPEP VLTTGTDYRD IWYDNAADHF TQPIDRLALA 
QLINLNGQHG GIIHARKNMI VSDYLSGGLT YDQLEAAAFD YITFGDIALG KIRNGWGDVI
GLEPLPGLYI RRRKDRNNAA DQPGDYVVLQ EGEPQIWPQE DIIFIKMYDP QQHIYGLPDY
IGGVHSALLN SEAVIFRRRY YHNGAHTGGI LYTRDPSMTD EMEEEIEQQL RDSKGIGNFS
TILVNIPGGD GDAIKFIEMG DISAKDEFAN IKNISAQDIL NAHRFPAGLA GIVPQNTAGL
GDVEKAERIY KKSEVAPVQR RFMMAVNNDP EIPGNLHLNF DLSYTESTDK GAV