Gene SeAg_B2801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B2801 
Symbol 
ID6792471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp2743734 
End bp2744753 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content53% 
IMG OID642776978 
Productphage portal protein, pbsx family 
Protein accessionYP_002147592 
Protein GI197250748 
COG category[R] General function prediction only 
COG ID[COG5518] Bacteriophage capsid portal protein 
TIGRFAM ID[TIGR01540] phage portal protein, PBSX family 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC GGAAATACAG GGAACGCCGC ACCGTTACCA GACCGCGCCA TATGAGCCTT 
ATCACGCTGG GTAAGCCAGA ACCCATTCTG ACGACCGGCA CGAACTATAC AGACGTCTGG
TATGACAATG AAGCGGAACA CTGGACGCTC CCGATTGACC GGCTGGCGCT GGCGCAACTG
GTAAACCTGA ACGCGCAGCA CGGTGGCGTG CTGTATGCCC GCCGCAATAT GGTGACAGCA
AATTATAATG GCGGCGGCCT GACGCATGAG CAACTGGGCG CGGCCGTGTT TGACTGGCTG
ACGTTCGGTG ATGTGGCCAT TCTCAAGGTA CGTAACGGCT GGGGGGATGT AATCGCACTT
TACCCGCTGC CGGCACTCTA TACCCGCCAG CGTAAGACCG GGGAATTTGT TGTACTTCAG
CAGGGTGAAC CGGTAATTTA TCCGCCTGAA GATATTATTT TTCTCAGGCA GTACGACCCG
CAACAGGCCA TTTATGGTCT TCCGGATTAC ATCAGCGGCA TCCACTCCGC CATGCTCAAC
GGTGAAGCCA CGATTTTTCG CCGGCGTTAC TACCACAACG GTGGTCACAC GGGCGGCATG
ATTTATTGCA ACGACCCGAA TATGACCGAC GAAGTGGAAG AAGAAATCAT TCAGAAGCTG
GAGCAGTCGA AGGGGATCGG GAACTTCAGC ACCATGTTTG TGAACATCCC CAAAGGCGAT
CCGGATGGCA TCAAATTTAT CCCGATTGGC GATATCAGTG CCAAAGATGA GTTTCAGAAC
ATCAAAAGCA TCAGCGCCCA GGACGTGCTG ACCGCGCATC GTTTTCCGGC AGGTCTGGCA
GGGATTATCC CCACCAACGG AGCTATAATG GGCGATATTG AAAAAGCGGC TAAAACATAC
CGTAAAGCGG AGATTTTACC CATTCAGCGT ATGTTCAGCG CCGCAGTGGC GCAGGAAAGT
GATGTACCGC CCCACCTGTA CCTTAATTTC CTGAAAGACA GTGAGCTGGA AGGTGATTAA
 
Protein sequence
MKKRKYRERR TVTRPRHMSL ITLGKPEPIL TTGTNYTDVW YDNEAEHWTL PIDRLALAQL 
VNLNAQHGGV LYARRNMVTA NYNGGGLTHE QLGAAVFDWL TFGDVAILKV RNGWGDVIAL
YPLPALYTRQ RKTGEFVVLQ QGEPVIYPPE DIIFLRQYDP QQAIYGLPDY ISGIHSAMLN
GEATIFRRRY YHNGGHTGGM IYCNDPNMTD EVEEEIIQKL EQSKGIGNFS TMFVNIPKGD
PDGIKFIPIG DISAKDEFQN IKSISAQDVL TAHRFPAGLA GIIPTNGAIM GDIEKAAKTY
RKAEILPIQR MFSAAVAQES DVPPHLYLNF LKDSELEGD