Gene SeD_A3048 details


Gene Information

Locus tag: SeD_A3048
Symbol:
ID: 6873376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2939570
End bp: 2940628
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 56%
IMG OID: 642786078
Product: phage major capsid protein, P2 family
Protein accession: YP_002216724
Protein GI: 198245755
COG category:
COG ID:
TIGRFAM ID: [TIGR01551] phage major capsid protein, P2 family
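
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2939570
end_bp = 2940628

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which contributes no amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1059
print(protein_length)  # 352
```

Both results agree with the Gene Length (1059 bp) and Protein Length (352 aa) fields.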


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.000831709
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGAAGA ATACCCGCTT TGCTTTTAAC GCTTACCTGC AGCAGCTGGC GCGTCTGAAC 
GGTGTGGCCG TTGAAGAACT GTCCAGCAAG TTCACTGTAG AGCCGTCTGT GCAGCAGACG
CTGGAAGACC AGATCCAGCA GTCCGCCGCA TTCCTGACGC TGATTAACGT CACGCCAGTG
ACTGAGCAGT CCGGTCAGCT GCTGGGGCTG GGTGTTGGCA GCACCATTGC CGGAACCACT
GACACCACTG CGAAAGAGCG TGAACCTGTC GATCCGACGC TGATGGTCGA TGTGGAATAT
AAATGCGAGC AGACCAACTT TGACACGGTG CTGACCTACG CGAAGCTGGA CCTGTGGGCG
AAGTTTCAGG ATTTCCAGGT GCGTATCCGT GACGCCATCG TGAAACGTCA GGCACTGGAC
CGCATCATGA TCGGCTTTAA CGGCGTGAAG CGTGCGAAAA CCTCCAACCG TAGTGAAAAC
CCGCTGCTGC AGGATGTGAA TAAAGGCTGG CTGCAGAAAA TCCGTGAGGA TGCACCGGAT
CACGTCATGG GCAGCACCAC CGCGGGCGGC GAAACCACAC CGGGTGCGGT GAAAGTCGGG
AAAGGTGGCG AATATGCCAA CCTGGACGCT GTGGTGATGG ATGCGGTCAA TGAGCTTATC
GACGTGGTCT ACCAGGACGA TGACGATCTG GTGGTGATTT GCGGTCGTGA ACTGCTGTCT
GACAAGTATT TCCCGCTGGT CAACAAAGAG CAGGAAAACA GTGAAAAATT GGCAGCCGAT
ATGATTATCA GTCAGAAACG CATGGGCGGT CTGCAGGCCG TGCGTGCGCC GTTCTTCCCG
CCGAATGCGC TGCTGATCAC CCGTCTGGAT AACTTGTCCA TCTACTGGCA GGAAGACACC
CGCCGCCGTT CAGTTATCGA CAACCCGAAA CGTGACCGGA TTGAAAATTT TGAATCCGTT
AACGAAGCCT ACGTGGTTGA GGACTACCGC TGCGCCGCAC TGGTGGAAAA CCTCCAGATT
GGCGACTTCA GCGCCGCCGC AGCAGAAGCC GGAGCGTAA
 
Protein sequence
MKKNTRFAFN AYLQQLARLN GVAVEELSSK FTVEPSVQQT LEDQIQQSAA FLTLINVTPV 
TEQSGQLLGL GVGSTIAGTT DTTAKEREPV DPTLMVDVEY KCEQTNFDTV LTYAKLDLWA
KFQDFQVRIR DAIVKRQALD RIMIGFNGVK RAKTSNRSEN PLLQDVNKGW LQKIREDAPD
HVMGSTTAGG ETTPGAVKVG KGGEYANLDA VVMDAVNELI DVVYQDDDDL VVICGRELLS
DKYFPLVNKE QENSEKLAAD MIISQKRMGG LQAVRAPFFP PNALLITRLD NLSIYWQEDT
RRRSVIDNPK RDRIENFESV NEAYVVEDYR CAALVENLQI GDFSAAAAEA GA
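
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in the start codons they permit). The `translate` function and the constant names are hypothetical helpers written for this example, and only the first 60 bases of the gene sequence are used.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGAAGAAGA ATACCCGCTT TGCTTTTAAC GCTTACCTGC AGCAGCTGGC GCGTCTGAAC"
print(translate(first_line))  # MKKNTRFAFNAYLQQLARLN
```

The output matches the first 20 residues of the protein sequence. Applying the same function to the last nine bases of the gene, `GGAGCGTAA`, yields `GA`, the final two residues, with `TAA` acting as the stop codon.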