Gene SeD_A2251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2251 
Symbol 
ID6875062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2151528 
End bp2152607 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content55% 
IMG OID642785352 
Producttail protein 
Protein accessionYP_002216014 
Protein GI198246233 
COG category[S] Function unknown 
COG ID[COG3299] Uncharacterized homolog of phage Mu protein gp47 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.00492829 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGACA GTCAATTTGC ACGTCCTGAA CTTCCTCAGT TGATTGCTAC CATTCGCAGC 
GATTTACTGA CCCGTTTTCA GCAGGATGTT GTGTTACGTC GCATGGATGC CGAGGTTTAC
AGCCGGGTAC AGGCTGCTGC CGTACATACG CTGTATGGTT ATATCGATTA TCTGGCCCGG
AATATGCTGC CTGATATGTG TGATGAGGAC TGGCTTTACC GTCACGCGAG GATTAAGCGT
TGTCCCAGGA AAAATGCCGT ATCTGCGAAG GGATTTGCAC GCTGGGATGG TATTGCCGGA
ACGCCGGAGA TCCCCGCGGG TACACAGATT CAGCGGGATG ATCAGGTTAC ATTCACGACC
CTGCAGACGG TGAAAGCTTC CGGCGGCCTG TTACGTGTGC CGGTTATTGC TGATGTGGCG
GGAACTGCCG GTAATACTGA CGATGGTACG GCGTTACGCC TTGGCACGCC GATTACTGGT
ATTCCTTCTA CAGGTTACGC TGACACTCTG ACCGGGGGGG CTGATACAGA GGAGCCTGAA
ACGTGGCGCG CGCGCGTCAT GGAACGCTAT TACTGGATAC CACAGGGGGG CGCTGATCCT
GATTACGTCA TCTGGGCAAA GGAAATCGCG GGAATAACCC GTGCGTGGAC ATTCCGCCAT
TATAAGGGGA CCGGCACCGT TGGTGTGATG GTGGCTACCA GTAACCCGGT GAATCCGGCT
CCTGGCGACG ATCTCGTTAA GGCTGTACGT GACCATATTT TGCCGCTGGC ACCTGTTGCT
GGCGGCGGAC TCTTTGTTTT CGCTGCCACT GAAAAAAGCA TTCCGGTAAC AGTCGCACTG
GCCAAAGATA CCCCGGAAAT TCGTACTGCC ATTATTGCGG AGCTAAATGC GCTGATGCTG
CGTGATGGCG CGCCGTCCGG AAAAATTTAT GTTTCGCGAA TCAGCGAGGC GATAAGTCTG
GCGACCGGGG AAGTGGCACA TCAGCTGCGT GTGCCGGCGG CAGATGTGGT GCTGGGAAAA
ACTGAACTTC CTGTCCTGGG GAATATAACC TGGGCCACCT ATACCGGGGA GAACGGATAA
 
Protein sequence
MADSQFARPE LPQLIATIRS DLLTRFQQDV VLRRMDAEVY SRVQAAAVHT LYGYIDYLAR 
NMLPDMCDED WLYRHARIKR CPRKNAVSAK GFARWDGIAG TPEIPAGTQI QRDDQVTFTT
LQTVKASGGL LRVPVIADVA GTAGNTDDGT ALRLGTPITG IPSTGYADTL TGGADTEEPE
TWRARVMERY YWIPQGGADP DYVIWAKEIA GITRAWTFRH YKGTGTVGVM VATSNPVNPA
PGDDLVKAVR DHILPLAPVA GGGLFVFAAT EKSIPVTVAL AKDTPEIRTA IIAELNALML
RDGAPSGKIY VSRISEAISL ATGEVAHQLR VPAADVVLGK TELPVLGNIT WATYTGENG