Gene SeD_A2249 details


Gene Information

Locus tag: SeD_A2249
Symbol:
ID: 6872299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2149389
End bp: 2150951
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 48%
IMG OID: 642785350
Product: tail protein
Protein accession: YP_002216012
Protein GI: 198243482
COG category: [R] General function prediction only
COG ID: [COG5301] Phage-related tail fibre protein
TIGRFAM ID:
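
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2149389, 2150951)
print(length)                     # 1563
print(protein_length_aa(length))  # 520
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.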


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0759311
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.0336742
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACAGAA TTGATACGCC CACCGCGCAA AAAGATAAAT TTGGTCAGGG AAAAAACGGA 
TTTACGAATG GTGATCCCGC CACGGGCCGC CGCGCAACGG ATCTCAACAG TGATATGTGG
GATGCAGTCC AGGAAGAGGT CTGTACTGTT ATTGAAGCCG CCGGCATACC ACTCAGTAAA
GGCGAACATA CGCAGCTTCA CGCCGCCATT GGCAGGCTGA TCGATGAACA GGTTAAAACC
CGTCTTGAAA AAAATCAGAA TGGCGCGGAC ATCCCGAATA AGCCGCTGTT TCTCCAGAAC
GTCGGTTTAG GAGAAACGAT AAATCTCGCT GCAGGGGCCC TGCAAAAATC GCAGAACGGC
GGCGATATTC CTGACAAAAA ACAATTTGCG AGAACCATCG GTGCGGTAAC GTCAACCACC
ATTACACTTG GCGAATCAGG CTGGTTCAAA ATCGCCACGG TTGTAATGCC GCAGGCTACA
TCAACTGCGG TGATTAAACT GTACGGTGGG GCGGGGTTTA ACGCTGGTTC ACCTGAACAG
GCGGCAATCA GCGAACTGGT ATTGCGTGCC GGTAATGGTT CACCTGTTGG AATAACCGCC
ACATTATGGA GGCGTTCACC TTCTGCTGCT AACGAGGTCG CATGGGTTAA TACATCAGGC
GACACCTACG ATATTTATAT TAATATCGGC CAGTATGCGT ACTGGTTAAT TGCGCAATAT
GATTACACCG GTAATGCAAA TGTCACGCTG CACAGTACGC CTGAATATTC ATCAGTTCAG
CCGGGAAACT CAACCAGCGG TCAGACATAT ACACTGTTTA ATAGTCTGAT GAAACCCACA
GCCGGTGACG TTGAGGCACT GTCAGTTAAT GGAGGGAGGC TAAACGGTCC GTTAGGCATT
GGTACTGATA ATGCGCTGGG TGGTAATTCG ATTGTATTCG GAGATAACGA TACAGGGTTT
AAGTGGCACA GTGACGGCGT TCTGGGGATT TATGCCAATA ATGCTCTGGT TGGTTATATC
GACAATTCCG GGCTGCACAT GTCAGTAGAT GTTCTCACTA ATGGTGCCGT ACGCGCAGGC
AACGCAAAAA AACTGTCACT GACGAGCAAT AATAATTCGA CAATGACAGC CACGTTTAAT
TTATGGGGCG ACGCAAACAG GCCAACAGTT ATTGAACTGG ACGACGATCA GGGATGGCAT
CTGTACAGCC AGCGAAATCC TGACGGTTCG ATTGTCTTTA CGGTCAATGG AGATATCACC
GCTAACACGC TTCGTGCAGG CGGGGCTATC TATCAGAATA ACGGCGACAT CTTTGGTTCG
CTATGGGGAA ATGGCTGGTT AAGTACCTGG ATTAATAATA ATCTCGTCTT AGATGTTCAG
TTAGGGGCTG GCACATCAGT GACTACCTGG AACAATGCAG GTTCCTGGCC TAACACTCCC
GGATATGTAG TTACCTCCGT CTGGAAAGAT TATCAGGGCG AAAATATTGA TGGTATTAAT
TATGCGCCTT TGCAAAAACG AGTCGGGAGT CAGTGGTATA CCGTACAAGG GGGAACGGTA
TAA
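
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before it can be used to recompute the GC content or the translation. The following is a rough sketch in plain Python, not the database's own method. The helper names are hypothetical, and the codon table is written out by hand with the amino acid assignments of NCBI translation table 11. It does not handle alternative start codons, which is sufficient here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, codons in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence listing.
first_line = "ATGCACAGAA TTGATACGCC CACCGCGCAA AAAGATAAAT TTGGTCAGGG AAAAAACGGA"
print(translate(clean_sequence(first_line)))  # MHRIDTPTAQKDKFGQGKNG
```

Passing the complete listing through `clean_sequence` and then `gc_content` or `translate` gives values that can be compared with the GC content field and the protein sequence in this record.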
 
Protein sequence
MHRIDTPTAQ KDKFGQGKNG FTNGDPATGR RATDLNSDMW DAVQEEVCTV IEAAGIPLSK 
GEHTQLHAAI GRLIDEQVKT RLEKNQNGAD IPNKPLFLQN VGLGETINLA AGALQKSQNG
GDIPDKKQFA RTIGAVTSTT ITLGESGWFK IATVVMPQAT STAVIKLYGG AGFNAGSPEQ
AAISELVLRA GNGSPVGITA TLWRRSPSAA NEVAWVNTSG DTYDIYINIG QYAYWLIAQY
DYTGNANVTL HSTPEYSSVQ PGNSTSGQTY TLFNSLMKPT AGDVEALSVN GGRLNGPLGI
GTDNALGGNS IVFGDNDTGF KWHSDGVLGI YANNALVGYI DNSGLHMSVD VLTNGAVRAG
NAKKLSLTSN NNSTMTATFN LWGDANRPTV IELDDDQGWH LYSQRNPDGS IVFTVNGDIT
ANTLRAGGAI YQNNGDIFGS LWGNGWLSTW INNNLVLDVQ LGAGTSVTTW NNAGSWPNTP
GYVVTSVWKD YQGENIDGIN YAPLQKRVGS QWYTVQGGTV