Gene SeD_A3025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3025 
Symbol 
ID6871119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2923345 
End bp2924445 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content54% 
IMG OID642786056 
Productphage late control D family protein 
Protein accessionYP_002216702 
Protein GI198244478 
COG category[R] General function prediction only 
COG ID[COG3500] Phage protein D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.00666291 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACGTTA ATTCTGATCT CCTGAATCTG AACAGCAAAA GCCCGGCTTT CAGTATCACC 
ATCGAAGGCA AAGACGTGAC GACGGTGATG GACGCGCGCC TGATGAGTCT GACACTGACG
GATAACCGGG GCTTTGAAGC GGACCAGCTT GATCTGGAGC TGGACGACGC CGACGGGCTG
ATCGCCCTGC CGCGACGTGG GGCTGTGATT CAGCTGGCGC TGGGCTGGAA AGGCCAGCCG
CTTTTCCCTA AAGGGGTTTT TACCGTAGAT GAAATTGAAC ACAGCGGTGC CCCTGACCGG
CTGACCATCA GGGCGCGTAG CGCAGATTTC CGTGAAACCC TCAATACACG GCGCGAAAAA
TCATGGCATC AGACAACGGT GGGGGAGGTG GTAAAGGAAA TCGCCGCCCG GCATAACCTC
AAAGTGGCGC TGGGTAAAGA CCTGACGGAT AAGGCGCTGG ATCATATGGA CCAGACCAAT
GAAAGCGATG CAAGTTTTCT GATGAAACTG GCGAGGCAGT ATGGGGCGAT TGCTTCCGTT
AAAGACGGGA ACCTGCTATT TATCCGGCAG GGACAGGGAA GAACGGCAAG CGGCAAGCCG
CTGCCGGTTA TCACCATCAC GCGCAAAGCC GGTGACGGTC ATCGGTTCAC CCTTGCTGAT
CGTGGTGCCT ATACCGGTGT TATTGCCAGC TGGTTGCATA CGCGTGAACC CAGGAAAAAA
GAGACAACCA GTGTTAAGCG TCGTCGAAAG AAAGCCACCA CACCCAAAGA GCCGGAAGCA
AAACAGGGTG ATTATCTGGT GGGAACGGAT GAAAACGTGT TGGTTCTTAA TCGTACCTAC
GCCAACCGGA GCAATGCAGA GCGCGCAGCA AAAATGCAGT GGGAACGCCT GCAGCGCGGG
GTTGCTTCAT TTTCCCTGCA GCTCGCTGAG GGGCGGGCTG ATCTCTATAC GGAAATGCCG
GTGAAGGTGA CGGGATTTAA GCAGCCGATT GATGATGCAG AATGGACCAT TACCACCCTG
ACGCATTCTG TCAGCTCGGA TAATGGATTT ACGACCAGCA TGGAGCTTGA AGTAAAGATT
GATGGTCTTG AAATCGAATA A
 
Protein sequence
MNVNSDLLNL NSKSPAFSIT IEGKDVTTVM DARLMSLTLT DNRGFEADQL DLELDDADGL 
IALPRRGAVI QLALGWKGQP LFPKGVFTVD EIEHSGAPDR LTIRARSADF RETLNTRREK
SWHQTTVGEV VKEIAARHNL KVALGKDLTD KALDHMDQTN ESDASFLMKL ARQYGAIASV
KDGNLLFIRQ GQGRTASGKP LPVITITRKA GDGHRFTLAD RGAYTGVIAS WLHTREPRKK
ETTSVKRRRK KATTPKEPEA KQGDYLVGTD ENVLVLNRTY ANRSNAERAA KMQWERLQRG
VASFSLQLAE GRADLYTEMP VKVTGFKQPI DDAEWTITTL THSVSSDNGF TTSMELEVKI
DGLEIE