Gene SeD_A0638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0638 
Symbol 
ID6872270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp638653 
End bp640152 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content50% 
IMG OID642783854 
Productterminase large subunit 
Protein accessionYP_002214540 
Protein GI198245578 
COG category[R] General function prediction only 
COG ID[COG5565] Bacteriophage terminase large (ATPase) subunit and inactivated derivatives 
TIGRFAM ID[TIGR01547] phage terminase, large subunit, PBSX family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value0.317674 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAACTGG ACGCGATTCT TGATAGCCTG AGCGACGAAG AGCAAATCGA ATTGCTCGAG 
CTACTCGAAG AAGAAGAGAA CTACCGAAAT ACACACTTGC TATATGAGTT TGCGCCATAC
AGCAAACAGC GTGAGTTCAT CGACGCAGGT CATGACTATC CAGAGCGATG TTTTATGGCT
GGTAACCAGC TTGGTAAGTC ATTTACTGGC GCTGCTGAAG TCGCGTTTCA CCTTACCGGG
CGATACCCAG GAACGAAAGG TTATCCGGCT GATGGTAAAT ATGGCGGAGA GTGGAAAGGT
AAGCGTTTCT ATGAGCCAGT TGTCTTCTGG ATTGGCGGTG AAACAAACGA GACTGTAACC
AAAACGACTC AACGCATCCT GTGCGGTCGT ATCGAAGAGA ATGATGAACC TGGCTATGGG
TCAATCCCGA AAGAGGACAT CATTAGCTGG AAGAAGTCAC CATTCTTCCC TAATCTTGTT
GATCACCTTC TTGTTAAGCA CCACACGCCA GAAGGCGTCG AAGATGGCAT CTCAATATGC
TACTTCAAGC CTTACTCACA GGGCCGCGCC CGCTGGCAGG GCGACACAAT TCACGGCGTT
TGGTTTGACG AAGAGCCGCC ATATAGCATC TATGGCGAAG GTCTTACCCG TACCAACAAA
TACGGGCAAT TCTCAATTCT GACGTTTACC CCGCTGATGG GGATGTCTGA CGTTGTTACC
AAGTTCCTGA AGAATCCCAG TAAGTCGCAG AAAGTGGTCA ACATGACCAT CTATGACGCT
GAGCACTACA CCGACGAGCA GAAAGAGCAA ATCATCGCAT CCTATCCTGA GCATGAGAGA
GAGGCGCGTG CTCGCGGTAT TCCTACGATG GGTAGCGGTC GAATATTCCA GATACCGGAA
GAGGCGATTA AGTGCCAGCC GTTTGAGTGT CCCGATCACT TCTATGTTAT CGACGCTCAG
GACTTCGGCT GGAACCACCC GCAAGCTCAC ATTCAGCTTT GGTGGGACAA AGACGCAGAT
GTTTTCTATC TGGCGCGTGT ATGGAAGAAA TCAGAGAACA CTGCCGTTCA GGCATGGGGT
GCTGTTAAGT CGTGGGCTAA CAAAATACCT GTCGCGTGGC CTCATGACGG TCACCAACAC
GAAAAGGGCG GTGGTGAGCA ACTTAAAACC CAATATGCGG ACGCCGGGTT CTCTATGCTT
CCCGAACACG CAACGTTCTC GGATGGCGGT AACTCAGTAG AGTCAGGCAT TAGTGAACTT
CGTGACCTGA TGCTTGAAGG AAGATTCAAA GTATTCAACA CATGCGAACC ATTTTTTGAA
GAGTTCCGCC TATATCATCG CGATGAGAAC GGCAAGATTG TCAAGACCAA CGATGATGTG
CTCGATGCTA CTCGCTACGG CTACATGATG CGCCGCTTCG CCAGGATGAT GCGCGATATC
AGAAATCCGA AAGAAAAGAA AATCCCCGCA CCGATTAGAC CAGTACGCAG AGGACGATAA
 
Protein sequence
MELDAILDSL SDEEQIELLE LLEEEENYRN THLLYEFAPY SKQREFIDAG HDYPERCFMA 
GNQLGKSFTG AAEVAFHLTG RYPGTKGYPA DGKYGGEWKG KRFYEPVVFW IGGETNETVT
KTTQRILCGR IEENDEPGYG SIPKEDIISW KKSPFFPNLV DHLLVKHHTP EGVEDGISIC
YFKPYSQGRA RWQGDTIHGV WFDEEPPYSI YGEGLTRTNK YGQFSILTFT PLMGMSDVVT
KFLKNPSKSQ KVVNMTIYDA EHYTDEQKEQ IIASYPEHER EARARGIPTM GSGRIFQIPE
EAIKCQPFEC PDHFYVIDAQ DFGWNHPQAH IQLWWDKDAD VFYLARVWKK SENTAVQAWG
AVKSWANKIP VAWPHDGHQH EKGGGEQLKT QYADAGFSML PEHATFSDGG NSVESGISEL
RDLMLEGRFK VFNTCEPFFE EFRLYHRDEN GKIVKTNDDV LDATRYGYMM RRFARMMRDI
RNPKEKKIPA PIRPVRRGR