Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A0638 |
Symbol | |
ID | 6872270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 638653 |
End bp | 640152 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642783854 |
Product | terminase large subunit |
Protein accession | YP_002214540 |
Protein GI | 198245578 |
COG category | [R] General function prediction only |
COG ID | [COG5565] Bacteriophage terminase large (ATPase) subunit and inactivated derivatives |
TIGRFAM ID | [TIGR01547] phage terminase, large subunit, PBSX family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 0.317674 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAACTGG ACGCGATTCT TGATAGCCTG AGCGACGAAG AGCAAATCGA ATTGCTCGAG CTACTCGAAG AAGAAGAGAA CTACCGAAAT ACACACTTGC TATATGAGTT TGCGCCATAC AGCAAACAGC GTGAGTTCAT CGACGCAGGT CATGACTATC CAGAGCGATG TTTTATGGCT GGTAACCAGC TTGGTAAGTC ATTTACTGGC GCTGCTGAAG TCGCGTTTCA CCTTACCGGG CGATACCCAG GAACGAAAGG TTATCCGGCT GATGGTAAAT ATGGCGGAGA GTGGAAAGGT AAGCGTTTCT ATGAGCCAGT TGTCTTCTGG ATTGGCGGTG AAACAAACGA GACTGTAACC AAAACGACTC AACGCATCCT GTGCGGTCGT ATCGAAGAGA ATGATGAACC TGGCTATGGG TCAATCCCGA AAGAGGACAT CATTAGCTGG AAGAAGTCAC CATTCTTCCC TAATCTTGTT GATCACCTTC TTGTTAAGCA CCACACGCCA GAAGGCGTCG AAGATGGCAT CTCAATATGC TACTTCAAGC CTTACTCACA GGGCCGCGCC CGCTGGCAGG GCGACACAAT TCACGGCGTT TGGTTTGACG AAGAGCCGCC ATATAGCATC TATGGCGAAG GTCTTACCCG TACCAACAAA TACGGGCAAT TCTCAATTCT GACGTTTACC CCGCTGATGG GGATGTCTGA CGTTGTTACC AAGTTCCTGA AGAATCCCAG TAAGTCGCAG AAAGTGGTCA ACATGACCAT CTATGACGCT GAGCACTACA CCGACGAGCA GAAAGAGCAA ATCATCGCAT CCTATCCTGA GCATGAGAGA GAGGCGCGTG CTCGCGGTAT TCCTACGATG GGTAGCGGTC GAATATTCCA GATACCGGAA GAGGCGATTA AGTGCCAGCC GTTTGAGTGT CCCGATCACT TCTATGTTAT CGACGCTCAG GACTTCGGCT GGAACCACCC GCAAGCTCAC ATTCAGCTTT GGTGGGACAA AGACGCAGAT GTTTTCTATC TGGCGCGTGT ATGGAAGAAA TCAGAGAACA CTGCCGTTCA GGCATGGGGT GCTGTTAAGT CGTGGGCTAA CAAAATACCT GTCGCGTGGC CTCATGACGG TCACCAACAC GAAAAGGGCG GTGGTGAGCA ACTTAAAACC CAATATGCGG ACGCCGGGTT CTCTATGCTT CCCGAACACG CAACGTTCTC GGATGGCGGT AACTCAGTAG AGTCAGGCAT TAGTGAACTT CGTGACCTGA TGCTTGAAGG AAGATTCAAA GTATTCAACA CATGCGAACC ATTTTTTGAA GAGTTCCGCC TATATCATCG CGATGAGAAC GGCAAGATTG TCAAGACCAA CGATGATGTG CTCGATGCTA CTCGCTACGG CTACATGATG CGCCGCTTCG CCAGGATGAT GCGCGATATC AGAAATCCGA AAGAAAAGAA AATCCCCGCA CCGATTAGAC CAGTACGCAG AGGACGATAA
|
Protein sequence | MELDAILDSL SDEEQIELLE LLEEEENYRN THLLYEFAPY SKQREFIDAG HDYPERCFMA GNQLGKSFTG AAEVAFHLTG RYPGTKGYPA DGKYGGEWKG KRFYEPVVFW IGGETNETVT KTTQRILCGR IEENDEPGYG SIPKEDIISW KKSPFFPNLV DHLLVKHHTP EGVEDGISIC YFKPYSQGRA RWQGDTIHGV WFDEEPPYSI YGEGLTRTNK YGQFSILTFT PLMGMSDVVT KFLKNPSKSQ KVVNMTIYDA EHYTDEQKEQ IIASYPEHER EARARGIPTM GSGRIFQIPE EAIKCQPFEC PDHFYVIDAQ DFGWNHPQAH IQLWWDKDAD VFYLARVWKK SENTAVQAWG AVKSWANKIP VAWPHDGHQH EKGGGEQLKT QYADAGFSML PEHATFSDGG NSVESGISEL RDLMLEGRFK VFNTCEPFFE EFRLYHRDEN GKIVKTNDDV LDATRYGYMM RRFARMMRDI RNPKEKKIPA PIRPVRRGR
|
| |