Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DET1085 |
Symbol | |
ID | 3229626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides ethenogenes 195 |
Kingdom | Bacteria |
Replicon accession | NC_002936 |
Strand | - |
Start bp | 986530 |
End bp | 987747 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637120649 |
Product | HK97 family major capsid protein |
Protein accession | YP_181800 |
Protein GI | 57234159 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAGA TTATGGAACT CATGGAAAAG AGAGCGAAGG CATGGGAGGC TGCAAAGGCT TTTCTTAACA CCCACTCTCA GAACGGCGGC ATGGTTTCTG CGGAGGATGC CGCAACCTAT GACAAGATGG AGAAGGAAGT CACCGACCTC ACCCACGATA TCGAGCGCCT GCAGCGTCAG GAGCAGATCG ACAAGATGAT GAGCGCACCG ACCTCTGCTC CCCTTACCGG AAAGCCCGGT GCCAAGGATG AGCCGGAGGA TAAGCCCGGC ATCGCTTCCA AGGCATACCG CTCCGCCTTC TGGAACAACA TCCGCAAGCG CAACTACTAC GATGTCCAGA ACGTGCTGGA GGTCGGCACC GATGCCAACG GCGGATATCT CGTCCCGGAC GAGTACGAAA AGCAGCTGAT CGACGCGCTT CAGGAGGAGA ACTTCTTCCG TTCCCTCGCT ACGGTCATTC AGACCCAGTC CGGCACCCAC ACCATTCCGG TCGTCGCCTC TCACGGCACT GCCGCTTGGA TGGATGAGAA CGGCCTGTAC CCTGAATCCG ACGATACCTT CGACCAGATC AGCCTTTCCG CTTACAAGCT GGGCACGGCG ATCAAGGTTT CCGAGGAGCT GATGAACGAC TCCGTTTTCG ACCTCGAAAA GTACATCTCC ACCGAGTTTG CACGCAGGAT CGGTGCTGCC GAGGAGGAGG CTTTCCTGAT CGGCGACGGC AACAAAAAGC CCGAAGGCGT GTTCACCAAG GTGGCAGCCA CTACCGGCGC GACCACAGAG ATCAACAATG CCACGGTGTC CTTCGACGAC ATCATGGACG TGTTCCATTC GCTCCGCAGC GTCTACAGGA ACAAGGCCAT CTGGATCTTG AACGATACCA CCATCAAGGC GCTCAGGAAG ATCAAGGACA ACAACGGGAA CTACATCTGG CAGCCCTCTG TTGTGGCTGG CCAGCCCGAC ACCATCTTGA ACCGTCCTTA CAAGACCAGC ATCTACGCGC CGGAGCTTGC TGCGGGCAAT ACCGCGATCC TCTTTGGCGA CTTCAGCTTC TACTGGATCG CTGACCGTCA GGGACGCTCC TTCAAGCGCC TCTCCGAGCT CTATGCGGCA AACGGCCAGA TTGGCTTCCT TGCTTCCGAG CGCGTGGACG GCAAGCTCAT CCTTCCGGAG GCTGTAAAGG GTCTGTCCGT CAAGGCGGCC TCTTCTTCGA GAGGATAA
|
Protein sequence | MTQIMELMEK RAKAWEAAKA FLNTHSQNGG MVSAEDAATY DKMEKEVTDL THDIERLQRQ EQIDKMMSAP TSAPLTGKPG AKDEPEDKPG IASKAYRSAF WNNIRKRNYY DVQNVLEVGT DANGGYLVPD EYEKQLIDAL QEENFFRSLA TVIQTQSGTH TIPVVASHGT AAWMDENGLY PESDDTFDQI SLSAYKLGTA IKVSEELMND SVFDLEKYIS TEFARRIGAA EEEAFLIGDG NKKPEGVFTK VAATTGATTE INNATVSFDD IMDVFHSLRS VYRNKAIWIL NDTTIKALRK IKDNNGNYIW QPSVVAGQPD TILNRPYKTS IYAPELAAGN TAILFGDFSF YWIADRQGRS FKRLSELYAA NGQIGFLASE RVDGKLILPE AVKGLSVKAA SSSRG
|
| |