Gene Information |
Locus tag | Dvul_0867 |
Symbol | |
ID | 4663694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 1072159 |
End bp | 1073382 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639819089 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_966315 |
Protein GI | 120601915 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
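
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(1072159, 1073382)
print(length)                  # 1224
print(protein_length(length))  # 407
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.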
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.887226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCA TCATCAGAGA CCTGCTGGAA CAGCAGGGTA AGGCATTCGA AGACTTCAAG AAGGCGAACG ATGCTCGTCT GAACGCCATG GCCGAGGGAA AGGCCGTATC GGAATTGGAA AGCAAGGTCG ACAAGGCAGG TGCGGAACTG GACCGTGTTG CCAAGACCCT CGACGAACTG GCCAAGAAGG CCAACCGCCC ATCTACGGGC AACGATGAAC AAGCCCAGAT TGATGCTGAA CATAAAACAG CATGGGAACG CTGGGCCCGC AAAGGTGACG ATCACGGCCT TGCCGACATC GAAGCCAAAT CCATCAGCGT GGGGACACCT GCCGATGGCG GCTACGCGCT GCCCATTGAA CAGGACCGCA CCATACTCCG GCTTCTGCGT GAACAATCCC CCATGCGGCA AGTATGCCGC GTCCTCACCA TCGGCACCGA AGACTACCGC AAGCTCGTTA ACCTTGGTGG AACGGGCTCC GGCTGGGTAG GTGAAAAGGC GGCACGACCG GAAACCGGCA CCCCCACACT GGCAGAGATC AAGCCCTTCA TGGGGGAGGT GTACGCCAAC CCTGCCGTTA CGCAGAAAGC CCTTGATGAT CTGTTCTTCA ATGTGGAGGC GGAGCTCTCT GCAGACATCG TCACAGAGTT TGCCGAACAG GAAGGCAGTG CATTCCTGAG TGGCGATGGC ACCAACAAAC CCAAAGGACT GCTTGCCTAC CCGCAGGCTG CCACCGCTGA TGGCACCCGT GCCTTCGGGA CTCTGCAGTT CCTCATCACG GGCGTGGCCG GAGGTTTCAA GTCTCCCTCC ACCACCGTTC ATCCCGCCGA TGACCTTGTG GACCTCATCT ACGCCCTCAA GAAAGGCCAT AGAGCTGGGG CAACGTTCAT GATGAACGGC AAGACCCTCT CGACCCTACG CAAATGGAAG GACGCAGAAG GCAACTACAT CTGGCAACCA GGCATCCAGG CAGGGCAACC GTCCGTTCTC CTCGGATACT CCGTAACAGA GAACGAGGAC ATGCCGGATG TCGGTGCTGG TGCTATCCCC ATCGCCTTTG GCAACTTCCA GCGTGCCTAC TGGATCATTG ACCGTATCGG CATCAGGAGC CTTCGGGATC CCTTCACCAA CAAGCCCTAC GTGCACTTCT ACACCACGAA GCGCGTAGGC GGCATGTTGG TCGATTCGGA AGCCGTGAAG CTGCTCAAGC TGGCAGCAGC GTAA
|
Protein sequence | METIIRDLLE QQGKAFEDFK KANDARLNAM AEGKAVSELE SKVDKAGAEL DRVAKTLDEL AKKANRPSTG NDEQAQIDAE HKTAWERWAR KGDDHGLADI EAKSISVGTP ADGGYALPIE QDRTILRLLR EQSPMRQVCR VLTIGTEDYR KLVNLGGTGS GWVGEKAARP ETGTPTLAEI KPFMGEVYAN PAVTQKALDD LFFNVEAELS ADIVTEFAEQ EGSAFLSGDG TNKPKGLLAY PQAATADGTR AFGTLQFLIT GVAGGFKSPS TTVHPADDLV DLIYALKKGH RAGATFMMNG KTLSTLRKWK DAEGNYIWQP GIQAGQPSVL LGYSVTENED MPDVGAGAIP IAFGNFQRAY WIIDRIGIRS LRDPFTNKPY VHFYTTKRVG GMLVDSEAVK LLKLAAA
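
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the tool the database used: `gc_fraction` and `translate` are hypothetical helper names, and the codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (the two differ in permitted start codons). The example input is only the first 30 bases of the gene sequence above; both helpers ignore the spaces that split the sequence into blocks of ten.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGGAAACCA TCATCAGAGA CCTGCTGGAA"
print(translate(prefix))  # METIIRDLLE
```

The output matches the first ten residues of the protein sequence. Although the gene lies on the minus strand, the gene sequence in this record already begins with ATG and ends with TAA, so no reverse complement is applied here.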