Gene Dvul_0867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0867 
Symbol 
ID4663694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1072159 
End bp1073382 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content58% 
IMG OID639819089 
ProductHK97 family phage major capsid protein 
Protein accessionYP_966315 
Protein GI120601915 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.887226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACCA TCATCAGAGA CCTGCTGGAA CAGCAGGGTA AGGCATTCGA AGACTTCAAG 
AAGGCGAACG ATGCTCGTCT GAACGCCATG GCCGAGGGAA AGGCCGTATC GGAATTGGAA
AGCAAGGTCG ACAAGGCAGG TGCGGAACTG GACCGTGTTG CCAAGACCCT CGACGAACTG
GCCAAGAAGG CCAACCGCCC ATCTACGGGC AACGATGAAC AAGCCCAGAT TGATGCTGAA
CATAAAACAG CATGGGAACG CTGGGCCCGC AAAGGTGACG ATCACGGCCT TGCCGACATC
GAAGCCAAAT CCATCAGCGT GGGGACACCT GCCGATGGCG GCTACGCGCT GCCCATTGAA
CAGGACCGCA CCATACTCCG GCTTCTGCGT GAACAATCCC CCATGCGGCA AGTATGCCGC
GTCCTCACCA TCGGCACCGA AGACTACCGC AAGCTCGTTA ACCTTGGTGG AACGGGCTCC
GGCTGGGTAG GTGAAAAGGC GGCACGACCG GAAACCGGCA CCCCCACACT GGCAGAGATC
AAGCCCTTCA TGGGGGAGGT GTACGCCAAC CCTGCCGTTA CGCAGAAAGC CCTTGATGAT
CTGTTCTTCA ATGTGGAGGC GGAGCTCTCT GCAGACATCG TCACAGAGTT TGCCGAACAG
GAAGGCAGTG CATTCCTGAG TGGCGATGGC ACCAACAAAC CCAAAGGACT GCTTGCCTAC
CCGCAGGCTG CCACCGCTGA TGGCACCCGT GCCTTCGGGA CTCTGCAGTT CCTCATCACG
GGCGTGGCCG GAGGTTTCAA GTCTCCCTCC ACCACCGTTC ATCCCGCCGA TGACCTTGTG
GACCTCATCT ACGCCCTCAA GAAAGGCCAT AGAGCTGGGG CAACGTTCAT GATGAACGGC
AAGACCCTCT CGACCCTACG CAAATGGAAG GACGCAGAAG GCAACTACAT CTGGCAACCA
GGCATCCAGG CAGGGCAACC GTCCGTTCTC CTCGGATACT CCGTAACAGA GAACGAGGAC
ATGCCGGATG TCGGTGCTGG TGCTATCCCC ATCGCCTTTG GCAACTTCCA GCGTGCCTAC
TGGATCATTG ACCGTATCGG CATCAGGAGC CTTCGGGATC CCTTCACCAA CAAGCCCTAC
GTGCACTTCT ACACCACGAA GCGCGTAGGC GGCATGTTGG TCGATTCGGA AGCCGTGAAG
CTGCTCAAGC TGGCAGCAGC GTAA
 
Protein sequence
METIIRDLLE QQGKAFEDFK KANDARLNAM AEGKAVSELE SKVDKAGAEL DRVAKTLDEL 
AKKANRPSTG NDEQAQIDAE HKTAWERWAR KGDDHGLADI EAKSISVGTP ADGGYALPIE
QDRTILRLLR EQSPMRQVCR VLTIGTEDYR KLVNLGGTGS GWVGEKAARP ETGTPTLAEI
KPFMGEVYAN PAVTQKALDD LFFNVEAELS ADIVTEFAEQ EGSAFLSGDG TNKPKGLLAY
PQAATADGTR AFGTLQFLIT GVAGGFKSPS TTVHPADDLV DLIYALKKGH RAGATFMMNG
KTLSTLRKWK DAEGNYIWQP GIQAGQPSVL LGYSVTENED MPDVGAGAIP IAFGNFQRAY
WIIDRIGIRS LRDPFTNKPY VHFYTTKRVG GMLVDSEAVK LLKLAAA