Gene Information |
Locus tag | Dde_3374 |
Symbol | |
ID | 3758351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 3342921 |
End bp | 3344201 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637784287 |
Product | Phage putative head morphogenesis protein, SPP1 gp7 |
Protein accession | YP_389863 |
Protein GI | 78358414 |
COG category | [S] Function unknown |
COG ID | [COG2369] Uncharacterized protein, homolog of phage Mu protein gp30 |
TIGRFAM ID | [TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family |
|
|
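The coordinate and length fields in the Gene Information table are related by simple arithmetic, which can be checked as a sanity test on the record. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS is complete and ends in a single stop codon (the record lists the gene as neither spliced nor a pseudogene). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3342921
END_BP = 3344201
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```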
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTTG AGCCTGTAGC GCTGCCCCCC AAAGAGGCCC TTGCCTATTG GCAGGACAAG GTGTCCGTCA CCCCGGAGGT GTTCAAGAAC CTTTCCGGGC AGGCCCGCGC ACGTGCCTTC GCCGTGTCGG GGCTGTCCCG GCAGGACCAG ATAGCCGCCG TGCAACAGGC TGTCCATGAA GCCATGGCCA ACGGGGAAAC TCTCAAAGAC TTCAAGGGGC GTATGGACGC CGTACTGGAG GGAGCCAGGC TGCCGCGCTG GCGGCTGGAG AACATCTACC GCACCAACGT ACAGAGCGCC TACATGGCCG GGCGTTATGC ACAGATGCAG CGTACCACGG CTTTGCGTCC CTACTGGCGC TATGTGGCCG TGGCCGACAA ACGCACCCGC CCGGATCATT TGGCGCTGCA TGGGCTTGTC TATCCCCACG ATCATAAATT CTGGAGCACC TATTATCCAC CCAACGGCTT CGCCTGCCGC TGCACAGTGC AAACGCTTTC TGAGCGGCAG GCCAAGCAAA GCGGTGTGGA AATCCAGAAG GATATGCCGG ACCTCATTGA GCCGGTGGAC CCGCGCACAG GCAATCGTCT TCCGGCCCGT CGCCCGGTGC CCGATGCTGG CTTTGCGGGA AACGTGGGGC AAGACTGGTT ACATGGACTC GCACCTTCGG AACTGGACGC CAAGATCAAA GACCTGCCTC TTCCCACGCT TTGCCGCACC GGCGGCACAT CCTTTGCCGA TCCACAAGCC GGTGCCCCGT GCAGGCCGCC ACTGGCCTCA TTGGCCAAGC GCCACATCCT GCCGGTTACG GCAAAGGACA TTCTGCCCGG GGGCCTGAAG GCGGAAGAGT ATGTGGCTGC CTTCCTCAAA GAGTTCAACC TTGCCGATAT CAATGCCAGC GCTGTGCATA CGATTCCGGG CGGTATCCCC GTCGTTATCG GCAAGGGATT GTTTATCGAC AAGAAAACAG GCGGTTGGAA AGTGCTGAAG AGTGGCCGTG AGCAATATCT GAAGCTGTTG GCCAGAACTG TCAAAGAACC GTGGGAGGTC TGGCAGGTGC CAGCGGAGGT GGCAGGAAAG CCTATGCCGG TGTTGCGGCT GATCAGGCTG TTCCGGGATG AGGAGGAAGC CAGGATTGGG GGCTTTGCCG TGTTCAATCT GGTGCGGGGC CGTGAATGGC AGGGAGCCAC CACCTTTACC CCCAAGCTCG GCAACGAAGC TGCCATGCTC AAATACATGG AGCGCCAGCG GCAGGGAGCG CTGCTGTATC GCGAGCCCTA A
|
Protein sequence | MTVEPVALPP KEALAYWQDK VSVTPEVFKN LSGQARARAF AVSGLSRQDQ IAAVQQAVHE AMANGETLKD FKGRMDAVLE GARLPRWRLE NIYRTNVQSA YMAGRYAQMQ RTTALRPYWR YVAVADKRTR PDHLALHGLV YPHDHKFWST YYPPNGFACR CTVQTLSERQ AKQSGVEIQK DMPDLIEPVD PRTGNRLPAR RPVPDAGFAG NVGQDWLHGL APSELDAKIK DLPLPTLCRT GGTSFADPQA GAPCRPPLAS LAKRHILPVT AKDILPGGLK AEEYVAAFLK EFNLADINAS AVHTIPGGIP VVIGKGLFID KKTGGWKVLK SGREQYLKLL ARTVKEPWEV WQVPAEVAGK PMPVLRLIRL FRDEEEARIG GFAVFNLVRG REWQGATTFT PKLGNEAAML KYMERQRQGA LLYREP
|
| |
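The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method: it removes the block spacing, computes a GC fraction, and translates an excerpt of the gene sequence. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, copied from the record.
excerpt = clean_sequence("ATGACCGTTG AGCCTGTAGC GCTGCCCCCC")
print(translate(excerpt))  # MTVEPVALPP, the first block of the protein sequence
```

Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content field, and `translate` on the full sequence can be compared against the listed protein sequence after passing it through `clean_sequence` as well.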