Gene Information |
Locus tag | Dvul_0869 |
Symbol | |
ID | 4663652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 1074075 |
End bp | 1075304 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639819091 |
Product | HK97 family phage portal protein |
Protein accession | YP_966317 |
Protein GI | 120601917 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
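The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1074075
end_bp = 1075304
gene_length_bp = 1230
protein_length_aa = 409

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so it adds no residue to the protein.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, codon_count, expected_protein_length)
```

Under these assumptions, the span matches the listed gene length and the codon count minus the stop codon matches the listed protein length.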
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTCC TCTCCTGGTT TCGTGAGCGC AAAAGCCCCC CCCGCCCTTT GACGATTGAA GATCTGCTGG CGGACGGATT CTACACGGGA TCGGCCAAAA GCGGTGTCCC CGTGACGTGG AAGACAGCCC TGCAAGCCGC AACAGCTCTG GCTTGCGCCA GAGTCATTGC CGAAGGACTT GCCCAAGTCC CCCTGAAGCT CTTTGAGTCT CAAAACCAAG TACGTACTCC GGCCACGGGC CATCCCCTGT ACTCGCTACT CCACGACAGC CCTAACGAGT GGCAGACCAG CTTCGAATTC ATTGAGCAGA TTGCCTTGCA TCTCGTGCTG TGCGGTAACG CCTTCGTCTT TGTGAACCGC CCGCTGGGTC GAGTGGCCGA ATTACTACCC TATGAGCCGC AGCAGGTCAT CGTAAAGCGC GAGGGGTATG CGCTCTCCTA CGACATCACC ACCGAAAACG GCCAACGCCT GAATGTACCG GCTTCGGACA TGTGGCATCT ACGTGGGCCA TCGTGGAACG GGTGGATGGG GCTAGAGAGT GTCAGGCTGG CCCGTGAAGC CATCGGGCTT GCGCTTGCCA CAGAGGAGCA CGGTGCTCGC TTGTTCGCTA ACGGGGCAAC GGTTGGCGGT ATTCTATCCA CGGAACAAAC ACTCAACGAG GAACAACGCC AAGCGCTCCG TAAATCATGG GAGGCACGGC ACACCGGTGG CGGAAACGCT TTCAAAACAG CCGTTCTCTG GGGTGGGATG AAATTCACGC CCATGACCGC CCCCAACGAC CAAGCTCAGT TCCTTGAAAC CCGTAAATTC CAAGTTGAAG AGATTTGCAG GACATTCCGG GTGCTTCCCA TCATGGTGGG GTATTCGGAC AAGACTGCGA CCTACGCCAG TGCCGAGCAG ATGTTCCTCG CGCATGTTGT CCATACGCTG GGCCCGTGGT GCCGTCGCAT TGAGGCGAGC ATCGCGCACA ACCTGTTGAC CGAAGAGGAG CGTCAGCAGG GCTACTACGC CAAATTCATG CTCAACGGGC TTCTGCGGGG TGCTGCCAAG GACAGGGCTG AGTTCTATGC CCGTCTGTAT GGCATCGGCG CACTGAACCC TAACGAGATC CGTGAACTGG AGGATATGAA CCCATATGAC GGAGGTGAAC ACTACCGCGT GCCCTTGAAC ATGACTGACC CCACTTCACC CGCTGGCCAG GAGGCACCGC ATGCAACGGC TGAACTGTAA
Protein sequence | MNFLSWFRER KSPPRPLTIE DLLADGFYTG SAKSGVPVTW KTALQAATAL ACARVIAEGL AQVPLKLFES QNQVRTPATG HPLYSLLHDS PNEWQTSFEF IEQIALHLVL CGNAFVFVNR PLGRVAELLP YEPQQVIVKR EGYALSYDIT TENGQRLNVP ASDMWHLRGP SWNGWMGLES VRLAREAIGL ALATEEHGAR LFANGATVGG ILSTEQTLNE EQRQALRKSW EARHTGGGNA FKTAVLWGGM KFTPMTAPND QAQFLETRKF QVEEICRTFR VLPIMVGYSD KTATYASAEQ MFLAHVVHTL GPWCRRIEAS IAHNLLTEEE RQQGYYAKFM LNGLLRGAAK DRAEFYARLY GIGALNPNEI RELEDMNPYD GGEHYRVPLN MTDPTSPAGQ EAPHATAEL
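The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a rough sketch using only the Python standard library. It assumes the sequences are pasted as strings with the 10-character spacing shown above, and it uses the amino acid assignments of NCBI translation table 11, which match the standard code (the two tables differ only in which start codons are permitted). The function names `clean`, `gc_percent`, and `translate` are hypothetical helpers, not part of any database tool.

```python
import itertools

# Codon table in the conventional TCAG ordering.
# Assumption: amino acid assignments for table 11 are the same as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
first_groups = "ATGAACTTCC TCTCCTGGTT TCGTGAGCGC"
print(translate(first_groups))
```

Passing the full gene sequence to `translate` and `gc_percent` should allow a comparison with the listed protein sequence and the GC content field. Because the gene begins with ATG, the alternative start codons permitted by table 11 do not need special handling in this sketch.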