Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_1106 |
Symbol | |
ID | 7173007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 1355112 |
End bp | 1356917 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643539617 |
Product | phage uncharacterized protein-like protein |
Protein accession | YP_002435528 |
Protein GI | 218886207 |
COG category | [R] General function prediction only |
COG ID | [COG5362] Phage-related terminase |
TIGRFAM ID | [TIGR01630] phage uncharacterized protein (putative large terminase), C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 0.139903 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGCCC CTCCCATGAT TCCACCCGCA GCCGCCATGC CCACCACCCG GCGCGATACC GCCGCCCTGT ACGCCGACCT TGCCGCAGAC GCCACCCGTA GCGGCCCCGC CGCACTGGCC GCCGTGCAGG CGGAGCTTGG CCGCCGCGAC CTGTTCTACC TGCTCACCCG GCTCATGAAC CGGACCGACA TGGACCGCGA CTGGCTGTTC GAACGCTGCC GCGAGGTTCA GGCCGCGCCC GACGGCCACC TGGACCTGTG GGCGCGCGAA CACTACAAAT CCACCATCAT CACCTTCGGC CAGACCGTGC GCGACATCCT GAACGACCCG GAAATCACGG TGGGCATCTT TTCCCACTCG CGCCCGGTGG CCAAGGACTT CCTGGGCCAG ATCAAGCACG AGTTCGAACA CAACGACCTG CTGAAGCGCC TGTACCCCGA CGTGCTGTGG GCCGCGCCCC GGCGGCAGGC CCCGGTGTGG TCGCTGGACA AGGGCATCGT GGTGCGCCGC CGAGGCAACC CCAAGGAGGC CACCGTGGAG GCGTGGGGAC TGGTGGACGG CCAGCCCACG GGCAAGCACT TTCGCCTGCT GGTCTACGAC GACGTGGTTA CCCGCGACTC CGTGACCACC CCGGAAATGA TCGCCAAGGT CACGGAATGC TGGGCCCTGT CGCTCAATCT GGGGGCCAGC GGGGTGACCG GCGAAGACCG TGGCGGAGGC CGTGCCGGGA ACAGCGGCGG AGATGCCCGC GGGAAGACGG GAAACAAAAA CGGTGCCGCG GCCCCTGCGG ACGACACGGG GCTGGCCGAC GGCGCGACCG CCCCCGCGCC TGCGGATGCA GACAATGCCC TGCCCGCCAG CGGGGCGGGA CGCCGCCGCT ACATCGGCAC CCGCTACCAC TTCAACGACA CCTGGCGCAC CATCCTCGAA CGAAGGGCGG CAACGCCGCG CATCCACCCC GCCACGGCCA CCGGTGCCCC AGATGGCCCG CCAGTGCTGC TTTCCCCGGC GGCGCTGGCC GAAAAACGCC GCCAGATGGG GCCGTACGTG TTCGGCTGCC AGATGCTGCT GAACCCCGCC GCCGACACGG CCCAGGGCTT CCGCGCCCAA TGGCTGCGCC GCTACGCGTC CGATGCGGGC AACGGAGCCG ACCACGGGCG CGCATCCCCC CTGGCCCGCT GGGGCCACTG CAACCGGTAC CTGCTGGTGG ACCCGGCGGG CGAACGCAAG AAGGGCAGCG ACTACACGGT AATGCTGGTG GTTGGGCTGG CCCCGGACGG CAACCGCTAC CTGCTGGACG GGGTGCGCGA CCGGCTGAAC CTTACGGGCC GGGCCGCCGC GCTGTTCCGC CTGCACCGGA CATGGCGGCC ATTGGCCACA GGCTACGAAA AATACGGCAT GCAGGCCGAT ATCGAACACG TGCGCACCGA ACAGGAACGC CGCAACTACC GCTTCGACAT CACCCCGCTG GGCGGCCCCA TGCCCAAGAA GGACCGCATC CGCAGGCTGG TGCCGGAATT CGAGCAGGGC CGCATGCTGC TGCCGCACCG CCTGCCCTTC GTGGATGCCG AGGGCAAACG GCGCGACCTG GCGCGGGAAT TCGTGGACGA GGAGTATCTG GCCTTTCCCG TGTCCCGGCA TGACGACATG CTGGACTGCC TGGCCCGCAT TCTGGACCCC GACCTGGGCG CGGAATTCCC AGACCCGGAT TCCGGAGGGT TCGGTCCGGC AGGGCACGGC GCAGGCCATG GCGACCTTGG CTGCACGGAC GACACCGCGC ATATGGAGTA TCCCCTGTAT GGCTAG
|
Protein sequence | MPAPPMIPPA AAMPTTRRDT AALYADLAAD ATRSGPAALA AVQAELGRRD LFYLLTRLMN RTDMDRDWLF ERCREVQAAP DGHLDLWARE HYKSTIITFG QTVRDILNDP EITVGIFSHS RPVAKDFLGQ IKHEFEHNDL LKRLYPDVLW AAPRRQAPVW SLDKGIVVRR RGNPKEATVE AWGLVDGQPT GKHFRLLVYD DVVTRDSVTT PEMIAKVTEC WALSLNLGAS GVTGEDRGGG RAGNSGGDAR GKTGNKNGAA APADDTGLAD GATAPAPADA DNALPASGAG RRRYIGTRYH FNDTWRTILE RRAATPRIHP ATATGAPDGP PVLLSPAALA EKRRQMGPYV FGCQMLLNPA ADTAQGFRAQ WLRRYASDAG NGADHGRASP LARWGHCNRY LLVDPAGERK KGSDYTVMLV VGLAPDGNRY LLDGVRDRLN LTGRAAALFR LHRTWRPLAT GYEKYGMQAD IEHVRTEQER RNYRFDITPL GGPMPKKDRI RRLVPEFEQG RMLLPHRLPF VDAEGKRRDL AREFVDEEYL AFPVSRHDDM LDCLARILDP DLGAEFPDPD SGGFGPAGHG AGHGDLGCTD DTAHMEYPLY G
|
| |