Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_3049 |
Symbol | |
ID | 4661993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008741 |
Strand | - |
Start bp | 128039 |
End bp | 129379 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639813969 |
Product | hypothetical protein |
Protein accession | YP_961248 |
Protein GI | 120586903 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.623992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACG CGACACGACG CGGAACGCGC AACGACACGG TGCAGGCGTT GAAGCAGCTG TTCCTGCTGG TCATCTGCTA CGGGCACATC TTCGCCGAAG GCCCGGCTGC GACCCTGCCG GGTTCGACGC TGATGGATGG CCTGCCCATG CGCCTGTGGT GGGTTCCCGG CGATGTGGGG CTGTTCTTCT TCACCGCCAC GTCGGGGTAC TTCACCGCGC TGCGCTACCC CGGTGCGGCG ATGATGCAGG GCTACTGGCG GCGCAAGGTC TCGCGCCTGT GCGGGCTTTT CCTGTTCCTC AACGCGGTGC TGGGCTGCGT GTTCCTCGCC ACCGGGCGCG AGGGCGTCTT CACGTGGGAT GCGCTGGTGA ACCTGCTTGG CCTCAACGGT TTTCTCAACT GGTTCCATCT CGGCAACGAC AGCCCCTTCG GGGCGGGGCA GTGGTTCTTC ACCCTGCTGT TGCTGTTCTA CCTCGTCTAT CCGGTGATGA ACCGTGCGGT GACCACCCCG GCGCGCGGGC GGTTGCTGCT GTGGGGTTGC GTGGCGCTGG CGGCGGTCAT GCAGGTGCGT CTTCCCTACG GGCATTCGCT GTGGTCGACA TCGGTGGGCT TCGTGGCGGG CTTCTGGCTG GCGCGCTTCG GCGGTTCGCA CCCCGCACGC AGCGGATGGT TCGCCCTGTG CGCCTGCCTC GCAGGGGCGG TGGCGTACCG CTTCATCGGG CATCTGCCCG TGCTGGCGTA CTCCGTCATC GGCCTTGCCG GGTACGTCTG TACCGTGCTT GCGCTGACGG TGCGTCTTCC CCTGCTGCCC GCGAGGGTCC TCGATGCGGT GGGCGACCTG ATGCTGCCCC TCTACATCAT CCACACCTAC TTCAGGCTGC CGCTGGCGCA GGACCCGTAC CTCGACGCCT TGCTGGTGCT GACCATGAAC GCGGGCATCG CGTGGGTGCT GCTGAAGGCG TACGCCCTGC TTGCGGGACG GCGTCGGCAT GCGGGGCCGC CCACCGTCGC CCCGGTGGGG GCTGGCATGG CGGGTTCCGG TGTGCCCGGT CCCGGTACGG GCGGCAGGGC GAAACCCGTG GCCAGCCCCG CACTGGCGGC GAAAGCGCAC CATGCGGCGA ACGCCGAGAC ACCGTCCCCG CAGGCACCAG CCACCCATGA GCCTGCCGTG CAGGCGGCTT CGGGGCTTAC CCCGGCTGGG CAGACACCAT CAGGCCAGAC GCTTTCTGCT CAGGCGCTTT CCGGCCAGAC GTATTCCGGT CAGACGCTTT CCGGCCGGTC GTCTCCGGGG CAGCAGTCGT CGCCCCTGCC CGCTGGCACC AAGGGGGAGG GCACACCATG A
|
Protein sequence | MTDATRRGTR NDTVQALKQL FLLVICYGHI FAEGPAATLP GSTLMDGLPM RLWWVPGDVG LFFFTATSGY FTALRYPGAA MMQGYWRRKV SRLCGLFLFL NAVLGCVFLA TGREGVFTWD ALVNLLGLNG FLNWFHLGND SPFGAGQWFF TLLLLFYLVY PVMNRAVTTP ARGRLLLWGC VALAAVMQVR LPYGHSLWST SVGFVAGFWL ARFGGSHPAR SGWFALCACL AGAVAYRFIG HLPVLAYSVI GLAGYVCTVL ALTVRLPLLP ARVLDAVGDL MLPLYIIHTY FRLPLAQDPY LDALLVLTMN AGIAWVLLKA YALLAGRRRH AGPPTVAPVG AGMAGSGVPG PGTGGRAKPV ASPALAAKAH HAANAETPSP QAPATHEPAV QAASGLTPAG QTPSGQTLSA QALSGQTYSG QTLSGRSSPG QQSSPLPAGT KGEGTP
|
| |