Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2123 |
Symbol | |
ID | 4662114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 2465698 |
End bp | 2466978 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639820366 |
Product | hypothetical protein |
Protein accession | YP_967566 |
Protein GI | 120603166 |
COG category | [S] Function unknown |
COG ID | [COG2881] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00645364 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCATAC GTTGTCCTGA ATGCGGCTTC GAGAGGGAGG TGGACGAAGG CAGGCTGCCC CCTTCCGCAG CCATCGCCAC CTGCCCCAAG TGCCGTTGCA AATTCCGCTT CCGCGACCCG GAAGCCTCCA TACCGCAGGC CGTGACCCGC GATTCCCATG CGACCTCCGC ATCCCCCGCG TCCCAAGAGT CTCATGGCGC GACGGGCCCC GCCAGTGCGT CGCCCGAAGT CGTCGATGCG ACGGCAGCCC CCTCCGGGAT GCCCGCTGCC AGCGACCGGG GCGACACGCC CGCATATCCC GAACAACCGG TACAGACGGA CACTCCCGAC GCACCAGCTT CAGCACCCGC TGCTGCCTCC ACCTCCGAAG ACAGCGGTGA CGACCCGCTG CCCCCCGGCG CGGTCATCCC CGGAGCGCCC CGCCACGAGG CACCGGAAGC GTCCCCCGAG GCCGACCGCA CCGACAGGGC AGTGCCCCCG TACGTGCAAG GACGTGCCGA CGACAATGCC TCGCCTGCCT CCACCAAGGG CAAGGGCGAC ACCAAGGGAG ATGTGTGGGA CGCCGTCTCA TCCGTAGGCG ACCGCTGGCG CAAGCTCTAT GATACGCATA TGGCGCAGGG CACACCTCCC AATCCGGAAG ACGGCACCGG GCAACCCCGC GAGGGCATCC CGTGGGAGAA CCTCGACCGC CATGGCTTCT TTCCCGGGCT GTACCAGACC ATTCTTCGCG TCATGTTCGG GGCGCCCCGC TTCTTCACGC AAATCGGTTC CGACGGGCCG TCCATGCGGC CGGTGGCCTT CTTCATCCTG CTCGGCATCT TCCAGTCGCT GATGGAACGG CTGTGGTACA TCACCACCTT CAACATGCTT GGCCCGAGCA TCGACGACCC GCAATTGCAT GCGCTTCTGG GCGGCATCGC GCAGGAGTTC GGCATCGGGG CCACGCTGAT GCTCTCGCCG TTCACTCTCA TCCTGCAACT GGTCTGCGTC ACCGGGGCCT ACCATTTCAT GATGCGCCTC GTGCAGCCCG ACAAGGCTCA CTTCGGTACC ATGCTTCGGG TGGTGAGCTA CAGTGCGGCC CCCACGGTGG TGAGCATCGT GCCGCTGCTG GGGCCCACGG TCGGTTCGCT GTGGTTCGTG GCGTGCACCG TCATAGGTGT CAAACACGCC TACAGGCTGC CATGGAGCAG GGTTCTGCTG GCACTTGGCC CGTTGTACAT CCTCGCCATC GCCGTGGGTG TCCAGATGCT CAAGATGGTC GTCGCGGGCG GCGGAGCCTA G
|
Protein sequence | MLIRCPECGF EREVDEGRLP PSAAIATCPK CRCKFRFRDP EASIPQAVTR DSHATSASPA SQESHGATGP ASASPEVVDA TAAPSGMPAA SDRGDTPAYP EQPVQTDTPD APASAPAAAS TSEDSGDDPL PPGAVIPGAP RHEAPEASPE ADRTDRAVPP YVQGRADDNA SPASTKGKGD TKGDVWDAVS SVGDRWRKLY DTHMAQGTPP NPEDGTGQPR EGIPWENLDR HGFFPGLYQT ILRVMFGAPR FFTQIGSDGP SMRPVAFFIL LGIFQSLMER LWYITTFNML GPSIDDPQLH ALLGGIAQEF GIGATLMLSP FTLILQLVCV TGAYHFMMRL VQPDKAHFGT MLRVVSYSAA PTVVSIVPLL GPTVGSLWFV ACTVIGVKHA YRLPWSRVLL ALGPLYILAI AVGVQMLKMV VAGGGA
|
| |