Gene Dvul_2123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2123 
Symbol 
ID4662114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2465698 
End bp2466978 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content66% 
IMG OID639820366 
Producthypothetical protein 
Protein accessionYP_967566 
Protein GI120603166 
COG category[S] Function unknown 
COG ID[COG2881] Uncharacterized protein conserved in archaea 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00645364 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCATAC GTTGTCCTGA ATGCGGCTTC GAGAGGGAGG TGGACGAAGG CAGGCTGCCC 
CCTTCCGCAG CCATCGCCAC CTGCCCCAAG TGCCGTTGCA AATTCCGCTT CCGCGACCCG
GAAGCCTCCA TACCGCAGGC CGTGACCCGC GATTCCCATG CGACCTCCGC ATCCCCCGCG
TCCCAAGAGT CTCATGGCGC GACGGGCCCC GCCAGTGCGT CGCCCGAAGT CGTCGATGCG
ACGGCAGCCC CCTCCGGGAT GCCCGCTGCC AGCGACCGGG GCGACACGCC CGCATATCCC
GAACAACCGG TACAGACGGA CACTCCCGAC GCACCAGCTT CAGCACCCGC TGCTGCCTCC
ACCTCCGAAG ACAGCGGTGA CGACCCGCTG CCCCCCGGCG CGGTCATCCC CGGAGCGCCC
CGCCACGAGG CACCGGAAGC GTCCCCCGAG GCCGACCGCA CCGACAGGGC AGTGCCCCCG
TACGTGCAAG GACGTGCCGA CGACAATGCC TCGCCTGCCT CCACCAAGGG CAAGGGCGAC
ACCAAGGGAG ATGTGTGGGA CGCCGTCTCA TCCGTAGGCG ACCGCTGGCG CAAGCTCTAT
GATACGCATA TGGCGCAGGG CACACCTCCC AATCCGGAAG ACGGCACCGG GCAACCCCGC
GAGGGCATCC CGTGGGAGAA CCTCGACCGC CATGGCTTCT TTCCCGGGCT GTACCAGACC
ATTCTTCGCG TCATGTTCGG GGCGCCCCGC TTCTTCACGC AAATCGGTTC CGACGGGCCG
TCCATGCGGC CGGTGGCCTT CTTCATCCTG CTCGGCATCT TCCAGTCGCT GATGGAACGG
CTGTGGTACA TCACCACCTT CAACATGCTT GGCCCGAGCA TCGACGACCC GCAATTGCAT
GCGCTTCTGG GCGGCATCGC GCAGGAGTTC GGCATCGGGG CCACGCTGAT GCTCTCGCCG
TTCACTCTCA TCCTGCAACT GGTCTGCGTC ACCGGGGCCT ACCATTTCAT GATGCGCCTC
GTGCAGCCCG ACAAGGCTCA CTTCGGTACC ATGCTTCGGG TGGTGAGCTA CAGTGCGGCC
CCCACGGTGG TGAGCATCGT GCCGCTGCTG GGGCCCACGG TCGGTTCGCT GTGGTTCGTG
GCGTGCACCG TCATAGGTGT CAAACACGCC TACAGGCTGC CATGGAGCAG GGTTCTGCTG
GCACTTGGCC CGTTGTACAT CCTCGCCATC GCCGTGGGTG TCCAGATGCT CAAGATGGTC
GTCGCGGGCG GCGGAGCCTA G
 
Protein sequence
MLIRCPECGF EREVDEGRLP PSAAIATCPK CRCKFRFRDP EASIPQAVTR DSHATSASPA 
SQESHGATGP ASASPEVVDA TAAPSGMPAA SDRGDTPAYP EQPVQTDTPD APASAPAAAS
TSEDSGDDPL PPGAVIPGAP RHEAPEASPE ADRTDRAVPP YVQGRADDNA SPASTKGKGD
TKGDVWDAVS SVGDRWRKLY DTHMAQGTPP NPEDGTGQPR EGIPWENLDR HGFFPGLYQT
ILRVMFGAPR FFTQIGSDGP SMRPVAFFIL LGIFQSLMER LWYITTFNML GPSIDDPQLH
ALLGGIAQEF GIGATLMLSP FTLILQLVCV TGAYHFMMRL VQPDKAHFGT MLRVVSYSAA
PTVVSIVPLL GPTVGSLWFV ACTVIGVKHA YRLPWSRVLL ALGPLYILAI AVGVQMLKMV
VAGGGA