Gene Dvul_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0844 
Symbol 
ID4663991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1035441 
End bp1036562 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content66% 
IMG OID639819066 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_966292 
Protein GI120601892 
COG category[R] General function prediction only 
COG ID[COG4174] ABC-type uncharacterized transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.367141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.038262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCCC GACAGTCATC CCACGGCGGG GCATGGGGCT ACATGCTGCG CCGCCTGCTG 
CTGGTGCTGC CCACGCTTCT CGGCATCGTC ACGATCAACT TCTTCGTGGT GCAACTGGCC
CCCGGCGGCC CGGTGGAACA GTACATCGCC CGTCTCGAAG GTGATGGCGC CGCCTACATG
GAACGCATCG GCGCAGGTGA TGGCGGCGAC ATGCAGCCCG CAGCCGACGA CGGGACAGCC
GCCTACAAGG GGGCGGCGGG ACTCAGCCCG CAGGCTGTGG AGGCCATCCG CCGACAATAC
GGCTTCGACC GCCCCATCCT CGAACGCTAC GTCACCATGC TGGGCGACTT CGCCCTGTTC
AGGTTCGGCG ACAGTCTCTT CAAGGGGCGC AGCGTCATCG ACCTCGTAGG TGACGCCATG
CCCGTATCGC TGTCGCTGGG ACTCTGGAGC ACCCTCGTCA TCTATGCCGT ATCCATCCCG
CTGGGCATGG CGCGCGCGCT GCGCCGCGGC AGCCGATTCG ACACCATGAG CGGCATCGCC
GTCATCGCGG CACACGCCAT CCCCGCCTTT CTGCTGGCGG TGCTGCTCAT CGTGCTCTTC
GCCGGGGGCA GCTACCTGCA ATGGTTCCCG CTGCGGGGGC TGGTGTCGCC GGGGCACGAC
GCGCTGCCTT TCGGGGCACG GGTGCTCGAC TATGCGCACC ACATGGTACT GCCCGTGACC
GCCATGGTCG TGGGCGGTTT CGCGGGGCTG ACCAGCCTGA CGCGCAACGC CTTTCTCGAC
GAACTCGGCA AAGCCTATGT GGAGACGGCC CTCGCCAAGG GCCTCACGCG CAAGGCCGTG
TTGTGGCGGC ACGTCTTCCG CAACGCCATG CTGCTGGTCA TCAGCGGGCT GCCCGGTGCC
TTCGTGCGGG TGTTCTTCAC CGGTTCGCTG CTCATCGAGA CCATCTTCTC GCTCAACGGC
CTCGGACTCA TGGGGTTCGA AGCCGCCATG CAGCGCGACT ACCCTGTCAT GTTCGCCTCT
CTCTATGTCT TCACGCTCAT CGGTCTGACG GCATCCCTTG CCGGAGACAT GCTCTGCATG
GCCGTTGACC CGCGCATCGA CTTCGAAAGG AGGGCGGCAT GA
 
Protein sequence
MRARQSSHGG AWGYMLRRLL LVLPTLLGIV TINFFVVQLA PGGPVEQYIA RLEGDGAAYM 
ERIGAGDGGD MQPAADDGTA AYKGAAGLSP QAVEAIRRQY GFDRPILERY VTMLGDFALF
RFGDSLFKGR SVIDLVGDAM PVSLSLGLWS TLVIYAVSIP LGMARALRRG SRFDTMSGIA
VIAAHAIPAF LLAVLLIVLF AGGSYLQWFP LRGLVSPGHD ALPFGARVLD YAHHMVLPVT
AMVVGGFAGL TSLTRNAFLD ELGKAYVETA LAKGLTRKAV LWRHVFRNAM LLVISGLPGA
FVRVFFTGSL LIETIFSLNG LGLMGFEAAM QRDYPVMFAS LYVFTLIGLT ASLAGDMLCM
AVDPRIDFER RAA