Gene Dvul_0956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0956 
Symbol 
ID4662522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1174517 
End bp1175710 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content65% 
IMG OID639819179 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_966404 
Protein GI120602004 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000351359 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.276703 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAAC TTTCCATCAG GAACCTGACC AAGATCTTCG GTCCACACCC CGAGAAGGCC 
CTCGGTCTTC TCGAGCAGGG GCTTGGCAAG GAGGAAATCC ACCGCCGCAC AAGCCATGCC
GTGGGTGTCG ACCGTGCCTC CTTCGATGTG GAGGAGGGCG AGATCGTCGT GGTCATGGGT
CTCTCCGGCA GCGGTAAATC CACATTGGTA CGCTGCCTCA ACCGCCTCAT CGAACCCACG
GCCGGAACCG TCACCGTCGA CGGCCGGGAC GTGACCTCCA TGCCCGTCGA CGAGTTGCGA
CGCCTTCGGC AACGCAGCTT CGGGATGGTC TTCCAGAACT TCGCCCTCTT CCCGCACCGT
ACTGTGCTGC AGAATGCCGC CTTCGGCCTA GAGGCCATGG GCGTGCCCCG TGCCGAACGC
GAGCGTCAGG CCATGGTCTC GCTCGAAAGG GTGGGGCTCG CAGAGTGGGC CGCATCGCGT
CCCGCGCAGC TGTCCGGGGG CATGCAACAG CGTGTGGGGC TTGCAAGGGC CCTTTCCCTC
GACCCCGACA TCCTGCTCAT GGACGAGGCG TTCAGCGCGC TCGACCCACT CATCCGGCGT
GACATGCAGG ACGAACTGCT GCGGTTGCAG GACGACCTGC AGAAGACCAT CGTGTTCATC
AGTCATGACC TCGACGAGGC CCTCAAACTG GGTGACCGCA TCGTGCTCAT GCGCGACGGG
GCGGTGGTGC AGATAGGCAC ACCCGAGGAC ATCCTCACCA ATCCTGCCGA CGACTATGTC
GCCCGCTTCG TGGGCGAGGC CGATGTGACC AAGGTGCTCA CGGCTGGCAG CGTCATGAAG
CGCTCCGAAG CCGTGGCGGT GCTCGGCATA GACGGCCCCC GCACCGCCCT GCGCAAGATG
CGGCGTAACG CCATCGCAAC GCTCTTCGTG CTGGACGAAC GGCACAGGCT GGTGGGGCTC
ATCACCGCAG ACGATGCGGC GCGCCTCGCC GCCGAGGGCG TACGCGAGCT TGGTTCCATC
GTCAGACGTG ACATCGCCAC GGTTCCACCA GAAGCCCCGG CTACGGAACT CATATCCCTC
ATGGCAGACC TGCCGCATCC GCTGGCTGTC GTGGACGAAC GTGGCAGGCT GGCTGGCGTC
ATCGTTCGCG GTCTGCTGCT GGGGGCGCTT GCCGAACGCG GAGGTGTCGC ATGA
 
Protein sequence
MSKLSIRNLT KIFGPHPEKA LGLLEQGLGK EEIHRRTSHA VGVDRASFDV EEGEIVVVMG 
LSGSGKSTLV RCLNRLIEPT AGTVTVDGRD VTSMPVDELR RLRQRSFGMV FQNFALFPHR
TVLQNAAFGL EAMGVPRAER ERQAMVSLER VGLAEWAASR PAQLSGGMQQ RVGLARALSL
DPDILLMDEA FSALDPLIRR DMQDELLRLQ DDLQKTIVFI SHDLDEALKL GDRIVLMRDG
AVVQIGTPED ILTNPADDYV ARFVGEADVT KVLTAGSVMK RSEAVAVLGI DGPRTALRKM
RRNAIATLFV LDERHRLVGL ITADDAARLA AEGVRELGSI VRRDIATVPP EAPATELISL
MADLPHPLAV VDERGRLAGV IVRGLLLGAL AERGGVA