Gene Dvul_1900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1900 
Symbol 
ID4664000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2214191 
End bp2215381 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content65% 
IMG OID639820141 
Productargininosuccinate synthase 
Protein accessionYP_967343 
Protein GI120602943 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.99384 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGTA TCAAGAAGGT CGTTCTCGCC TACTCTGGCG GTCTCGATAC GTCCGTCATC 
CTGAAGTGGC TCGCCGTCAC CTATAACTGC GAAGTGGTGA CCCTCACCGC CGACCTCGGT
CAGGAAGAGG ACCTCGACGG CGTGGACGAC AAGGCCATGC GCACGGGAGC CTCCCGCGCC
TACGTCGAAG ACCTGCAGGA AGAGTTCGCC CGCGACTTCA TCTTCCCCAT GATGCGCGCC
GGGGCGGTCT ATGAAGGCCG CTACCTGCTG GGCACCTCCA TCGCGCGCCC GCTCATCGCC
AAGCGCCTTG TCGAGATAGC CCGCGCCGAA GGCGCACAGG CCGTGGCCCA CGGCGCCACC
GGCAAGGGCA ACGACCAGGT GCGCTTCGAA CTGGCAGTGA ACGCCCTCGC ACCCGACCTG
CGCGTCATCG CGCCGTGGCG TGAATGGGAC CTGCGCTCGC GCACGCAGCT CAACGCCTTC
GCCGAAGAGC ACGGCATCCC CATCTCCAAT TCCGCCAAGC AGTACAGCAT GGACCGCAAC
ATGCTCCACT GCAGCTTCGA GGGCGGCGAA CTTGAAGACC CGTGGAACGA ACCCGGCCCC
AACAGCTACG TCATGGCCGT TCCCATGGAA CAGGCCCCCG ACGAGGCCGA GTACATCAGC
ATCGACTTCG AACACGGCAA CCCCGTGGCC GTGAACGGCG AACGCCTCTC GCCCGCCGCG
CTGGTCAAGA AGCTCAACAG CATCGGCGGA CGTCACGGCA TCGGCCGTCT CGACATGGTC
GAGAACCGCT TCGTGGGCAT CAAGTCGCGC GGCGTGTACG AGACCCCCGG CGGTACGCTC
ATCCACATCG CCCACCGCGA CCTCGAAGGC ATCTGCATCG ACCGCGAGAC CATGCACCTG
CGCGACGCCA TGCTGCCCCG CTACGCCGCA GCCATCTACA ACGGCTTCTG GTTCGCCCCC
GAACGCGAGG CCATGCAGGC GATGATCGAC GTCTCGCAGC AGCGCGTCAC CGGCACCGTG
CGCCTGAAGC TGTACAAGGG CAATGCGTGG CCCGTGGGTC GCCAGTCGCC CAACACCCTC
TACTGCCACG ACCTCGCCAC CTTCGAAGAC TGCGCCACCT ACGACCACAA GGACGCGGCA
GGCTTCATCA AACTGCAGGG TCTGCGCATC CGCGGTTACA AGAAGGGATA G
 
Protein sequence
MSGIKKVVLA YSGGLDTSVI LKWLAVTYNC EVVTLTADLG QEEDLDGVDD KAMRTGASRA 
YVEDLQEEFA RDFIFPMMRA GAVYEGRYLL GTSIARPLIA KRLVEIARAE GAQAVAHGAT
GKGNDQVRFE LAVNALAPDL RVIAPWREWD LRSRTQLNAF AEEHGIPISN SAKQYSMDRN
MLHCSFEGGE LEDPWNEPGP NSYVMAVPME QAPDEAEYIS IDFEHGNPVA VNGERLSPAA
LVKKLNSIGG RHGIGRLDMV ENRFVGIKSR GVYETPGGTL IHIAHRDLEG ICIDRETMHL
RDAMLPRYAA AIYNGFWFAP EREAMQAMID VSQQRVTGTV RLKLYKGNAW PVGRQSPNTL
YCHDLATFED CATYDHKDAA GFIKLQGLRI RGYKKG