Gene DhcVS_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhcVS_1043 
SymbolargG 
ID8657974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDehalococcoides sp. VS 
KingdomBacteria 
Replicon accessionNC_013552 
Strand
Start bp969154 
End bp970374 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content52% 
IMG OID 
Productargininosuccinate synthase 
Protein accessionYP_003330488 
Protein GI270308430 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000173868 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAA AAGTTGTTCT GGCATACTCC GGCGGGTTGG ATACCTCAGC CGCTGTGAAA 
TGGCTTCAGG AAAAATACGG CATGGATGTT ATTGCCGTTA CTATAGATGT GGGTAATGAA
AAAGATTTCA CCCTGATAAA GGAAAAAGCC CTCAAAGTAG GCGCTAAAAA GGCCTACGTA
CGGGATGTGT GTAAAGAATT TGCCGAAGAT TATATCTGGA AGGCTATTAA AGCTAATGCC
ATGTACGAAG GTGTTTATCC GTTGGCAACC GCTCTTGCCC GCCCCCTGAT TGCCAAGGTT
ATGGTAGATA TAGCCCTTGA GGAAGGGGCT ACCGCCATTG CCCATGGCTG TACCGGCAAA
GGCAATGACC AGGTACGTTT TGACGTGGGC ATAAACACCC TTGCCCCCCA TTTGAAGATT
ATTGCCCCCG CCCGCCAGTG GGGCATGACC CGTGAGCAGA CTATGGAATA CGCCCAGAAA
TGGGGTATTC CCGTACCTAT TTCAGTCAAA AACCCTTTCT CCATAGATGA AAACCTGTGG
GGGCGGAGTA TAGAGTGCGG TCTGCTGGAA GACCCCTGGA ACGAGCCTAT TCCCGAAGTA
TTTGCCTGGA CCCGCCCTGT GGAAGCAACC CCGGACGCAC CTGAGTACCT GGAAGTAGAG
TTTGAACAGG GCGTGCCGGT AGCTGTAAAC GGGGAAAAGC TGTCTCCTTT GGCACTTATA
CAGAAAGTGC ATGATATTGC CGGTCTGCAT GGGGTAGGCC GTATTGACCA CGTGGAAAAC
CGTCTGGTAG GCATTAAATC CCGCGAGATT TATGAAGCCC CTGCGGCGGT GGTGCTGATT
GCCGCCCACC AGGCTCTGGA AGCCATGACC CTTTCCAAGA GCCAGTTACG CTTTAAGCAG
ATGGTGGAAG CCACCTATTC GGATATTATT TATAACGGGC TGTGGTTCTC TGCCCTGCGA
CAGGATTTGG ACGCCTTTAT AGACTCCAGC CAGCGCTTTG TCAGCGGCAC AGTTCGCTTA
AAGCTTTCCA AGGGCAGCTT CCGGGTAGTG GGACGCAAAT CCCCCTATTC CCTGTACCAC
AAGGGCATGG CCACCTATGA TAAGGGAGAC CAGTTTGACC CGTCTTCGGC AGTGGGTTTC
ATAACCCTGT GGGGACTTCA GGCCAAACTG CAGGCCCAGC TCCAGCCTAT TCTGGGAGAA
GAAAAGGGGA ATAAATCCTA G
 
Protein sequence
MSEKVVLAYS GGLDTSAAVK WLQEKYGMDV IAVTIDVGNE KDFTLIKEKA LKVGAKKAYV 
RDVCKEFAED YIWKAIKANA MYEGVYPLAT ALARPLIAKV MVDIALEEGA TAIAHGCTGK
GNDQVRFDVG INTLAPHLKI IAPARQWGMT REQTMEYAQK WGIPVPISVK NPFSIDENLW
GRSIECGLLE DPWNEPIPEV FAWTRPVEAT PDAPEYLEVE FEQGVPVAVN GEKLSPLALI
QKVHDIAGLH GVGRIDHVEN RLVGIKSREI YEAPAAVVLI AAHQALEAMT LSKSQLRFKQ
MVEATYSDII YNGLWFSALR QDLDAFIDSS QRFVSGTVRL KLSKGSFRVV GRKSPYSLYH
KGMATYDKGD QFDPSSAVGF ITLWGLQAKL QAQLQPILGE EKGNKS