Gene DhcVS_235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhcVS_235 
Symbol 
ID8657179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDehalococcoides sp. VS 
KingdomBacteria 
Replicon accessionNC_013552 
Strand
Start bp224017 
End bp225273 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content54% 
IMG OID 
Productsubtilisin-like serine protease precursor 
Protein accessionYP_003329725 
Protein GI270307667 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATATG CAGTGATATC GAAAGGAATC AGTCCGGTAC AGCTCGAAAC TGAGGTGAAG 
AAGGTTGGCG CCGGAAACAT AGTTAAGACA AAGCTTGTGG GACAGCTCTT CTGCGAACTG
GATAATGACC AGGCCAGGAT ACTTGCCAAA GTCCCTGGAC TGATCCTCAA AGAGATCAAG
GAGTACAAAA CCCAGCAAGT GCTAGCTCAG GCTCCGGCCG CGGAGTCTAT TTCTGACGTC
TTCTACCTGC TGCGCTCATA CTTCAACCCG CCGCTTACCG GTACGGGGCT TACCGTTGCA
ATGCTGGATA GCGGTATTCG CAAAACACAC GAAGCGCTGC AGAACAAAGT GGTATTCGAG
GTAAACCTGA CCGACTCTCC CTCCGCTACC GATGTCTTTG GGCACGGCAC CCAGGTAGCC
TTTGTCATCG CCGGTGGTCT CCATGCCATG GGCAGTAAAG CCGGTGTTTC GCCGGGTGCG
ATCCTGATGA ACATCAAGGT CATAAGCGAT GAAGGCATCG GTAACGACGA GGATATCGTT
CTGGGCATCG ATAAAGTCTG CGAGCTGGCC GAGGACGCCA GGAAGAAAGG ACTATGGCCC
ACGGACGATA TGTACCCGAA TGTTATAAAC CTCAGCCTTG GCGCAGAGGA TGATGGCGAC
CCGGATAATC CGGTGAGAGC CGCCTGCCGT AAAGCCAGTA CCGAATACGG GCTGGACGTC
GTTGCCGCGG CCGGCAACTC AGGTCCCGAT ATGACCACTA TTATGCTTCC TGCCTGTGAC
CCTGAAGTGG TGGCAGTCGG AGCCCTCGAA ACCATTGGTG ATCTCGTTAT CTGGGAGAAG
TCATCGCGAG GGCCGACTGT TCAGGGTGAA ACAAAACCTG ATTTCGTCAT CTGGGGCACA
AGCCTCGAGA TGGCTAGTGA AAAAGCTGAC GATGAATACG TCGTCAAATC GGGAACGAGC
TTTGCCGCTC CGATGCTTTC CGGACTAACC GGCTTGCTCT GGGAAAGTGG CCGGAGGGCG
TATGGGGAAA GCTGGCTGTT CCGCTGGACA GATGCCCGGC AGTTTGGTCC CTATTTTTCA
ACCAAACCGG CGGATGCTCC CCTGAACAAA GATAACGCTT ACGGTTATGG TCTCCCGGCT
ATGGGAACGA TGCTGGGCGA AGTGGCCCAG GTAAGTACAC CAAGCCAGGG AGTAAACGAT
ATGTTCCCGA TGGTAATGAT GATGGCGATG ATGTCCGCTC TGACAGGAGG TTTTTAA
 
Protein sequence
MRYAVISKGI SPVQLETEVK KVGAGNIVKT KLVGQLFCEL DNDQARILAK VPGLILKEIK 
EYKTQQVLAQ APAAESISDV FYLLRSYFNP PLTGTGLTVA MLDSGIRKTH EALQNKVVFE
VNLTDSPSAT DVFGHGTQVA FVIAGGLHAM GSKAGVSPGA ILMNIKVISD EGIGNDEDIV
LGIDKVCELA EDARKKGLWP TDDMYPNVIN LSLGAEDDGD PDNPVRAACR KASTEYGLDV
VAAAGNSGPD MTTIMLPACD PEVVAVGALE TIGDLVIWEK SSRGPTVQGE TKPDFVIWGT
SLEMASEKAD DEYVVKSGTS FAAPMLSGLT GLLWESGRRA YGESWLFRWT DARQFGPYFS
TKPADAPLNK DNAYGYGLPA MGTMLGEVAQ VSTPSQGVND MFPMVMMMAM MSALTGGF