Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DhcVS_118 |
Symbol | elsH |
ID | 8657072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides sp. VS |
Kingdom | Bacteria |
Replicon accession | NC_013552 |
Strand | + |
Start bp | 116219 |
End bp | 118138 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | |
Product | endonuclease/exonuclease/phosphatase family |
Protein accession | YP_003329619 |
Protein GI | 270307561 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCTTT CCATGTTCAC AGTCGCTGCC CCCGTAGCTG CTGCCAACCA GGATTGGAGC AAAATCTCCC TGCCCGGCTC CGGCGCAACC GGCGGCTATG TAGTCAGCAC TCCTAGCGTT CGCTTTGACA ACATCGGCAT TTTGGAATGG AACGCAGATA AATCTGCTGT CTATGCCACT GCCGAGTATG ATGGAGTTAT GTGTGTATTT AAATCTACTG ACTCCGGTCG CTCTTGGAAG ATGGTCTTTG AAGACGGTGC TGTTTATACT TTGGCAGCTA GTGGCCGTAT TTTTGATATG GTCGCTTCTA GCGTTAATGC CAATGAAGTT TACTTCACTG ACGGTCATGA CATCTTTACC ACCAAAGATG GTGGTGCAAC TTGGAACAAA ATGTCAAACC TTTATACTTA TGCGGCTTCC AATTTTGTAA CAGCTATTCC TACTGGTATC ATTGTTACTT TGGACACCGC TGTAGTTGGC GCTACTAAAT ACATCTTTGC AGGTACTGCT TCCCTTGGCG TAGCTCCGGG CGCTAACGGC GTGTATATGT GCCAGGAAAC CGCCTTCGGT ATGTCTTGGA CCGATCTTGA TGTTATTAAT CAACGTGAAA CCGCTTGGGG TGGTTGTAAT GTTTGGGATG TAGTTGTTGA TCCCACTGAC TTCGCCACCA AACAGGGTGT TATTGCTGTT GCTGAAGACA CCACTGTGGG TTCCGAAGCT ACTTACATAA CCGCCAAATA TTTTGGCAAC CAGTGGAATA TGCCTTCGAC CTGCATTGAT ACCGCAGTCG AAGAAAACAA CACTGATTCT ATCACCGGTA TTTACCATGC CGAACTCTGG CTGCCCTCTG ACTTCAACTC CAACGTAGCC TCCGGCAAAT TCCAGGGTTG GCTTGCTCTG CAGACTTCTG CTGGTCAAGG TGATGTGTAT TTGTGGTATG GTAATACCTT AACAACCATA GACCTTAACG TCCGTGGTTA TTCTGCTGGT GTTGGCACTG CTACAGAAGT AACCGATATT GATGGTATTG GCGGTATTGC TGATGCCAGA TTGATTATTG GTGGTTATAA TATCGCAACT GCTTCTGCAC CTCTTACTTG GTATTCTTCT GATCGCCTTA CCTTCTCAGC TAATGTTAAA GCCCCTACCG GTGCTGCAAG CTCTTGGGGT CTCTCTGTTG CTACTGTAAC CGTACTTGCC ATGGGTGACT ACAATACTTC CGGCAAGGCT CTTGCTGGCA CTGCTGGCTG GGATTGTGGC GTTTCTTACA CAGCTGATTT TGGCAAAACT TGGAATACTA TTTCCATGAT GCGTCAGGCT ATTGTTAGCG TTGACAATAT GGCTGGTAAT TTATTTATTG TTACTAGTGA TTATGCCGAA GAGAGCGTTT GGCGTTTCTC TGATGGTGTT TACGAGCGTG TCTTCTCAAC CTCTATGGTT AGTGAAGATG GTACTACTAT TGATGTAGCT GTTTCACCCG CAGGTGATGC TTTATTCGTA GCGACCATTA ATGGTACTAT GGTTTGGCGC TCACTGGTTG ATCAGACTAC TGGTAAATTG GGTCAAATAT TTACCGCTCA GGTAAGCAGC ATAACCACAG TTGATAGTGC TGCTCTTATC AATTCATGGT TTGTTTCCAA TGCCAACACT ATCCTGATTG GCGGCAATAA TGTTGTCTAT CAGACCAGCA CCAATGGCGT TCTCTGGTTT ATCCGCACCT GTTCTGTTGG CATTGTTACA TCCTTTGCCG TTTCAACTGA CGGTTCTACT CTGGTAGCCG GCGGTTCCAC TGGCGCAATC TCCAAATCAG TTGATGGTGG CATTACTTGG GGTACCGCTA CTGCTGCTAC AACTATTACT TCATCCGTAC CTGTTGTAAC TTTCCAGAAT GGTTCTAACA GCGTAGTCTA CCTCACTGGC GAAAATGGTG GCGCATTCAA GTATGACTTT GCCGCTACCA CTCCTGCCTG GGTACAGATT GATAACGCTG TTACCACTGG CGAAACCTTT GACAGCGGTG CCTATGCTGA TGTTGATAAC GGCGTAGGTA TCTTCTCCGG TATTGCTTCC GGCGGAAACG TCCTGTATGT ACTGGATGGT ACTTGGGACA ACGTAATCCG CATTACCAAT GTCGGCACTT CCTACAAGGC CGGTATAATT CCTGCTCCCG CTACCGCCGG TTCTGCCACC AAACTGATTG GTGCTGCTGC CCCCGGTGCT AACGTGCTTT ACGGTATCTT TGATGACGAA CTGTATGTTT ACACCGACAA GATGGCTGTA CCCGTAGCCA ACGTTCAGGC CACCAAGATT ACTACCACTG GCTTCACCAT CACTTGGGAT GCTTTGAATG TTGATAATAG CAATATTACA GTAGAGTATT ACATCGTAAT TACCGATGCT GCTACTGCTG TTAAAGTTTG CTATGGTCTT CCCACTGGTG CTGGTACAAC CACCGCTACT ACTCTTAAAG TAACCGGTCT TGACGAAGAC ACCGTTTACC AGATAAGCGT CTGGGCTGTT GATCCGGTCA TGAGCTTCGT GGGTACTGCT TCAGTAGCTA CCCAGCCCGA AAAACTGACC TACACCTTCA ACCTGATGCC CACCAACGGT GCTTCCAATG TACCCGTTAA ACCCGTATTC GCCTGGGATT CAGTTGTTAC TGCTGTTAGC TATGACCTCG TACTCAGCAC CGATCCTACC TTCGCTGATG CTACCAAGGT TTTGGCTACC AAGAACCTGA CCACCAATTA CTGGGCCTAT GACGGCACTT TGAGCAACTC AACCAGCTAC TACTGGAAGG TTCGCGTCAA TACCGCCAAC AGCACCAGTG AATGGTTCCC GGCTGTCTTT ACCACCGTTA AGGCTGATGC AGCCCCGGTA GAGGTCAATA ATCCTCCGGC TATCACTTTG ACCGTACCTC AGGCTGAAAC TCCTGCCTAC ATCTGGCTGA TTGTGGCTGT CGGTGCTGTA CTGACCGTCG CTGTAATCGT CCTGATTGTT CGCACTCGCC GCGTAGTCTA A
|
Protein sequence | MKGFLQAKPG AVEFLLPAMV LTLAISLFPL FSSGMTWILG DRFGQGAGVL GLIALVIFGL SFLAKPLRRF LCSYKAIIFS AGGVAIFSLL AQIGFAEPLI NFICSALGLS LFAVFLAVYL DSARVRGASS VGHFGAGVIF GLLLNAALSA GFSTYDLVNQ PSVLPLLISA VIAAFILFLL SVHMPLAEGP AASNNGLGWL VIGPFLFLEL VVFGNIARLS ALSGFSSPVT SIFVLGALSL GLVGILWVFS LHQRHIRLLS MIASLFLILS LTGISGGVAA ISLAQHFLGQ LAVILLFGVI LRYIGGRKAD GVGESLNLPN GLGMILFVIL LLGYYAVYQV AVPYDNTVLE LVAGLMVVVL GLYSGRECLP LLDFKGERFI LPAMSILLLT LPLLGLLTYK TPPAAPQFSG TLRIMTYNLH NGFNTQGRLD MEALVRVIEN SGADVVALQE ISRGWVISGR VDMLEWLAQR LNMYSAFGAT AGEYWGNAIL SKYPILDTHN VSLESDGLPI KRGYLNAVLD LGGRYLYLAA THLHHVPEEG DVRLIQAGEL ADFWDNAPST VILGDFNAEP NSAEIGLLRQ AGLSDSLEGQ TSVLTYHSAD LYQRIDYIWA SPDIVYVDSF TIFSLASDHL AVIADIRLS
|
| |