Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DehaBAV1_0165 |
Symbol | |
ID | 5131575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides sp. BAV1 |
Kingdom | Bacteria |
Replicon accession | NC_009455 |
Strand | + |
Start bp | 162990 |
End bp | 165971 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640529068 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_001213634 |
Protein GI | 147668816 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.410725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTCTC AGTTTAGTAG GCGAGATTTC CTCAAGATCA GCGGAGGGAC CGCGGGTCTC TTTGCCACAG CTGGTCTTTT CCGTGGACCC ATTAAGGGTG TTGAGCCCAA AATGGTTGAC ACGCGCCCCC GGTGGGTAAA AGAAACCACC ACCATTTGTC CCTTTGATGC CAGCGGTTGC GGATTTATCT GCTATACCGA TTCGGCCGGC AACCTCACCA ATATGGAAGG TGATCCAGAC AATCCCGTAA ATCGGGGGAG TGCCTGTTCC AAGGGTGCTT CACTGATTCA ACTTCACAAT AATAAACACC GTTTAGAAAA GGTGCTTTAC CGCGCCGCCG GTGCGAATGA TTGGGAAGAA AAGTCATGGG ATTGGGCGTT TGGTGAAATT GCCAAAAAGG TAAAGGCCAG CCGTGACAGC GGTTTTGTAG CTAAAAATGC TGCCGGCAAG ACCGTAAACC GCACCGAGGC TATTGCCTCG CTGGGCGGAG CTACTCTGGA CAACGAAGAA TGTTATCTTC TGTCCAAGAT GCTTCGTGCC CTGGGCATTG TCTACCTGGA TAGCCAGGCC AGACTGAGCA CCGCTTCCAG CCTGGAAGCT TTGGCAGCTT CGTTCGGACG AAACGCCATG ACCAACAGCT GGACCGATGT TTCAAACGCC AACGTCATTC TGGATATGGG TGGCAACCCA GCTGAAAACT ACCCTGCCTG TTTCTCCCAT CTGGGCGAAG CTATGGGCAA AGGTGCAGAG CTTATCAGCG TTGACTGCCG CCTGACCCGG ACTGCCGCCA AGGCCGGCAC ATTCGTTGCC GCCCGCTCCG GTTCGGAGAT TGCCTTTATC GGCGGGCTTA TCAAATATGT CATTTCTGAC ATTGAAGCTC ACCCCGCAAA TTACAACCTG ACCTATATTA CCGAATACAC CACTGCCAGC ATGGTAGTAA ACTCCAACTT CAAGGGTCCG GCTGATTTGG ACGGCATGTT TGTAGGTTAC AACACCGCCA GCCATTCATA CGACCAATCA AACTGGAACT TTACCAAGGC CAGCAACGGC GAACCGGTAA TAGACAAAAC CCTTACCAAC CCCAATTGTG TCTTCCAGCT TTTGAAGAAG CAATATGCCC GTTACACTCC GGAAATGGTG TCCGCAACCT GTGGCTGCTC TCAGGCTGAC TTCGCAAAGG TCTGTGCCTC TTTTGCCGCA ACCGGCAAAG CAGACAAGGC CGGCGCTATT GTCTACGCCA TGAGCTCCAC CCAGCACACC TCCGGCGCTC AGACAGTTCG GGCTTATGCC GTACTTCAGA TGCTGCTGGC CAATATCGGC GTATCCGGCG GCGGTTTATT CGCCCTTGGA GGTGAATCTA ACGTACAGGG TGCTGCCGAC AACGGTGTCT CATGGAACAA ACTGCCCGGT TATTTATCTG CCCCCAACGA AGACGATACC ACTTTTGCCG CCTATGCTTC TAAAGCCGGC GACCAAAGTG CAGCCAGCCT TTTGAAGGCC TGGTACGGCT CTAATGCCTC CAGCTCAAAC GGCTTCGGCT TCGGCTTCCT GCCCAAAACA GATAATAATA CCAATTACTC TTACATAAAC GCTATCAAAC AGATGGCTGC CGGCAAAATA AAGGGCGCTT TCGCCTGGGG TGCAAACCCG GTGGTTGAGA GTGCTGCTGC CGGTCAGGCT ACTCAGGCTT TGGCCAAACT GGACTGGCTG GTAGTGACAG ATATGTTTGA GTCTGAAACT GCTGCTTTCT GGAAGAAAGA CGGGGTGACT TCGCAGACCG AAGTCTTCCT GCTGCCCACT GCCTGTTCTT ACGAAAAAGA AGGCAGTGTA ACCAATGCTG ACCGCTGGAT TCAATGGCGA AACAAGGCCG TCAGCGCTTC CGGTGAAGCC AAATCTGAGC TTGAGGTTAT CAGCTCCCTG CTTAGCCGCC TTCAGACCCT TTATCAGGGC GAAAGCGGCC TCAATGCAGA AGCTATTACC AAACTCAGCT GGACTTACAG CACCAACCCC TCTGCGGCTG ATGTAGCCAA AGAGATGAAC GGATACAAGA TATCTTCCCT GCAGCAGCTT GAAAGTCCTG AAGACCTAAG GGCTGACGGA ACAGTTGCCT GCGGCAATAG CCTTTATATC GGCAGCTTTA CAAATGCCGG AAACCAAATG GCACGGCGTG AACAGACCGG TGCTTTCCAT GCCGGTTGGG CTTGGAGCTG GCCGATGAAT ACCCGCATAA TGAACAACCG CGCTTCAGTT GACCTCAGCG GCAAGCCCTT CAATGCGGCC AAACCCGTTA TCAGCTGGAA CGGCATTAGC TGGACCGGTG ACAATGTTGA CGGTAAAAAC GCTCCGTTTA ATCAGAGCGG CGGACTGCCC TTTAAAGGTA CCAATGGCAA TACCGCACAG CTCTTTGCCG CCATGGCTGA CGGCCCCATG CCTGAACATT ACGAACCTTG GGAAAGCCCG GTGGACAATG CTCTGTCCGG CACCCAGAAT AATCCGACCG CTATGGTCTT CGAGGGCGAT ACCCAGGGCA GTGCCGGTGA ATATCCGGTT ATCTGTACCA CTATACGCAC CGTTGAGCAT AGCTTGGGCG GCCAGTTAAC CCGCAATATG CCTTTCCTGG TAGAACTTGC CCCGGCTGCT TATGTGGAAA TCTCCGAACA GCTGGCCAGC GAAAAGGGTA TCAAGGCCGG TGATATTGTA AAACTCACTT CGGCCAGAGG CAGCGTATCA GTGGCTGCAG CCGTTACCAA ACGGCTTAAA CCCTTCAGCA TTGGCGGCAA AACTGTTCAC CAGGTAGCCA TACCCAATCA CTGGGGCTTT ATGGGAATTG CACAGGGTGA CAGTGCCAAC GTACTTGCCC CGAACACCGC AGACTCCAAT TCCTCCACTC CTGAGTTCAA GGCGTTCTTG GTCAAGGTTG AAAAAGCAGC CGACGGTACC GTACCCACCA TCACCGGACG CTATAAGGTA CTTGAGGACT AA
|
Protein sequence | MRSQFSRRDF LKISGGTAGL FATAGLFRGP IKGVEPKMVD TRPRWVKETT TICPFDASGC GFICYTDSAG NLTNMEGDPD NPVNRGSACS KGASLIQLHN NKHRLEKVLY RAAGANDWEE KSWDWAFGEI AKKVKASRDS GFVAKNAAGK TVNRTEAIAS LGGATLDNEE CYLLSKMLRA LGIVYLDSQA RLSTASSLEA LAASFGRNAM TNSWTDVSNA NVILDMGGNP AENYPACFSH LGEAMGKGAE LISVDCRLTR TAAKAGTFVA ARSGSEIAFI GGLIKYVISD IEAHPANYNL TYITEYTTAS MVVNSNFKGP ADLDGMFVGY NTASHSYDQS NWNFTKASNG EPVIDKTLTN PNCVFQLLKK QYARYTPEMV SATCGCSQAD FAKVCASFAA TGKADKAGAI VYAMSSTQHT SGAQTVRAYA VLQMLLANIG VSGGGLFALG GESNVQGAAD NGVSWNKLPG YLSAPNEDDT TFAAYASKAG DQSAASLLKA WYGSNASSSN GFGFGFLPKT DNNTNYSYIN AIKQMAAGKI KGAFAWGANP VVESAAAGQA TQALAKLDWL VVTDMFESET AAFWKKDGVT SQTEVFLLPT ACSYEKEGSV TNADRWIQWR NKAVSASGEA KSELEVISSL LSRLQTLYQG ESGLNAEAIT KLSWTYSTNP SAADVAKEMN GYKISSLQQL ESPEDLRADG TVACGNSLYI GSFTNAGNQM ARREQTGAFH AGWAWSWPMN TRIMNNRASV DLSGKPFNAA KPVISWNGIS WTGDNVDGKN APFNQSGGLP FKGTNGNTAQ LFAAMADGPM PEHYEPWESP VDNALSGTQN NPTAMVFEGD TQGSAGEYPV ICTTIRTVEH SLGGQLTRNM PFLVELAPAA YVEISEQLAS EKGIKAGDIV KLTSARGSVS VAAAVTKRLK PFSIGGKTVH QVAIPNHWGF MGIAQGDSAN VLAPNTADSN SSTPEFKAFL VKVEKAADGT VPTITGRYKV LED
|
| |