Gene Information |
Locus tag | Bphyt_3023 |
Symbol | |
ID | 6282301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010681 |
Strand | + |
Start bp | 3408131 |
End bp | 3411079 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642622591 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001896639 |
Protein GI | 187924997 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
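The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3408131, 3411079)
print(length, protein_length_aa(length))  # 2949 982
```

Under these assumptions the result agrees with the Gene Length (2949 bp) and Protein Length (982 aa) fields.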
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0162015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0152175 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACG TGCTCAACTC TCCCACTGGC GGCTGCGGTT CCGGCAACTG CGCGTGTAAA TCGCAGGCGC AGCAACAGCG CGCGCCCCGC TCCTTCTTCG ACGACACCGA TTTCGGCACG CCCGAGCGTC ACGCCGATAT CGACATCACG CTGGAAATCG ACGGCCAGTC CGTCACCGTT CCCGCCGGCA CCTCAGTCAT GCGCGCCGCC GTCGAAGCGG GCGTCAACGT ACCGAAGCTC TGCGCAACCG ATTCGCTGGA ACCGTTCGGC TCGTGCCGTT TGTGTCTGGT GGAAATCGAA GGCAAGCGCG GTTATCCCGC TTCGTGCACG ACGCCGGTCG AAGCCGGCAT GAAAGTCCGC ACGCAAACCG ATCGTCTGCA GTCGTTGCGC CGCAATGTGA TGGAGTTGTA TATCTCCGAT CACCCGCTCG ACTGCCTCAC CTGCCCCGCC AACGGCGACT GTGAACTGCA GGACATGGCG GGCGTAACGG GTTTGCGCGA AGTGCGCTAC GGCTTCGACG GCGCGAATCA CCTGAAGGAC AAGAAAGACG AGTCGAATCC GTACTTCACC TACGACGCGT CCAAGTGCAT CGTCTGCAAT CGCTGCGTGC GCGCGTGTGA GGAAACGCAA GGCACATTCG CGCTGACCAT CGCCGGACGT GGTTTCGAAT CGCGCGTGGC GGCCAGCGAA AACGTGCCGT TCATGGAATC GGAATGCGTG TCGTGCGGCG CGTGCGTGGC CGCGTGCCCG ACCGCTACCT TGCAGGAAAA GACCGTCATC ATGCTCGGTC AGGCCGAGCA TTCGGTCGTC ACCACCTGCG CGTATTGCGG CGTCGGCTGC TCGTTCAAGG CGGAGATGAA GGGCAACGAG GTCGTGCGGA TGGTGCCGCA CAAGAACGGC CAGGCCAATG AAGGCCACGC CTGCGTGAAG GGCCGCTTTG CCTGGGGCTA CGCGACGCAC AAGGACCGCA TCACCAAGCC GATGATTCGC GCGAAGATCA CCGATCCGTG GCGCGAAGTC AGTTGGGAAG AAGCGCTCAC GTATGCGGCC TCGGAATTCC GCCGCATTCA GGCCAAGCAT GGGCGCGATT CGATCGGCGG CATTACGTCG TCGCGTTGCA CGAACGAAGA AACGTACCTG GTGCAAAAGC TCGTGCGCGC GGCGTTCGGC AACAACAACG TCGACACCTG CGCGCGCGTT TGCCACTCGC CGACTGGCTA TGGACTGAAG ACCACGCTCG GCGAATCGGC GGGCACGCAG ACTTTTGCGT CCGTGGATAA GGCCGACGTG ATTATGGTGA TCGGCGCGAA TCCGACCGAC GGCCATCCGG TGTTCGGCTC GCGTCTGAAG CGGCGTGTGC GTGAAGGCGC GAAGCTGATC GTCGTGGATC CGCGGCGCAT CGATATCGTC GATACGCCGC ACGTGAAGGC CTCGCATCAT CTGCAATTGC GGCCGGGCAC CAACGTGGCC GTCGTGAATG CGCTGGCGCA CGTGATCGTC ACCGAAGGCC TCGTCAATGA AGCGTTCGTC GCCGAGCGCT GCGAAACGCG CGCGTTCGAG CAGTGGCGCG ATTTCGTCGC GCTGCCGGAG AACGCACCCG AGGCAACCGA GGCTGTGACC GGCGTGCCGG CTCAATTCGT GCGCGAAGCC GCCCGCATCT ATGCGACCGG CGGCAACGCC GCGATCTACT ACGGCCTCGG CGTCACCGAA CACGCGCAAG GTTCGACGAT GGTGATGGGC ATCGCCAACC TCGCGATGGC GACGGGTAAC ATCGGTCGCG AAGGCGTGGG CGTGAATCCG 
CTGCGCGGCC AGAACAACGT GCAGGGCTCG TGCGACATGG GCTCGTTCCC GCACGAACTG CCGGGCTATC GCCACATCAG CGACACGGTC ACGCGCACGC TGTTCGAAGA CGCCTGGAAC GTCACGCTGC AACCGGAACC GGGCCTGCGC ATTCCGAACA TGTTCGACGC CGCCGTGCAC GGCACCTTCA AGGGGCTGTA CTGCCAGGGC GAAGACATCG TGCAGTCGGA TCCGAATACG CATCACGTAT CGGCCGCGTT GTCGTCGATG GAATGCATCG TCGTGCAGGA TATTTTCCTG AACGAGACTG CCAAGTATGC GCACGTGTTG CTGCCGGGTT CGTCGTTCCT CGAAAAGGAC GGTACGTTCA CCAACGCGGA ACGCCGCATT TCGCGCGTAC GCAAGGTCAT GCCGCCGGTG CCGGGTTACG CCGACTGGGA AGTCACGGTG ATGCTCTCGC GCGCGCTCGG CTATGAAATG GACTACACGC ACCCGTCGCA GATCATGGAC GAGATCGCGC GTCTCACGCC CACTTTCCAC GGCGTCTCGT ACAAAAAGCT CGACGAGATG GGCAGCATTC AGTGGCCTTG CAACGAAGAT GCACCGGACG GCACGCCGAC CATGCACATC GACAGTTTCG TGCGCGGCAA GGGCAAGTTC GTGATCACCA AGTTCATCGC GACGCCCGAG AAAGTTACGC GCAAATTTCC GATGCTGCTC ACTACGGGCC GTATCCTGTC GCAGTACAAC GTGGGCGCGC AAACCCGCCG CACCGAGAAC TCGCGCTGGC ACGACGAGGA CCGGCTGGAA ATCCATCCGC ATGACGCCGA AGAGCGCGGC ATCAAGACCG ACGACTGGGT CGGCATCGAA TCGCGCGCGG GACAAACCGT GTTGCGCGCG AAGGTGACGG AGCGCATGCA ACCGGGCGTC GTTTATACGA CATTCCACTT TCCTGAGTCG GGCGCGAACG TGATCACCAC CGATAGCTCG GACTGGGCGA CCAACTGTCC GGAGTACAAG GTGACCGCGG TGCAGGTAAT GCCGGTCGAA CAGCCGTCGC AATGGCAGAA AGAGTATTCG CGTTTCAACA CCGAACAGCT CGATCTGCTG AAGCAACGCG AGTTGGCCAA CGCGACATCG GGCAAGTGA
|
Protein sequence | MSDVLNSPTG GCGSGNCACK SQAQQQRAPR SFFDDTDFGT PERHADIDIT LEIDGQSVTV PAGTSVMRAA VEAGVNVPKL CATDSLEPFG SCRLCLVEIE GKRGYPASCT TPVEAGMKVR TQTDRLQSLR RNVMELYISD HPLDCLTCPA NGDCELQDMA GVTGLREVRY GFDGANHLKD KKDESNPYFT YDASKCIVCN RCVRACEETQ GTFALTIAGR GFESRVAASE NVPFMESECV SCGACVAACP TATLQEKTVI MLGQAEHSVV TTCAYCGVGC SFKAEMKGNE VVRMVPHKNG QANEGHACVK GRFAWGYATH KDRITKPMIR AKITDPWREV SWEEALTYAA SEFRRIQAKH GRDSIGGITS SRCTNEETYL VQKLVRAAFG NNNVDTCARV CHSPTGYGLK TTLGESAGTQ TFASVDKADV IMVIGANPTD GHPVFGSRLK RRVREGAKLI VVDPRRIDIV DTPHVKASHH LQLRPGTNVA VVNALAHVIV TEGLVNEAFV AERCETRAFE QWRDFVALPE NAPEATEAVT GVPAQFVREA ARIYATGGNA AIYYGLGVTE HAQGSTMVMG IANLAMATGN IGREGVGVNP LRGQNNVQGS CDMGSFPHEL PGYRHISDTV TRTLFEDAWN VTLQPEPGLR IPNMFDAAVH GTFKGLYCQG EDIVQSDPNT HHVSAALSSM ECIVVQDIFL NETAKYAHVL LPGSSFLEKD GTFTNAERRI SRVRKVMPPV PGYADWEVTV MLSRALGYEM DYTHPSQIMD EIARLTPTFH GVSYKKLDEM GSIQWPCNED APDGTPTMHI DSFVRGKGKF VITKFIATPE KVTRKFPMLL TTGRILSQYN VGAQTRRTEN SRWHDEDRLE IHPHDAEERG IKTDDWVGIE SRAGQTVLRA KVTERMQPGV VYTTFHFPES GANVITTDSS DWATNCPEYK VTAVQVMPVE QPSQWQKEYS RFNTEQLDLL KQRELANATS GK
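The gene and protein sequences above are related through translation table 11, and the GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do both with the standard library only. It assumes the sequence contains only A, C, G and T (the spaces between 10-base blocks are stripped), it stops at the first stop codon, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (such as the 10-base block spacing) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGTCCGACG TGCTCAACTC TCCCACTGGC"
print(translate(first_blocks))  # MSDVLNSPTG
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.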