Gene Information |
Locus tag | BBta_3089 |
Symbol | fdsA |
ID | 5155640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 3233422 |
End bp | 3236301 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640557959 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001239113 |
Protein GI | 148254528 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.440267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTCA TCCACGAAAC CGATTACGGC ACCCCGCGCT CCAAGTCCGA AACGATGGTG ACGCTCACCA TCGACGGCCA GAGCATCACC GTTCCCGCGG GCACCTCGAT CATGCGCGCG GCCATGGAGG CCGGCACGCA GATCCCGAAA CTATGCGCGA CCGACATGGT CGACGCGTTC GGCTCCTGCC GACTCTGCCT CGTCGAGGTC GAGGGCCGTG CCGGCACGCC GGCCTCCTGC ACCACGCCGG TGGCGCCGGG GCTCGTCGTC CACACCCAGA CCGAGCGGCT GAAGAAGCTG CGCAAAGGCG TGATGGAGCT CTACATCTCC GACCATCCGC TCGACTGCCT GACCTGCGCC GCCAATGGCG ATTGCGAGTT GCAGGACATG GCCGGCGCGG TCGGCCTGCG CGACGTGCGC TACGGCAACC AGGGTGAGAA CCACGTGTTC GCCGACGCCT GCGGCGAAGC CAACCCGGCC TGGCTGCCGA AGGACGAGTC GAACCCCTAT TTCACCTACG ATCCCTCCAA GTGCATCGTC TGCTCGCGCT GCGTCCGCGC TTGCGAGGAG GTCCAGGGCA CGTTCGCGCT GACCATCTCC GGCCGCGGCT TCGACAGCCG GGTGTCGCCG GGCATGAGCG AGAGCTTCCT CGGTTCGGAG TGTGTGTCCT GCGGCGCCTG CGTGCAGGCC TGCCCGACCG CGACCCTGAC CGAGAAGAGC GTGATCGAGA TCGGCCAGCC CGAGCATTCG GCGGTCACCA CCTGCGCCTA TTGCGGCGTC GGCTGCACCT TCAAGGCGGA GATGCGCGGC GAGGAAATCG TGCGCATGGT GCCGTACAAG GACGGCAAGG CCAATCGCGG CCATTCCTGC GTCAAGGGCC GCTTCGCCTA TGGCTATGCG ACCCACAAGG AGCGCATCCT CAAGCCGATG ATCCGCGAGA GCATCGATCA GCCCTGGCGC GAGGTTTCCT ATGACGAGGC CTTCACCTTT GCGGCCCAGA AGATGCGCGG CATCCAGGCC AAGTATGGCC GCGACTCGAT CGGCGGCATC ACCTCGTCGC GCTGCACCAA CGAAGAGACC TATCTGGTGC AGAAGCTGAT CCGCGCCGGC TTCGGCAACA ACAATGTCGA CACCTGCGCC CGCGTCTGCC ACTCGCCCAC CGGCTACGGC CTCGCCACCA CCTTCGGCAC TTCGGCCGGC ACGCAGGACT TCGACTCGGT CGAGGACTCC GACGTCATCA TGGTGATCGG CGCCAATCCG ACCGACGCAC ATCCGGTGTT CGGCTCGCGC ATGAAGAAGC GGCTGCGCCA GGGCGCCAAG CTGATTGTGG TCGATCCGCG CAAGATCGAT CTCGTGAAGT CGGCGCATAT CGAGGCAGAC TATCACTTGC CGCTGCTGCC CGGCACCAAC GTCGCGATCA TGACCGCGAT GGCGCATGTG ATCGTCACCG AAGGCCTCGT CAACGAGACC TTCGTGCGCG AGCGCTGCGA TTGGAGCGAG TTCCAGGACT GGGCCGAGTT CGTCGCGCTG GAGAAGAACA GCCCGGAGGC AATCGCCGCC ATCTCGGGCG TCGACCCGGA AGCGATCCGC GGCGCCGCCC GCCTCTACGC CACCGGCGGC AATGGCGCGA TCTATTACGG CCTGGGCGTC ACCGAGCACA GCCAGGGCTC GACCACGGTG ATCGCAATCG CCAACCTCGC GATGGCGACC GGCAATATCG GTCGCCGCGG CGTCGGCGTG AACCCGCTGC GCGGCCAGAA CAACGTGCAG GGCTCCTGCG ACATGGGCTC GTTCCCGCAC 
GAACTGCCGG GCTACCGGCA CATCTCGGGC GACGCGGTGC GCGACCAGTT CGAGGCGATG TGGAACGTCA AGCTCAACCC TGAGCCGGGC CTGCGCATCC CCAACATGTT CGACTCCGCG ATCGACGGCA CCTTCAAGGG GCTGTACGTG CAGGGCGAGG ACATCCTGCA GTCGGATCCG AACACCACGC ATGTCGTGCA GGCGCTGTCG GCGATGGAAT GCGTCATCGT CCAGGACCTG TTCCTGAACG AGACCGCCAA CTACGCCCAC GTCTTCCTGC CCGGCTCGAC CTTCCTGGAG AAGGACGGCA CCTTCACCAA CGCCGAGCGT CGCATCCAGC GCGTCCGCAA GGTGATGACG CCGCGCAACG GCCTCGCCGA CTGGGAGGTC ACGATCGGTC TCGCCAAGGC GATGGGCTTC GAGATGAAGT ACAACCATCC TTCGGAGATC ATGGACGAGA TCGCGGCGCT GACGCCGACC TTCACCGGCG TCTCCTATCA GAAGCTCGAG GAGATGGGCT CGGTGCAGTG GCCCTGCAAC GAAACCTATC CCGAGGGCTC GCCGATCATG CATGTCGACG GCTTCGTCCG CGGCAAGGGC AAGTTCGTGG TCACCGAATA CGTCGCGACC GACGAGCGCA CCGGCCCGCG CTTCCCGCTG TTGCTCACCA CCGGCCGCAT CCTGTCGCAG TACAATGTCG GCGCCCAGAC GCGGCGCACC GACAACGTGG TGTGGCATGG CGAGGACGTG CTGGAGATCC ATCCGCACGA CGCCGAGCAG CGCGGCATCC GCGACGGCGA CTGGGTGCGG CTGACCAGCC GTGCCGGCGA GACCACCTTG CATGCGCTGA TCTCCGAGCG TGTGGCGCCG GGCGTGGTCT ACACCACCTT CCACCATCCG CTGACCCAGG CCAACGTCAT CACGACGGAC TATTCCGACT GGGCGACCAA TTGTCCGGAG TACAAGGTCA CGGCGGTGCA GGTGTCGCAG TCCAACGGCC CGTCGGACTG GCAGAAGGCC TATGACGAGC AGGCCCGCAA CTCGCGCCGC ATCGCCCCGA TCGTGGAGGC GGCGGAGTAG
|
Protein sequence | MSLIHETDYG TPRSKSETMV TLTIDGQSIT VPAGTSIMRA AMEAGTQIPK LCATDMVDAF GSCRLCLVEV EGRAGTPASC TTPVAPGLVV HTQTERLKKL RKGVMELYIS DHPLDCLTCA ANGDCELQDM AGAVGLRDVR YGNQGENHVF ADACGEANPA WLPKDESNPY FTYDPSKCIV CSRCVRACEE VQGTFALTIS GRGFDSRVSP GMSESFLGSE CVSCGACVQA CPTATLTEKS VIEIGQPEHS AVTTCAYCGV GCTFKAEMRG EEIVRMVPYK DGKANRGHSC VKGRFAYGYA THKERILKPM IRESIDQPWR EVSYDEAFTF AAQKMRGIQA KYGRDSIGGI TSSRCTNEET YLVQKLIRAG FGNNNVDTCA RVCHSPTGYG LATTFGTSAG TQDFDSVEDS DVIMVIGANP TDAHPVFGSR MKKRLRQGAK LIVVDPRKID LVKSAHIEAD YHLPLLPGTN VAIMTAMAHV IVTEGLVNET FVRERCDWSE FQDWAEFVAL EKNSPEAIAA ISGVDPEAIR GAARLYATGG NGAIYYGLGV TEHSQGSTTV IAIANLAMAT GNIGRRGVGV NPLRGQNNVQ GSCDMGSFPH ELPGYRHISG DAVRDQFEAM WNVKLNPEPG LRIPNMFDSA IDGTFKGLYV QGEDILQSDP NTTHVVQALS AMECVIVQDL FLNETANYAH VFLPGSTFLE KDGTFTNAER RIQRVRKVMT PRNGLADWEV TIGLAKAMGF EMKYNHPSEI MDEIAALTPT FTGVSYQKLE EMGSVQWPCN ETYPEGSPIM HVDGFVRGKG KFVVTEYVAT DERTGPRFPL LLTTGRILSQ YNVGAQTRRT DNVVWHGEDV LEIHPHDAEQ RGIRDGDWVR LTSRAGETTL HALISERVAP GVVYTTFHHP LTQANVITTD YSDWATNCPE YKVTAVQVSQ SNGPSDWQKA YDEQARNSRR IAPIVEAAE
|
| |
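The coordinate, length, and GC fields in the record above are related by simple arithmetic, sketched below as a consistency check. It assumes 1-based inclusive coordinates (which matches 3236301 - 3233422 + 1 = 2880 bp) and an unspliced CDS whose final codon is a stop codon (which matches 2880 / 3 - 1 = 959 aa). The function names are hypothetical helpers and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus the terminal stop codon (assumes an unspliced CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; whitespace from 10-base grouping is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(3233422, 3236301)
    print("Gene length (bp):", length)
    print("Expected protein length (aa):", expected_protein_length(length))
```

To check the reported GC content, pass the full "Gene sequence" field to gc_percent; the grouping spaces do not need to be removed first.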