Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_2023 |
Symbol | |
ID | 5152747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 2086119 |
End bp | 2088401 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640556963 |
Product | putative aldehyde dehydrogenase precursor |
Protein accession | YP_001238119 |
Protein GI | 148253534 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGC ACGTCAAGGC AGCGCCGTCG ATGGCGCCCG ATCTCAGCCG CCGCTCCTTC CTGGTCGGCA CCGCCGCGAC CGGCCTCGTG CTTGGCTATG CCGGCCTCGC CGATAGCGCG CTCGCCGCAA CGACACCTGC GAGCTTCGAG CCCAGCGTCT GGTATTCGAT CGCGCCGGAT GGCCTGGTTA CCGTGACCTG CGGCAAGGCC GATATGGGTC AGCACATCGC CTCGACGATG GCGCAGATCA TCTGCGAGGA GCTCGGTGCG GCCTGGAAGG ACATGCGCGT CCAGCTCGCC TCCAACGATC CGAAGTTCAA CGATCCCGTG CTCGGCGCCC AGATCACCGG CGGCAGCTGG AGCACGATGA TGAACTTCGA CGCCATGAGC CGCGCCGGCG CCGCCGGCCG CATCGCGCTG ACCGAGGCCG CCGCATCAGC GATGGGCGTC CCGGCGAAGG AACTCGTGGT GCGCGACGGC GTCGTGATGC ATCCGAAGTC GAAGAAGCAG ATGAGCTATG CCGAGATCGT CAAGAGCGGC AAGATCACCA AGAGCTTCAC GGCCGACGAA CTGAAGGCGC TGACCTTGAA GACCCCCGAT CAGTACACGA TGATCGGCGT CTCGGTGCCG CAGCTCGACA TTCCCGCCAA GACCAACGGC ACGGCGAAAT ACGGCATCGA CACGATGCTG CCGGGCATGG TCTACGGCAA GGTGGTGACG CCGCCGGTGC GCTTCGGCGC CACCGTCAAG TCCGTCGATG ACAGCGAGGC CAAGAAGGTC CCGGGCTTCA TCAAGGCGGT CGTGCTCGAC GACAAGACCG GCTCCACGTC AGGCTGGGTG GTCGCGGTGG CCTCGACCTT CGCCAATGCC AAGAAGGCCG CGGACGCGCT GAAGATCAGC TATGACAAGG GTCCGTATGC CAATGTCAGC ACCGACAGCA TCATCACCGA GGCGATGCGG CTGCAGGCGC AGGACGATGC CGGGCAGTTC TTCGTCAAGG ATGGCGATGC CAATGCGGCA CTGGCGGGCG CCGCGAAAGT GCTGGAGGCG GAGTACACCA CCAGCATCAA CATCCACGCG CCGATGGAGC CGATGAACGC CACCGCCGAG TTCAAGGGCG ACATCCTGCA CATCTACTCC GGCAATCAGT TCGCGACCCG CTCCGGCGCG ATCGCGGCGG GCGCCGCCGG GATCGATCCG AAATATGTCG TGATGCACCA GGCCTGGCTT GGCGGCGGCT TCGGCCGCCG GCTCGATGCC GACATGATGG TGCCGGCGGT GCAGGCGGCG AAGGCCGTGG GCAAGCCGGT CAAGGTGATC TATTCCCGCG AGAACGACAT GACGATGGAC TACTCGCGGC CGCTCACCTA CCAGAAAGTG AAGGCCGGGC TCGACAGCAA TGGCAAGCTC ATTGCTCTGA GCCACGACGT CGTCTCGGCC TGGCCGACGG CGCGCTGGGG CATCCCGGAC TTCCTGACGC CGTCGGTCGA CAAGAAGGGC CCGCTCGACT CGTTCACGGT CAACGGCGCC GACTTCTTCT ACACCGTGCC CAACCACCAT GTGCGCGCGA TCAAGAACGA GCTCGCGCAT AATGCGACGC CGTCCGGCCA GCTCCGCTCG GTGGCGCCGG GTTGGACGTT CTGGGCGGTC GAGAGCATGA TCGACGAGAT CGCCGCCGCG TCCGGCCAGG ACCCGGCGCA GTTCCGTATC GCGCTGCTCG ACGGCAAGGG CAAGAACGAT GGCGGCGCGC AGCGGCTGCG CAACACGCTG CTGGCGGCGA TGGGCCTGTC CGGCTACGGG TCGAAGAAGC TGTCGAAGGG CGAGGGCATG GGCGTCGCCT GCGTGTCGTC GCAGGAGCGG GCGTCCGCGA GCTGGACGGC CTGCGTCGCC CATGTCGCGG TGGCTGACAA CGGCGCGGTG ACGGTGAAGA AGCTCACCGT GGCGACCGAC GTCGGCACCC AGGTGCATCC CGACAACATC CGCGCCCAGG TCGAGGGCGC GGCGCTGTGG GGATTGTCGC TGGCGATGTA CGAGAAGGCG ACGTTGAAGG ATGGCGGCAT CGAGCAGACC AACTTCGACA GCTACACGCC GCTGCGGATG AGCCAGGTGC CGGAGGTCGC GATCGCCGTG ATCGCCAATG GCGAGAAGGC GACCGGCGTC GGCGAGCCCG CGGTGACCGT GGTCGCGCCG GCGCTCGGCA ACGCCATCTA CAACGCCTGC GGTGCCCGCC TGCGCTCGCT GCCGATCACC GCGGAAGCGG TGAAGGCCAA CATGAAGGCG TAG
|
Protein sequence | MNQHVKAAPS MAPDLSRRSF LVGTAATGLV LGYAGLADSA LAATTPASFE PSVWYSIAPD GLVTVTCGKA DMGQHIASTM AQIICEELGA AWKDMRVQLA SNDPKFNDPV LGAQITGGSW STMMNFDAMS RAGAAGRIAL TEAAASAMGV PAKELVVRDG VVMHPKSKKQ MSYAEIVKSG KITKSFTADE LKALTLKTPD QYTMIGVSVP QLDIPAKTNG TAKYGIDTML PGMVYGKVVT PPVRFGATVK SVDDSEAKKV PGFIKAVVLD DKTGSTSGWV VAVASTFANA KKAADALKIS YDKGPYANVS TDSIITEAMR LQAQDDAGQF FVKDGDANAA LAGAAKVLEA EYTTSINIHA PMEPMNATAE FKGDILHIYS GNQFATRSGA IAAGAAGIDP KYVVMHQAWL GGGFGRRLDA DMMVPAVQAA KAVGKPVKVI YSRENDMTMD YSRPLTYQKV KAGLDSNGKL IALSHDVVSA WPTARWGIPD FLTPSVDKKG PLDSFTVNGA DFFYTVPNHH VRAIKNELAH NATPSGQLRS VAPGWTFWAV ESMIDEIAAA SGQDPAQFRI ALLDGKGKND GGAQRLRNTL LAAMGLSGYG SKKLSKGEGM GVACVSSQER ASASWTACVA HVAVADNGAV TVKKLTVATD VGTQVHPDNI RAQVEGAALW GLSLAMYEKA TLKDGGIEQT NFDSYTPLRM SQVPEVAIAV IANGEKATGV GEPAVTVVAP ALGNAIYNAC GARLRSLPIT AEAVKANMKA
|
| |