Gene BBta_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_2023 
Symbol 
ID5152747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp2086119 
End bp2088401 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content67% 
IMG OID640556963 
Productputative aldehyde dehydrogenase precursor 
Protein accessionYP_001238119 
Protein GI148253534 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGC ACGTCAAGGC AGCGCCGTCG ATGGCGCCCG ATCTCAGCCG CCGCTCCTTC 
CTGGTCGGCA CCGCCGCGAC CGGCCTCGTG CTTGGCTATG CCGGCCTCGC CGATAGCGCG
CTCGCCGCAA CGACACCTGC GAGCTTCGAG CCCAGCGTCT GGTATTCGAT CGCGCCGGAT
GGCCTGGTTA CCGTGACCTG CGGCAAGGCC GATATGGGTC AGCACATCGC CTCGACGATG
GCGCAGATCA TCTGCGAGGA GCTCGGTGCG GCCTGGAAGG ACATGCGCGT CCAGCTCGCC
TCCAACGATC CGAAGTTCAA CGATCCCGTG CTCGGCGCCC AGATCACCGG CGGCAGCTGG
AGCACGATGA TGAACTTCGA CGCCATGAGC CGCGCCGGCG CCGCCGGCCG CATCGCGCTG
ACCGAGGCCG CCGCATCAGC GATGGGCGTC CCGGCGAAGG AACTCGTGGT GCGCGACGGC
GTCGTGATGC ATCCGAAGTC GAAGAAGCAG ATGAGCTATG CCGAGATCGT CAAGAGCGGC
AAGATCACCA AGAGCTTCAC GGCCGACGAA CTGAAGGCGC TGACCTTGAA GACCCCCGAT
CAGTACACGA TGATCGGCGT CTCGGTGCCG CAGCTCGACA TTCCCGCCAA GACCAACGGC
ACGGCGAAAT ACGGCATCGA CACGATGCTG CCGGGCATGG TCTACGGCAA GGTGGTGACG
CCGCCGGTGC GCTTCGGCGC CACCGTCAAG TCCGTCGATG ACAGCGAGGC CAAGAAGGTC
CCGGGCTTCA TCAAGGCGGT CGTGCTCGAC GACAAGACCG GCTCCACGTC AGGCTGGGTG
GTCGCGGTGG CCTCGACCTT CGCCAATGCC AAGAAGGCCG CGGACGCGCT GAAGATCAGC
TATGACAAGG GTCCGTATGC CAATGTCAGC ACCGACAGCA TCATCACCGA GGCGATGCGG
CTGCAGGCGC AGGACGATGC CGGGCAGTTC TTCGTCAAGG ATGGCGATGC CAATGCGGCA
CTGGCGGGCG CCGCGAAAGT GCTGGAGGCG GAGTACACCA CCAGCATCAA CATCCACGCG
CCGATGGAGC CGATGAACGC CACCGCCGAG TTCAAGGGCG ACATCCTGCA CATCTACTCC
GGCAATCAGT TCGCGACCCG CTCCGGCGCG ATCGCGGCGG GCGCCGCCGG GATCGATCCG
AAATATGTCG TGATGCACCA GGCCTGGCTT GGCGGCGGCT TCGGCCGCCG GCTCGATGCC
GACATGATGG TGCCGGCGGT GCAGGCGGCG AAGGCCGTGG GCAAGCCGGT CAAGGTGATC
TATTCCCGCG AGAACGACAT GACGATGGAC TACTCGCGGC CGCTCACCTA CCAGAAAGTG
AAGGCCGGGC TCGACAGCAA TGGCAAGCTC ATTGCTCTGA GCCACGACGT CGTCTCGGCC
TGGCCGACGG CGCGCTGGGG CATCCCGGAC TTCCTGACGC CGTCGGTCGA CAAGAAGGGC
CCGCTCGACT CGTTCACGGT CAACGGCGCC GACTTCTTCT ACACCGTGCC CAACCACCAT
GTGCGCGCGA TCAAGAACGA GCTCGCGCAT AATGCGACGC CGTCCGGCCA GCTCCGCTCG
GTGGCGCCGG GTTGGACGTT CTGGGCGGTC GAGAGCATGA TCGACGAGAT CGCCGCCGCG
TCCGGCCAGG ACCCGGCGCA GTTCCGTATC GCGCTGCTCG ACGGCAAGGG CAAGAACGAT
GGCGGCGCGC AGCGGCTGCG CAACACGCTG CTGGCGGCGA TGGGCCTGTC CGGCTACGGG
TCGAAGAAGC TGTCGAAGGG CGAGGGCATG GGCGTCGCCT GCGTGTCGTC GCAGGAGCGG
GCGTCCGCGA GCTGGACGGC CTGCGTCGCC CATGTCGCGG TGGCTGACAA CGGCGCGGTG
ACGGTGAAGA AGCTCACCGT GGCGACCGAC GTCGGCACCC AGGTGCATCC CGACAACATC
CGCGCCCAGG TCGAGGGCGC GGCGCTGTGG GGATTGTCGC TGGCGATGTA CGAGAAGGCG
ACGTTGAAGG ATGGCGGCAT CGAGCAGACC AACTTCGACA GCTACACGCC GCTGCGGATG
AGCCAGGTGC CGGAGGTCGC GATCGCCGTG ATCGCCAATG GCGAGAAGGC GACCGGCGTC
GGCGAGCCCG CGGTGACCGT GGTCGCGCCG GCGCTCGGCA ACGCCATCTA CAACGCCTGC
GGTGCCCGCC TGCGCTCGCT GCCGATCACC GCGGAAGCGG TGAAGGCCAA CATGAAGGCG
TAG
 
Protein sequence
MNQHVKAAPS MAPDLSRRSF LVGTAATGLV LGYAGLADSA LAATTPASFE PSVWYSIAPD 
GLVTVTCGKA DMGQHIASTM AQIICEELGA AWKDMRVQLA SNDPKFNDPV LGAQITGGSW
STMMNFDAMS RAGAAGRIAL TEAAASAMGV PAKELVVRDG VVMHPKSKKQ MSYAEIVKSG
KITKSFTADE LKALTLKTPD QYTMIGVSVP QLDIPAKTNG TAKYGIDTML PGMVYGKVVT
PPVRFGATVK SVDDSEAKKV PGFIKAVVLD DKTGSTSGWV VAVASTFANA KKAADALKIS
YDKGPYANVS TDSIITEAMR LQAQDDAGQF FVKDGDANAA LAGAAKVLEA EYTTSINIHA
PMEPMNATAE FKGDILHIYS GNQFATRSGA IAAGAAGIDP KYVVMHQAWL GGGFGRRLDA
DMMVPAVQAA KAVGKPVKVI YSRENDMTMD YSRPLTYQKV KAGLDSNGKL IALSHDVVSA
WPTARWGIPD FLTPSVDKKG PLDSFTVNGA DFFYTVPNHH VRAIKNELAH NATPSGQLRS
VAPGWTFWAV ESMIDEIAAA SGQDPAQFRI ALLDGKGKND GGAQRLRNTL LAAMGLSGYG
SKKLSKGEGM GVACVSSQER ASASWTACVA HVAVADNGAV TVKKLTVATD VGTQVHPDNI
RAQVEGAALW GLSLAMYEKA TLKDGGIEQT NFDSYTPLRM SQVPEVAIAV IANGEKATGV
GEPAVTVVAP ALGNAIYNAC GARLRSLPIT AEAVKANMKA