Gene BBta_5079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_5079 
Symbol 
ID5151369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5297906 
End bp5299969 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content64% 
IMG OID640559856 
Producthypothetical protein 
Protein accessionYP_001240985 
Protein GI148256400 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0326804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGACC TTGCGATTGT TGGTGGCGGT CCGGGCGGCC TGATGAGCGC CTGGTACTTG 
AGACGGAAGC TTGGCGATCT CTGCAAGGTC ACGATTTTCG AAGCCTCCGA CCGGCTCGGC
GGCAAGATCG TGACGCGAAA ATTCGATACG GCGCCGGCGA TGTATGAAGC GGGTGTCGCT
GAGATCTACG ACTACTCGAT GACCGGGCCC GATCCGCTGC GCGAGCTGAT CCAGCATTTC
GGCCTGCAGA CGATTCCGAT GGACGCGCTT CAGGTGCAGC TCGACGGCGA GCTGCTCGAC
GACGTGCCGG GCCTGCGCCG CAAATACGGG CCGAAGACGG CAGCCGCCGT CGAGGCGTTC
CGCAAGCGCT GCTCCGAGGT GATGTCGCCG GTCGAGTATT ACGAGGGCGT CGGCGCGCAC
GACAACGAGC ACCCCTGGGC CTACAAGACC TGCGAGGAGC TGCTCGACGA GGAGGTCGAG
GATCCCACCG CCAAGCGCTT CTTCAAGGTG ATGTCGCGCT CCGACATCGC AACCGAGGCC
CACAACACCA ACGGCCTCAA CGCGCTCAAG AACTTCGTGA TGGATATCGA CGACTATATC
GGTCTGTATT CCATCCAGAA CGGCAATGAG CAGCTGATCG AGTGCCTGCG CTCGGAGGTC
GACGCCGACA TCCAGCTCAA TCATCGTGTG CTGCGCATCG GCAAGACCGA GCAGGGCCGC
TACCGGCTCA ACATGATGAA CGGCAAGGGC CCGGAGACCC GCGAGTTCGA TCTCGTGCTG
GTGTGCCTGC CGCATTCCTG GCTGTCGACG GTCGGCTGGG AGGGCGAGAA GCTGCGCCGG
TCGATGGTCA AGCACATCGC CTATTTCGAC CGTCCCGCGC ATTACCTGCG CGTCTCGATC
CTGTTCGATT CGCCGTTCTG GGGCGACAAG ATCCCGGGCT CCTGGTTCAT GTCCGAGGCG
TTCGGCGGCT GCTGCATCTA CAATGAAGGC TCGCGCCACG ACGTCGGCAA GCACGGCGTG
CTGAACTGGC TGATCGCCGG CTCCGATGCG CTGGCCTTCG CCAATCTGTC CGATCAGGAG
CTGATCGACG CCGCGCTGAA ATCGCTGCCG GCGGCGCTCG GCGATGCGCG GGCGCATTTC
ATGGAAGGCA AGATCCATCG CTGGCTGTCG TCGGTGAACG CGTTGCCGGG CGGCCTGCCG
GTGCGCGACG TCATGACGAA TCACCGGCCA GAGCCGAAGG AGCATCCCGG CATCGTCGTG
GTCGGCGACT ATCTGTTCGA TTCGACGCTC AATGGCCTGC TCGATTCGTC GGATGCCGCC
ACCGACATCA TCCTGACCGA GATGATGCGG CTGCGCCGCG CCCGCGCGCA GGCGGAGAAG
CCGCTGTCGG ACAAGATCGA CCGCGACTAT TTCGACAATT ACCGCGGGCA GGGGCCGTAC
AGTGAAGCCT GGTCGCAGTT CACCGATCCG GACTATCTGA CCAGCCTGAT CAAGATCGTC
TGGAACAAGG GCAAGGGCAA AGGCTACAAG CTTCTCGTCG CAGGCTCTGC CAGTGGCGAG
CTGGTCGGTG CGCTGCGCGA CCGCGGCATC GATGCCTGGG GCATCGAGAA CAACCGCTAT
ATCCACGGCA AGACGCCGAA GGCGCTGAAG AAGTACAACA AGCTCGGCAC GATCACCGAC
CTGCCGTTCA AGGCAGGTGA GTTCGATTTC GTGTTCGAGA CCAGCCTCTG TCATCTCGGC
GACAAGCAGG TGGCGCGGGC GATCCGCGAA CTGAACCGCG TGGTCAAGAC CGGCCTGGTC
TTCGGGTCGA TCACCTCGGA CATGGCGCCG GCGCTGGTCG ACCGCTACGA CCTGCTGCGC
GGCGTCAAGA AGCTCGGCAC CTGGTGGGAA TGGTCCGAGC TTTTCTTCGG CAATGGCTTC
GACCTCGCGA TGCACCGCCG CGACTGCACC GACGAGGTGT GGGCCGCGAC GCTCGCTGCC
AACAAGGGCC CGGGCCAGTG GTACGCGGAC GCTGACAGCC TGCGCTACTC CTTCTTCGAC
AAGGTCGAGG ACGACGAGGA CTAG
 
Protein sequence
MLDLAIVGGG PGGLMSAWYL RRKLGDLCKV TIFEASDRLG GKIVTRKFDT APAMYEAGVA 
EIYDYSMTGP DPLRELIQHF GLQTIPMDAL QVQLDGELLD DVPGLRRKYG PKTAAAVEAF
RKRCSEVMSP VEYYEGVGAH DNEHPWAYKT CEELLDEEVE DPTAKRFFKV MSRSDIATEA
HNTNGLNALK NFVMDIDDYI GLYSIQNGNE QLIECLRSEV DADIQLNHRV LRIGKTEQGR
YRLNMMNGKG PETREFDLVL VCLPHSWLST VGWEGEKLRR SMVKHIAYFD RPAHYLRVSI
LFDSPFWGDK IPGSWFMSEA FGGCCIYNEG SRHDVGKHGV LNWLIAGSDA LAFANLSDQE
LIDAALKSLP AALGDARAHF MEGKIHRWLS SVNALPGGLP VRDVMTNHRP EPKEHPGIVV
VGDYLFDSTL NGLLDSSDAA TDIILTEMMR LRRARAQAEK PLSDKIDRDY FDNYRGQGPY
SEAWSQFTDP DYLTSLIKIV WNKGKGKGYK LLVAGSASGE LVGALRDRGI DAWGIENNRY
IHGKTPKALK KYNKLGTITD LPFKAGEFDF VFETSLCHLG DKQVARAIRE LNRVVKTGLV
FGSITSDMAP ALVDRYDLLR GVKKLGTWWE WSELFFGNGF DLAMHRRDCT DEVWAATLAA
NKGPGQWYAD ADSLRYSFFD KVEDDED