Gene BBta_1939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_1939 
Symbol 
ID5151645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp2002463 
End bp2003464 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content68% 
IMG OID640556882 
Productputative fumarylacetoacetate hydrolase 
Protein accessionYP_001238038 
Protein GI148253453 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.615073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTCG CAACCTATCG GAACGGCAGC CCCGATGGCG GCCTGCTCAT CGTCTCCCGG 
GATCAGGCGC GCGCGTGCAA TGCCAGCGCC ATCGCACCGG ATCTGCTGAC GGCGCTGCGG
AATTGGCAGC AGGTCGAGCC GGCGTTGCGC CGGCTCGCCG AGAGCCTCGA GGATGGATCG
GCTGCCGATG TGATGGCGTT CGATCCCACG CGATGCCTTG CACCGCTGCC GCGCGCACCG
CAATGGCTGG ATGGCTCGGC CTTCCTGAAT CACGGACGGC TGATGGATGT CGCCTTCAAC
AAGCCGCCGA TCCCCGACTT CGACACCATC CCGGTGATGT ACCAGGGCGC CAGCGACGAT
TTCCTCGGGC CGCAGGCCGA CGTGCCCTTC GTGACGGAGG CGGATGGCAT CGACTTCGAG
GGCGAGTTCG GCGTCATCGT CGATGATGTG CCGATGGCCG TGTCGCCAGA GCAGGCCGCG
CAACGGATTC GGCTGCTGGT TCAGATCAAT GATTGGAGCC TGCGCGCGGT CGGTGCGCGC
GAGGTGCGCA CCGGCTTCGG CTTCCTGCAG GCGAAGCCCT CGACGAGCTT CGCGCCGATG
GCGGTGACTC CGGACGAGGT CGGCGACGCC TGGCGTGACG GCCGGCTCGA CATGGCCCTC
CATGTTCACC GCAATGGCGA GCGGATCGGC GCGGCCTCAG GCCGCGAGAT GGCGTTCTCG
TTTCCGCAAC TCATCGCCCA TGCCGCCCGG ACGCGGCGGT TGACGGCCGG CACGATCATC
GGCTCGGGCA CGGTGTCGAA TGCCGATCGC GCCGCCGGAT CGAGTTGCCT GGCTGAGGTC
AGGGCGATCG AGATGATCGA ACGCGGCGAG GCGCGCACGC CCTTCCTGCG CTTCGGTGAC
GAGGTGACCA TGCAGGCCTG CTTCGCCGAT GGCCGGGGAG GCCCGTTCGG ACGCATCGCG
CAGCGCGTGG TTCGTGCGGC GAGCACCGAT CGGCCGGAGT GA
 
Protein sequence
MRLATYRNGS PDGGLLIVSR DQARACNASA IAPDLLTALR NWQQVEPALR RLAESLEDGS 
AADVMAFDPT RCLAPLPRAP QWLDGSAFLN HGRLMDVAFN KPPIPDFDTI PVMYQGASDD
FLGPQADVPF VTEADGIDFE GEFGVIVDDV PMAVSPEQAA QRIRLLVQIN DWSLRAVGAR
EVRTGFGFLQ AKPSTSFAPM AVTPDEVGDA WRDGRLDMAL HVHRNGERIG AASGREMAFS
FPQLIAHAAR TRRLTAGTII GSGTVSNADR AAGSSCLAEV RAIEMIERGE ARTPFLRFGD
EVTMQACFAD GRGGPFGRIA QRVVRAASTD RPE