Gene BBta_2939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_2939 
Symbol 
ID5154959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp3057769 
End bp3059307 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content66% 
IMG OID640557818 
Productputative 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase, HpcC-like protein 
Protein accessionYP_001238972 
Protein GI148254387 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.291838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGG CCACACCCAA AGCCGACATC TACAAGGCCA ATCTCGACCG CGCCGCCCCC 
CTGCTCGCCA AGCTCAAGGA CGAGGGCATC GGCCACTTCA TCGATGGCAA GGTGGTACCG
GCGATTTCGG GCGCGACGTT CGAGACGAAG TCGCCAATCG ACAACACCGT GCTGGCCAAA
GTAGCGCGCG GCAATGCCGA GGACATCGAC CGCGCGGCGA CCTCTGCTGC GCTCGCTTTC
AAATCCTGGC GCGAGATGGC GCCGGCGATG CGGCGCAAGC TCCTGCACCG CGTGGCCGAT
GCGATCGAGG ATCATGCCGA CGACATCGCC GTGCTCGAAT GCATCGACAC CGGTCAGGCC
TATCGCTTCA TGGCCAAGGC GGCGATCCGC GCCGCCGAGA ATTTCCGGTT TTTCGCCGAC
AAATGCACCG AGGCGCGCGA CGGGCTGAAT ACGCCGAGCG AAGAGCACTG GAACATCTCC
ACGCGGGTAC CGATCGGCCC GGTCGGCGTG ATCACCCCGT GGAACACGCC GTTCATGCTG
TCGACCTGGA AGATCGCCCC GGCGCTCGCG GCAGGCTGCA CGGTCGTGCA CAAGCCGGCC
GAATGGTCGC CGATCACGGC GCACTGGCTG GCAAAACTCG CCAAGGACGC CGGTATCCCC
GATGGCGTGC TCAACACTGT CCATGGCTTC GGCGAGGAGG CCGGCAGGGC GCTGACCGAA
CATCCTGCCA TCAAGGCGAT CGGCTTCGTC GGCGAGAGCA CGACCGGGTC GGCGATCATG
GCGCAGGGCG CGCCGACTCT GAAACGCGTG CATTTCGAGC TTGGCGGCAA GAACCCGGTG
ATCGTGTTCG ACGATGCCGA CCTCGATCGC GCGCTCGACG CCGTCGTGTT CATGATCTAT
TCGCTCAACG GCGAGCGCTG CACCTCCTCG AGCCGCCTGC TGGTGCAGCA GAACATTGCC
GAGACATTCA CGGCCAAGCT CGCGGCGCGC GTGCGTGCGC TGAAGGTCGG TCATCCGCTC
GATCCCGCGA CCGAGGTCGG GCCGCTGATT CATGAGCGGC ACCTCGCCAA GGTCTGCTCT
TATGTCGACA TCGCGCGCCA GGACGGGGCG ATCATTGCCG TCGGCGGCAA GCCGGTCGCT
GGTCCGGGCG GCGGCCACTA CGTCGAGCCC ACGCTCGTCA CGGGCGCGAG CCAGATCATG
CGCGTGGCGC AGGAAGAGGT GTTCGGGCCA TTCCTGACCG TGATCCCGTT CGGCGACGAG
GCGGACGCGA TCGCGATCGC CAATGACGTG CAGTATGGCC TCACCGGCTA TGTCTGGACC
GGCGACATGG GCCGCGCGCT GCGCGTCGCC GACGCGCTGG AGGCCGGCAT GATCTGGCTG
AACTCGGAGA ACGTCCGCCA TCTGCCGACG CCGTTCGGCG GCATGAAGGC GTCCGGCATC
GGCCGCGACG GCGGCGACTA TTCATTCGAC TTCTACATGG AGACCAAGCA CGTCTCGCTC
GCGCGCGGCA CCCACAAGAT CCAGCGGCTC GGCGTGTAA
 
Protein sequence
MDKATPKADI YKANLDRAAP LLAKLKDEGI GHFIDGKVVP AISGATFETK SPIDNTVLAK 
VARGNAEDID RAATSAALAF KSWREMAPAM RRKLLHRVAD AIEDHADDIA VLECIDTGQA
YRFMAKAAIR AAENFRFFAD KCTEARDGLN TPSEEHWNIS TRVPIGPVGV ITPWNTPFML
STWKIAPALA AGCTVVHKPA EWSPITAHWL AKLAKDAGIP DGVLNTVHGF GEEAGRALTE
HPAIKAIGFV GESTTGSAIM AQGAPTLKRV HFELGGKNPV IVFDDADLDR ALDAVVFMIY
SLNGERCTSS SRLLVQQNIA ETFTAKLAAR VRALKVGHPL DPATEVGPLI HERHLAKVCS
YVDIARQDGA IIAVGGKPVA GPGGGHYVEP TLVTGASQIM RVAQEEVFGP FLTVIPFGDE
ADAIAIANDV QYGLTGYVWT GDMGRALRVA DALEAGMIWL NSENVRHLPT PFGGMKASGI
GRDGGDYSFD FYMETKHVSL ARGTHKIQRL GV