Gene BBta_5124 details

Gene Information

Locus tag: BBta_5124
Symbol:
ID: 5153337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 5352520
End bp: 5354049
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 68%
IMG OID: 640559899
Product: putative aldehyde dehydrogenase family protein
Protein accession: YP_001241027
Protein GI: 148256442
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
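
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 5352520
end_bp = 5354049

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1530 bp
print(protein_length, "aa")  # 509 aa
```

Under these assumptions both values agree with the reported Gene Length and Protein Length.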


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.982748
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.331095
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCATT TCATCGATTC CCGCTTGCGC GACGCCGCGC TGCCGTTCTG GCCTGATGTC 
GCGAGCGTCG GCTCGCATGT CGGCGGTGAT CTCGTCATCG GCCGCGGCGC GACGATCTCC
GTCAATGATC CCGCCACCGG GCAATTGCTG TTCAGCTATG CGGATGCCGG CGCCGACGTG
GCGGCCAAGG CCGCGGCGGC GGCCGTGGCG GGGCAGGCCG AATGGGCGCG CCTCACGGCG
GCCGGTCGCG GCCGCATGAT GCAGGCGATC GCCCGCGCGA TCCTCGCCGC CGCCGAGCCA
TTGGCGCAGC TTGAAGCCTT GTCCGCCGGC AAGCCGATCC GCGACACCAG AGGCGAGGTC
GCCAAGGTCG CGGAGATGTT CGAATATTAT GCCGGATGGG CCGACAAATT CCACGGCGAG
GTCATCCCGG TGCCCTCGAG CCATCTCAAC TACACACGCC GCGAGCCCAT GGGCGTGGTG
TTGCAGATCA CGCCGTGGAA TGCGCCGATC TTCACCGCCG GCTGGCAGAT CGCGCCTGCC
ATCAGCATGG GCAATGCGGT CATCCTCAAG CCCTCGGAGC TGACGCCGCT GACCTCGCTG
GCGCTGGCGC TGATCGCCGA GCGCGCGGGC CTGCCGAAGG GCGTCGTCAA CGTGCTCGCC
GGCTTCGGTC ACAGCACGGG GCAGGCGGCA CTGGCGCAGC CGGCGGTCAA GAAGGTGGTG
TTCGTCGGCT CGGTGCCGAC AGGGCGGCTG ATCGGCGAGG CTGCGGCGCG GCGGCTGTTG
CCGTCGGTGC TCGAGCTCGG CGGCAAGTCG GCCAATATCG TGTTCGCCGA TGCCGATCTC
GAACGCGCCG CCATCGGCGC GCAGGCGGCG ATCTTCGGCG GCGCCGGGCA GAGCTGCGTT
GCGGGCTCCC GGCTGCTGGT GCAGAGCGCG GTCTATGACC GCTTCGTCGA TCTGGTCGCG
CAGGGCGCGG CCAAGATCAA ATGCGGCGAT CCGCTGTCGG CCGAGACGGA GATCGGTCCC
ATCAACAATG CCAAGCAATA TGATCACGTG CTGTCGTTGA TCCGCGAGGG CGCGGACGAG
GGCGCCGAGA TCGTGGCCGG CAGCAATGGT GAGAGCACGC CCGGCGGCTA CTACGTGAGG
CCGACCGTGC TCAAGAACGT CACCAACACG ATGGGCATCG CGCGCAAGGA AGTGTTCGGC
CCCGTCGTCG CCGCGATCCG CTTCGAGACC GAAGAGGAGG CGATCGCCAT CGCCAATGAC
AGCGAGTTCG GCCTCGCCGG CGCGGTGTGG ACGACGGATG TCGCGCGCGC ACATCGCGTG
GCCGCACAGG TCAAAGCCGG CACGTTCTGG ATCAACTCCT ACAAGACGAT CAACGTCGCC
TCACCCTTCG GTGGCTACAA TCACAGCGGT CACGGCCGCT CCTCCGGCGT CGAGGCGCTG
TATGAATATA CGCAGGTGAA GAGCGTATGG GTCGAGACCG CGGCGGAGCC GGCGGTGGCG
TTCGGCTACG CGCCGGGCCT GCGCGAATGA
 
Protein sequence
MTHFIDSRLR DAALPFWPDV ASVGSHVGGD LVIGRGATIS VNDPATGQLL FSYADAGADV 
AAKAAAAAVA GQAEWARLTA AGRGRMMQAI ARAILAAAEP LAQLEALSAG KPIRDTRGEV
AKVAEMFEYY AGWADKFHGE VIPVPSSHLN YTRREPMGVV LQITPWNAPI FTAGWQIAPA
ISMGNAVILK PSELTPLTSL ALALIAERAG LPKGVVNVLA GFGHSTGQAA LAQPAVKKVV
FVGSVPTGRL IGEAAARRLL PSVLELGGKS ANIVFADADL ERAAIGAQAA IFGGAGQSCV
AGSRLLVQSA VYDRFVDLVA QGAAKIKCGD PLSAETEIGP INNAKQYDHV LSLIREGADE
GAEIVAGSNG ESTPGGYYVR PTVLKNVTNT MGIARKEVFG PVVAAIRFET EEEAIAIAND
SEFGLAGAVW TTDVARAHRV AAQVKAGTFW INSYKTINVA SPFGGYNHSG HGRSSGVEAL
YEYTQVKSVW VETAAEPAVA FGYAPGLRE
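
The protein sequence can be checked against the gene sequence by translating codons. The following is a minimal sketch and not the tool used to produce this record: the `translate` function and the table names are hypothetical, and the codon-to-amino-acid mapping is the standard one, which translation table 11 shares for amino acid assignments (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon table in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA to protein, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the 10-base display blocks.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACCCATT TCATCGATTC CCGCTTGCGC"))  # MTHFIDSRLR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should allow the remaining residues to be compared in the same way.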