Gene BBta_5003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_5003 
Symbol 
ID5150239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5233458 
End bp5234336 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content66% 
IMG OID640559784 
Productputative short-chain dehydrogenase/reductase (SDR) 
Protein accessionYP_001240913 
Protein GI148256328 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.874894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0924335 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTGC TCGACGGCAA GGTGGCGCTG ATCACCGGCG CTGGTGGTGG CCTCGGCGAG 
GCCTATGCAA GGCTGTTCGC GCGCGAGGGC GCGGCGGTGG TGGTCAACGA TCTCGGCGGC
CCGCGTGATG GGTCCGGCTC GGATCTGTCG ATGGCCGGGC AGGTGGCGGC CGCGATCACG
GCCGAGGGCG GCCGCGCCGT CGCCAATGGC GCTGACATCT CGACCATGGC GGGCGGGCAG
TCGGTGTTCG ACGATGCGAT CCGGCATTTC GGCCGCGCCG ACATTCTGGT CAACAATGCC
GGCATCCTGC GCGACCAGAC CTTTGCCAAG TCCAGCGAGG CCGATTGGGA CAAGGTGATC
CAGGTCCACC TCAAAGGCAC TTTCTGCTGT ACCTTGCCGG TGTTCCGCTG GATGCGCGAC
AATGGCGGCG GCGTCATTGT CAACACGTCC TCGACCTCCG GGCTGATCGG CAATTTCGGC
CAGTCCAATT ACGGCGCGGC GAAGGGCGGT ATTTGGGGCC TGTCCAACGT GCTGGCGGTG
GAGGGCCGCA AGTACAACAT CCGGGTGTGG ACCCTGGCGC CGGGCGCGCT GACACGGATG
ACCGCCGACC TGCCGCGCTA CAAGGAAAAT CCGGGCGCCG CACTGACGCC CGAAGGCATC
GCGCCGGCGG TGCTGTATAT GGTCAGCCAC CTCTCCGGCG ATCAGACCGG CAAGGTGCTC
GGCGTCTCCG GCCCGCGCGG CGTGCGCGAG TTGCGCATGA TGGAAATGGA CGGCTGGAAG
CCGCCATCCT CGGCCTGGCG GCCCGAGGAC ATCGCTGTTC ATGCAGAGGA GATATTCTTT
TCGGAGGCCG ACATTCAAAA GTCCGCCCGG CGGTTTTGA
 
Protein sequence
MGLLDGKVAL ITGAGGGLGE AYARLFAREG AAVVVNDLGG PRDGSGSDLS MAGQVAAAIT 
AEGGRAVANG ADISTMAGGQ SVFDDAIRHF GRADILVNNA GILRDQTFAK SSEADWDKVI
QVHLKGTFCC TLPVFRWMRD NGGGVIVNTS STSGLIGNFG QSNYGAAKGG IWGLSNVLAV
EGRKYNIRVW TLAPGALTRM TADLPRYKEN PGAALTPEGI APAVLYMVSH LSGDQTGKVL
GVSGPRGVRE LRMMEMDGWK PPSSAWRPED IAVHAEEIFF SEADIQKSAR RF