Gene Bpro_4519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_4519 
Symbol 
ID4012794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp4777476 
End bp4778951 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content67% 
IMG OID637944171 
Productbenzaldehyde dehydrogenase (NAD+) 
Protein accessionYP_551303 
Protein GI91790351 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.825064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCC TTCTTTATCC GCCGCCGAAC TCAGGGCACT GGACGGGAAA GATTTACAGC 
GACGGTTGGG TCGCGGCGCA AGGCGGCACC GCTGCGGTGA TGGAGCCTGC CACCGGCGGC
ACGCTGGGCC AGATCGGTAT CGCCTCGACC GCAGACGTTG ACCGTGCCGC GCGGAGTGCC
CAATCGGCCC GCGCCGCGTG GGCCGCGACC CCCTTTGACC AACGCGCGGC CGTGATGCGC
GAGGCCGCGC GGCTGCTGAA GGAGCGCGCC GGCGAAATCA ACGGCTGGAA TGTGCGCGAA
TGCGGCTCCA TCATGCCCAA AGCCGAATGG GAGCTGAGCG CCACCTACGA ACAAATGCTG
ATGGCCGCCG CGCTGCCGAT GCAGGCCAAC GGGCAGATGT TCCCCTCGAC CATGCCCGGG
CGAACCAACT TGTGGCGGCG CGTGCCGATC GGCACGGTGG GCGTGATTGC CCCGTGGAAC
TTCCCGCTGC TGCTCGCCAT GCGCTCGGTG GCGCCGGCGC TGGCGCTGGG CAACGCGGTG
CTGCTCAAGC CCGATGCGCA GTCCGCCGTC ACGGGTGGCA TGTTGATCGC CCAGGTGTTT
GCCGACGCCG ACCTGCCGGC GGGCGTTTTG CATGTGCTGC CGGGCGGCCC CGCCACGGGC
GATGCAGTCG TGCGGCACCC GGCGGTCAAC ATGATCTCCT TCACCGGCTC GACGGCGGTG
GGGCGGCAAA TTGGCGAAGT ATGCGGTGGC TTGCTCAAAA AGGTGGCGCT CGAGTTGGGC
GGCAACAACG CCATCGTCGT GCTGGACGAT GCCGACCTGG ACGCCGCCAG TTCCTGTGGC
GCCTGGGGCG CTTTTTTGCA TCAGGGGCAA ATCTGCATGC AGGCGGGCCG CCACCTGGTG
CACCGCAGCG TGGCCGAGGC CTATTCCCAG CGACTGAAGC AGCGCGCCGA GGCGCTGCAT
GTGGGCAACC CGCACGCCGG CCCCGCACAT TTGGGACCTG TCATCAACGC CAAACAGCGC
GACCGCATTG ACCAGATCGT GCAGGCCTCG GTGGCGCAAG GCGCGCGCGT GGTGACTGGC
GGCACCTACG AGGGCTTGCT CTACCGCCCG ACCGTGCTCA CCGAGGTGCA GCCGGAGATG
CCGGCGTTCA CCGATGAAAT TTTTGGCCCG GTCGCCCCCA TCACCGTATT CGACACCGAC
GAGGAGGCGG CGGCGCTGGT CAACGCCTCG GCCTACGGTC TGGCGGCGGC CATCCACAGC
AGCAACATTT CGCGCGCCAT GGCGCTGGCC AACCGCTTGC ACACCGGAAT GATTCACATC
AACGACCAGA CGGTGAACAA CGAATTCCAC GTGCCCTTTG GCGGCATGGG CGCCTCGGGC
AATGGCGGGC GCTTTGGCGG TCCGGCCAAC CTGGAGGAGT TCACCCAAAG CCAGTGGGTC
AGCGTGATGG ACAAGCCCAT CGTGTACCCG TTCTGA
 
Protein sequence
MDTLLYPPPN SGHWTGKIYS DGWVAAQGGT AAVMEPATGG TLGQIGIAST ADVDRAARSA 
QSARAAWAAT PFDQRAAVMR EAARLLKERA GEINGWNVRE CGSIMPKAEW ELSATYEQML
MAAALPMQAN GQMFPSTMPG RTNLWRRVPI GTVGVIAPWN FPLLLAMRSV APALALGNAV
LLKPDAQSAV TGGMLIAQVF ADADLPAGVL HVLPGGPATG DAVVRHPAVN MISFTGSTAV
GRQIGEVCGG LLKKVALELG GNNAIVVLDD ADLDAASSCG AWGAFLHQGQ ICMQAGRHLV
HRSVAEAYSQ RLKQRAEALH VGNPHAGPAH LGPVINAKQR DRIDQIVQAS VAQGARVVTG
GTYEGLLYRP TVLTEVQPEM PAFTDEIFGP VAPITVFDTD EEAAALVNAS AYGLAAAIHS
SNISRAMALA NRLHTGMIHI NDQTVNNEFH VPFGGMGASG NGGRFGGPAN LEEFTQSQWV
SVMDKPIVYP F