Gene B21_01288 details

Gene Information

Locus tag: B21_01288
Symbol: puuC
ID: 8115581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1348514
End bp: 1350001
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 57%
IMG OID: 644847537
Product: hypothetical protein
Protein accession: YP_002999110
Protein GI: 251784806
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
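
The coordinates and lengths in this record are mutually consistent, which can be checked with a short Python sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1348514, 1350001)
print(length, protein_length_aa(length))  # 1488 495
```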


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTTC ATCATCTGGC TTACTGGCAG GATAAAGCGT TAAGTCTCGC CATTGAAAAC 
CGCTTATTTA TTAACGGTGA ATATACTGCT GCGGCGGAAA ATGAAACCTT TGAAACCGTT
GATCCGGTCA CCCAGGCACC GCTGGCGAAA ATTGCCCGCG GCAAGAGCGT CGATATCGAC
CGTGCGATGA GCGCAGCACG CGGCGTATTT GAACGCGGCG ACTGGTCACT CTCTTCTCCG
GCTAAACGTA AAGCGGTACT GAATAAACTC GCCGATTTAA TGGAAGCCCA CGCCGAAGAG
CTGGCACTGC TGGAAACTCT CGACACCGGC AAACCGATTC GTCACAGTCT GCGTGATGAT
ATTCCCGGCG CGGCGCGCGC CATTCGCTGG TACGCCGAAG CGATCGACAA AGTGTATGGC
GAAGTGGCGA CCACCAGTAG CCATGAGCTG GCGATGATCG TGCGTGAACC GGTCGGCGTG
ATTGCCGCCA TCGTGCCGTG GAACTTCCCG CTGTTGCTGA CTTGCTGGAA ACTCGGCCCG
GCGCTGGCGG CGGGAAACAG CGTGATTCTA AAACCGTCTG AAAAATCACC GCTCAGTGCG
ATTCGTCTCG CGGGGCTGGC GAAAGAAGCA GGCTTGCCGG ATGGTGTGTT GAACGTGGTG
ACGGGTTTTG GTCATGAAGC CGGGCAGGCG CTGTCGCGTC ATAACGATAT CGACGCCATT
GCCTTTACCG GTTCAACCCG TACCGGGAAA CAGCTGCTGA AAGATGCGGG CGACAGCAAC
ATGAAACGCG TCTGGCTGGA AGCGGGCGGC AAAAGCGCCA ACATCGTTTT CGCTGACTGC
CCGGATTTGC AACAGGCGGC AAGCGCCACC GCAGCAGGCA TTTTCTACAA CCAGGGACAG
GTGTGCATCG CCGGAACGCG CCTGTTGCTG GAAGAGAGCA TCGCCGATGA ATTCTTAGCC
CTGTTAAAAC AGCAGGCGCA AAACTGGCAA CCGGGCCATC CACTTGATCC CGCAACCACC
ATGGGCACCT TAATCGACTG CGCCCACGCC GACTCGGTCC ATAGCTTTAT TCGGGAAGGC
GAAAGCAAAG GGCAACTGTT GTTGGATGGC CGTAACGCCG GGCTGGCTGC CGCCATCGGC
CCGACCATCT TTGTGGATGT GGACCCGAAT GCGTCCTTAA GTCGCGAAGA GATTTTCGGT
CCGGTGCTGG TGGTCACGCG TTTCACATCA GAAGAACAGG CGCTACAGCT TGCCAACGAC
AGCCAGTACG GCCTTGGCGC GGCGGTATGG ACGCGCGACC TCTCCCGCGC GCACCGCATG
AGCCGACGCC TGAAAGCCGG TTCCGTCTTC GTCAATAACT ACAACGACGG CGATATGACC
GTGCCGTTTG GCGGCTATAA GCAGAGCGGC AACGGTCGCG ACAAATCCCT GCATGCCCTT
GAAAAATTCA CTGAACTGAA AACCATCTGG ATAAGCCTGG AGGCCTGA
 
Protein sequence
MNFHHLAYWQ DKALSLAIEN RLFINGEYTA AAENETFETV DPVTQAPLAK IARGKSVDID 
RAMSAARGVF ERGDWSLSSP AKRKAVLNKL ADLMEAHAEE LALLETLDTG KPIRHSLRDD
IPGAARAIRW YAEAIDKVYG EVATTSSHEL AMIVREPVGV IAAIVPWNFP LLLTCWKLGP
ALAAGNSVIL KPSEKSPLSA IRLAGLAKEA GLPDGVLNVV TGFGHEAGQA LSRHNDIDAI
AFTGSTRTGK QLLKDAGDSN MKRVWLEAGG KSANIVFADC PDLQQAASAT AAGIFYNQGQ
VCIAGTRLLL EESIADEFLA LLKQQAQNWQ PGHPLDPATT MGTLIDCAHA DSVHSFIREG
ESKGQLLLDG RNAGLAAAIG PTIFVDVDPN ASLSREEIFG PVLVVTRFTS EEQALQLAND
SQYGLGAAVW TRDLSRAHRM SRRLKAGSVF VNNYNDGDMT VPFGGYKQSG NGRDKSLHAL
EKFTELKTIW ISLEA
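
The protein sequence can be reproduced from the gene sequence with a minimal Python sketch. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code and differ only in the permitted start codons. The helpers `translate` and `gc_fraction` are hypothetical names. The inputs shown are the first and last codons of the gene sequence above, with the grouping spaces removed.

```python
# Codon table built from the NCBI amino-acid string for translation table 11
# (base order T, C, A, G at each codon position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the whitespace used to group bases and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Note: an alternative start codon such as GTG or TTG would need to be
    read as M; this sketch does not handle that case.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate("ATGAATTTTC ATCATCTGGC TTACTGGCAG"))  # MNFHHLAYWQ
print(translate("GAAAAATTCA CTGAACTGAA AACCATCTGG ATAAGCCTGG AGGCCTGA"))  # EKFTELKTIWISLEA
```

Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content listed in the Gene Information section.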