Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01288 |
Symbol | puuC |
ID | 8115581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1348514 |
End bp | 1350001 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644847537 |
Product | hypothetical protein |
Protein accession | YP_002999110 |
Protein GI | 251784806 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTC ATCATCTGGC TTACTGGCAG GATAAAGCGT TAAGTCTCGC CATTGAAAAC CGCTTATTTA TTAACGGTGA ATATACTGCT GCGGCGGAAA ATGAAACCTT TGAAACCGTT GATCCGGTCA CCCAGGCACC GCTGGCGAAA ATTGCCCGCG GCAAGAGCGT CGATATCGAC CGTGCGATGA GCGCAGCACG CGGCGTATTT GAACGCGGCG ACTGGTCACT CTCTTCTCCG GCTAAACGTA AAGCGGTACT GAATAAACTC GCCGATTTAA TGGAAGCCCA CGCCGAAGAG CTGGCACTGC TGGAAACTCT CGACACCGGC AAACCGATTC GTCACAGTCT GCGTGATGAT ATTCCCGGCG CGGCGCGCGC CATTCGCTGG TACGCCGAAG CGATCGACAA AGTGTATGGC GAAGTGGCGA CCACCAGTAG CCATGAGCTG GCGATGATCG TGCGTGAACC GGTCGGCGTG ATTGCCGCCA TCGTGCCGTG GAACTTCCCG CTGTTGCTGA CTTGCTGGAA ACTCGGCCCG GCGCTGGCGG CGGGAAACAG CGTGATTCTA AAACCGTCTG AAAAATCACC GCTCAGTGCG ATTCGTCTCG CGGGGCTGGC GAAAGAAGCA GGCTTGCCGG ATGGTGTGTT GAACGTGGTG ACGGGTTTTG GTCATGAAGC CGGGCAGGCG CTGTCGCGTC ATAACGATAT CGACGCCATT GCCTTTACCG GTTCAACCCG TACCGGGAAA CAGCTGCTGA AAGATGCGGG CGACAGCAAC ATGAAACGCG TCTGGCTGGA AGCGGGCGGC AAAAGCGCCA ACATCGTTTT CGCTGACTGC CCGGATTTGC AACAGGCGGC AAGCGCCACC GCAGCAGGCA TTTTCTACAA CCAGGGACAG GTGTGCATCG CCGGAACGCG CCTGTTGCTG GAAGAGAGCA TCGCCGATGA ATTCTTAGCC CTGTTAAAAC AGCAGGCGCA AAACTGGCAA CCGGGCCATC CACTTGATCC CGCAACCACC ATGGGCACCT TAATCGACTG CGCCCACGCC GACTCGGTCC ATAGCTTTAT TCGGGAAGGC GAAAGCAAAG GGCAACTGTT GTTGGATGGC CGTAACGCCG GGCTGGCTGC CGCCATCGGC CCGACCATCT TTGTGGATGT GGACCCGAAT GCGTCCTTAA GTCGCGAAGA GATTTTCGGT CCGGTGCTGG TGGTCACGCG TTTCACATCA GAAGAACAGG CGCTACAGCT TGCCAACGAC AGCCAGTACG GCCTTGGCGC GGCGGTATGG ACGCGCGACC TCTCCCGCGC GCACCGCATG AGCCGACGCC TGAAAGCCGG TTCCGTCTTC GTCAATAACT ACAACGACGG CGATATGACC GTGCCGTTTG GCGGCTATAA GCAGAGCGGC AACGGTCGCG ACAAATCCCT GCATGCCCTT GAAAAATTCA CTGAACTGAA AACCATCTGG ATAAGCCTGG AGGCCTGA
|
Protein sequence | MNFHHLAYWQ DKALSLAIEN RLFINGEYTA AAENETFETV DPVTQAPLAK IARGKSVDID RAMSAARGVF ERGDWSLSSP AKRKAVLNKL ADLMEAHAEE LALLETLDTG KPIRHSLRDD IPGAARAIRW YAEAIDKVYG EVATTSSHEL AMIVREPVGV IAAIVPWNFP LLLTCWKLGP ALAAGNSVIL KPSEKSPLSA IRLAGLAKEA GLPDGVLNVV TGFGHEAGQA LSRHNDIDAI AFTGSTRTGK QLLKDAGDSN MKRVWLEAGG KSANIVFADC PDLQQAASAT AAGIFYNQGQ VCIAGTRLLL EESIADEFLA LLKQQAQNWQ PGHPLDPATT MGTLIDCAHA DSVHSFIREG ESKGQLLLDG RNAGLAAAIG PTIFVDVDPN ASLSREEIFG PVLVVTRFTS EEQALQLAND SQYGLGAAVW TRDLSRAHRM SRRLKAGSVF VNNYNDGDMT VPFGGYKQSG NGRDKSLHAL EKFTELKTIW ISLEA
|
| |