Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4994 |
Symbol | |
ID | 5902456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5396711 |
End bp | 5397763 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641565515 |
Product | aldo/keto reductase |
Protein accession | YP_001686612 |
Protein GI | 167648949 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.570117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTACG TGGAACTCGG CCGGACCGGC ATCCAGGTCT CGCGCTGCTG CCTGGGCACC ATGACCTGGG GCTCGCAGAA CAGCGAGGCG GAAGCCCACG AACAGATGGA CTACGCCCTG GGGCAGGGCG TGACCTTCTG GGACACCGCC GAGATGTATT CCAGCCCGCC CAATCCCGAG ACTCAAGGCA ATACCGAGCG CCATATCGGC TCGTGGCTGG CCAAAACCGG CAAGCGCCAG GAGATTGTCC TGGCCTCGAA GGTGGCCGGC CGGGGCAATG CGTTCGGCGG CCTGTCGTGG ATGCGGCCCG ACGGCGGCTC CACCCGCCAG ACCAAGGCCC AGATCGACTA CGCGGTCGAG CAATCGCTCA AGCGCCTCAA CACCGACTAT CTCGACCTCT ACCAGCTGCA CTGGCCCGAC CGGCCGGTGC GGGTGTTCGG CGGCCAGACC TTCAAGGACT ACGAGCAGGA CTTCGAGGCG TTCGGCGACA TTCTCGAGGC GCTGGACGCC CACGTGAAGA AGGGGTCGAT CCGTTCGGTC GGCGTCTCCA ACGAATTTCC GTGGGGCGTG ATGCGCTTCC TGGCCGAGGC TGAGACGCGC GGCCTGCCGC GCATCGCCTC GATCCAGAAC GCCTACCACC TGGCTAACCG CACCTTCGAA TACGGCCTGG CCGAGATCGC CCTGCGCGAA CAGGTGGGCC TGCTGGCCTA TTCGCCCCTG GCCCAGGGCG CCCTGACCGG CAAGTACCTG GACGGCAAGC TGCCCGACGG TTCGCGCAAG GCGCTCTACA ACCGCATGCA GCGCTACGAG GGTCCCGGCG CCGAGGAGGC GATCCGCGGC TATGTGGATC TGGCCGCCCA TTTCGGCGTC GATCCGGCCC AGCTGGCCCT GAAGTTCTGC GACACGCGGG AATTCGTCAC CGCCACCATC ATTGGCGCCA CCTCGATGGA CCAGCTGAAG ACCAACATCG CCGCCTTCGA CCTGACCTGG ACCGAGGAGA TGGAGAGGGC GGTCAACGCC CTGCACGCCC TGCGGCCCAA TCCGTGTCCG TGA
|
Protein sequence | MDYVELGRTG IQVSRCCLGT MTWGSQNSEA EAHEQMDYAL GQGVTFWDTA EMYSSPPNPE TQGNTERHIG SWLAKTGKRQ EIVLASKVAG RGNAFGGLSW MRPDGGSTRQ TKAQIDYAVE QSLKRLNTDY LDLYQLHWPD RPVRVFGGQT FKDYEQDFEA FGDILEALDA HVKKGSIRSV GVSNEFPWGV MRFLAEAETR GLPRIASIQN AYHLANRTFE YGLAEIALRE QVGLLAYSPL AQGALTGKYL DGKLPDGSRK ALYNRMQRYE GPGAEEAIRG YVDLAAHFGV DPAQLALKFC DTREFVTATI IGATSMDQLK TNIAAFDLTW TEEMERAVNA LHALRPNPCP
|
| |