Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_2669 |
Symbol | |
ID | 8545056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 3680124 |
End bp | 3681104 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646387364 |
Product | hypothetical protein |
Protein accession | YP_003267093 |
Protein GI | 262195884 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.40549 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.125112 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGAGA TCTCCCTCGA CAGCTTCGAG CGCGACAGCG ACGCCTTCGA TGAGGCCGTG GCCGCCTCGT CCGGCATCGA TCGCTTCTGC TCGTCCTCGG CCTGGATCCT GCCCGCGCAG GCGACCCTGA TGCCGCCGCG CCAGCCCTGG CTGTTTCGCG ACCAGCACGG CTACGTCGCC ATGATGCGCG GTCGCCACAT CGACGGGTGG TCATACGTCG AGCCGCTCGA GGCCATGTGG TACCTCGCCT GCCCGCTCGT CGGCCCAACC CCGCGCGAGC TGGCCGCGCG CTTTGGCGAG CTGTGCCGCG GCCGCCCCGA CGACTGGGAC GTCGCCCTCA TCGGCGGCCT GGAACCCAAT TCGGTGCTCA GCGAGGAGCT GGCCACCTAT CTATCGATGT TCTGCCGCCT GCGCCTGGCC CCGCCCACCA TCCGCCACGT CGCCGAGCTC GGCGACGGCT TCGAGCGCTA CCTCGGCCGC CGCTCGCGCA ACTTCCGCAA ATCCCTGCGC CGGGCCGACG ACGCCGCGCG CGCCGCCGGC ATCCGCTTCG AACGCGTCAG CGCCCGCGAC AGCGACCAGG CCGCCGCCCT GTACCGGCGC GCGGTCGCCA TCGAGGAGCG CTCGTGGAAG GGCCGCGCCG GCGTCGGCAT TCAGGATGGC GCCATGCACG CCTTCTACCA GCAGATGCTG CCGCGCCTGG CCGCGCGCGG GCGTCTGCGC GCCATCTTCG CCAGCCACCG GGGCCGCGAC GTCGCCTTCA TCCTCGGCGG CGTATACCTC GACACCTACC GCGGCCTGCA ATTCAGCTTC GACGCCGACT ACAGCGAACT CTCCCTCGGC AACCTGTGCC AGCGCGAACA GATCGCGGCC CTGTGCGAAG AGGGCGTGTC CCGATACGAT CTCGGCACTG ATATGGAATA CAAGCGCCGC TGGGCCGACA CCACCCACGA GACCATCGCC CTGCTCGCCA TTCGCCGCTG A
|
Protein sequence | MEEISLDSFE RDSDAFDEAV AASSGIDRFC SSSAWILPAQ ATLMPPRQPW LFRDQHGYVA MMRGRHIDGW SYVEPLEAMW YLACPLVGPT PRELAARFGE LCRGRPDDWD VALIGGLEPN SVLSEELATY LSMFCRLRLA PPTIRHVAEL GDGFERYLGR RSRNFRKSLR RADDAARAAG IRFERVSARD SDQAAALYRR AVAIEERSWK GRAGVGIQDG AMHAFYQQML PRLAARGRLR AIFASHRGRD VAFILGGVYL DTYRGLQFSF DADYSELSLG NLCQREQIAA LCEEGVSRYD LGTDMEYKRR WADTTHETIA LLAIRR
|
| |