Gene Hoch_2669 details

Gene Information

Locus tag: Hoch_2669
Symbol:
ID: 8545056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3680124
End bp: 3681104
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 70%
IMG OID: 646387364
Product: hypothetical protein
Protein accession: YP_003267093
Protein GI: 262195884
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
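The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the last codon of the CDS is a stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record for Hoch_2669.
START_BP = 3680124
END_BP = 3681104


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 981 326
```

Both values agree with the Gene Length (981 bp) and Protein Length (326 aa) fields above.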


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.40549
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.125112
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGAGA TCTCCCTCGA CAGCTTCGAG CGCGACAGCG ACGCCTTCGA TGAGGCCGTG 
GCCGCCTCGT CCGGCATCGA TCGCTTCTGC TCGTCCTCGG CCTGGATCCT GCCCGCGCAG
GCGACCCTGA TGCCGCCGCG CCAGCCCTGG CTGTTTCGCG ACCAGCACGG CTACGTCGCC
ATGATGCGCG GTCGCCACAT CGACGGGTGG TCATACGTCG AGCCGCTCGA GGCCATGTGG
TACCTCGCCT GCCCGCTCGT CGGCCCAACC CCGCGCGAGC TGGCCGCGCG CTTTGGCGAG
CTGTGCCGCG GCCGCCCCGA CGACTGGGAC GTCGCCCTCA TCGGCGGCCT GGAACCCAAT
TCGGTGCTCA GCGAGGAGCT GGCCACCTAT CTATCGATGT TCTGCCGCCT GCGCCTGGCC
CCGCCCACCA TCCGCCACGT CGCCGAGCTC GGCGACGGCT TCGAGCGCTA CCTCGGCCGC
CGCTCGCGCA ACTTCCGCAA ATCCCTGCGC CGGGCCGACG ACGCCGCGCG CGCCGCCGGC
ATCCGCTTCG AACGCGTCAG CGCCCGCGAC AGCGACCAGG CCGCCGCCCT GTACCGGCGC
GCGGTCGCCA TCGAGGAGCG CTCGTGGAAG GGCCGCGCCG GCGTCGGCAT TCAGGATGGC
GCCATGCACG CCTTCTACCA GCAGATGCTG CCGCGCCTGG CCGCGCGCGG GCGTCTGCGC
GCCATCTTCG CCAGCCACCG GGGCCGCGAC GTCGCCTTCA TCCTCGGCGG CGTATACCTC
GACACCTACC GCGGCCTGCA ATTCAGCTTC GACGCCGACT ACAGCGAACT CTCCCTCGGC
AACCTGTGCC AGCGCGAACA GATCGCGGCC CTGTGCGAAG AGGGCGTGTC CCGATACGAT
CTCGGCACTG ATATGGAATA CAAGCGCCGC TGGGCCGACA CCACCCACGA GACCATCGCC
CTGCTCGCCA TTCGCCGCTG A
 
Protein sequence
MEEISLDSFE RDSDAFDEAV AASSGIDRFC SSSAWILPAQ ATLMPPRQPW LFRDQHGYVA 
MMRGRHIDGW SYVEPLEAMW YLACPLVGPT PRELAARFGE LCRGRPDDWD VALIGGLEPN
SVLSEELATY LSMFCRLRLA PPTIRHVAEL GDGFERYLGR RSRNFRKSLR RADDAARAAG
IRFERVSARD SDQAAALYRR AVAIEERSWK GRAGVGIQDG AMHAFYQQML PRLAARGRLR
AIFASHRGRD VAFILGGVYL DTYRGLQFSF DADYSELSLG NLCQREQIAA LCEEGVSRYD
LGTDMEYKRR WADTTHETIA LLAIRR
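
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce this record. It assumes the sequence is pasted in the space-separated blocks shown above; `clean`, `gc_fraction`, and `translate` are hypothetical helper names. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here).

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_line = "ATGGAAGAGA TCTCCCTCGA CAGCTTCGAG CGCGACAGCG ACGCCTTCGA TGAGGCCGTG"
print(translate(first_line))  # MEEISLDSFERDSDAFDEAV
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives the value that the GC content field reports as a rounded percentage.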