Gene Hoch_4948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4948 
Symbol 
ID8547356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6823331 
End bp6824380 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content73% 
IMG OID646389622 
Productprotein of unknown function DUF323 
Protein accessionYP_003269330 
Protein GI262198121 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.864459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCAC CGTCTCGCCT TTCTCCGGCC CTGCTGGGCG CCGCCTTGCT CGCGCTCGCC 
GCCGCGCCCT GGCTCAGCTC TGCGGCCCGC GCCGGCGCCC CCGATGGCGC CCCCGATGAC
GCCCACGACA GCGCCGATAC CCAAGCGCGC GCGCGCGTCC CGGACGCCGC CGCTCGCGCC
CAGATCGCGC GCATCGAACA CCGCGACCCG AGCATGGTGC CGGTGCCGGC CGGGCCCTTC
CGCATGGGCC CCGACCTGGC CGAGCTCGAG TCGCTGCTGC GCGTGTGCCA CGTGCAGTTC
GGGGCCGCCC AGGAGAACTG CGACAACGAC ATCAATCGCG CCCTGCGCGA GCGCGAGGTG
TTTCTCGACG CCTTCGCCAT CGACCGCCAC GAGGTCGCCG CCGCCGCCTA CCGCGCCTGC
GTGGACGCGG GCGCGTGTTC GGTGTCGGCG CTGGTGGCGG GCGACGAGCG CTTCATCCGC
CCCGAGTGGC CCATGGTCAA CGTCACCTGG CAGGACGCGG CCGACTACTG CGCCTGGGCC
GGCAAGCGCC TGCCCTCGGA GGCGGAGTGG GAAAAGGCCG CGCGCGGCAG CGACGGCAAG
CGCTGGCCGT GGGGCGATCA CGAGCGCCGC GACGGCGCCA ATCACGGCCG CAGCGAGAGC
GACATGATGA TGCTCTCGCG CAGCGACCTG TCGGGCCCGA TGAGCGGACC CGCGGTGCTG
CTGTTTGCAC CCGACGACAG CGACGGCTAC CCGGCCCTGG CGCCGCCGGG GGCCCTGCGC
TGGGGCGAGA GCCCCTACGG CGCGTTTGAT ATGGCCGGCA ACGCGGCCGA GTGGGTGCAG
GACTTCTACG CCGATGAGGG CTACGAGGAC CTGCCGCGCT TCAACCCCCT GCGCTCGATG
CCGAGCGAGA AAAACCACGG CGTGCGCGTG GTCCGCGGCG GCTCGTGGAT GGATCCGGGC
TTTTTCGGAC GCACCTACTA CCGCCGCTGG GCCAACGAGC GCGCGCGCTC CGAGCGCATC
GGTTTCCGCT GCGCCCGCGA TCTCGATTAA
 
Protein sequence
MSSPSRLSPA LLGAALLALA AAPWLSSAAR AGAPDGAPDD AHDSADTQAR ARVPDAAARA 
QIARIEHRDP SMVPVPAGPF RMGPDLAELE SLLRVCHVQF GAAQENCDND INRALREREV
FLDAFAIDRH EVAAAAYRAC VDAGACSVSA LVAGDERFIR PEWPMVNVTW QDAADYCAWA
GKRLPSEAEW EKAARGSDGK RWPWGDHERR DGANHGRSES DMMMLSRSDL SGPMSGPAVL
LFAPDDSDGY PALAPPGALR WGESPYGAFD MAGNAAEWVQ DFYADEGYED LPRFNPLRSM
PSEKNHGVRV VRGGSWMDPG FFGRTYYRRW ANERARSERI GFRCARDLD