Gene Hoch_1913 details

Gene Information

Locus tag: Hoch_1913
Symbol:
ID: 8544295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2629479
End bp: 2631050
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 68%
IMG OID: 646386618
Product: protein of unknown function DUF1058
Protein accession: YP_003266353
Protein GI: 262195144
COG category: [T] Signal transduction mechanisms
COG ID: [COG3103] SH3 domain protein
TIGRFAM ID:
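
The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in exactly one stop codon; neither convention is stated on the page, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2629479
END_BP = 2631050
GENE_LENGTH_BP = 1572
PROTEIN_LENGTH_AA = 523


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Amino-acid codons in a CDS, assuming exactly one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1572
print(coding_codons(GENE_LENGTH_BP))   # 523
```

Under those assumptions, the computed values match the listed gene length (1572 bp) and protein length (523 aa).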


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0257561
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGCTT TCAAACACAT CTTCTTCTGC GTCGGTCTGG TCTTGGTTCT CGCGACCGGG 
CAGGCTCACG CGGAGAAGGT GTGGACCAAG AGCGAGACCA GTCTGCGCGC CGACCCCGAT
GACGCCAGCT CGCGCGTCGC CCGTATCCAG GGAGATCGCG AGCTCAGCGT CATCGAGAAG
AAGGGCAACT GGTATCGCGT CAAGGTCGGC GTGAAGACCG GATGGATCCG CCGCAGCGAT
CTTCGGGAGC GACCCTCGAG CAACTCCGCG ACAGCGGAGT CTTCATCAGA GTCCGCCGCA
CGGGCGACGG GTTCTGGTGG TGGCTCCGGC GCACGCGCCA ACAAGAACAA CAAGAGCAAC
AAGAACAACC GCCGCAAACG ACGCTCGCGG CGCAACTGCC GCGAGGGCTC GGCGTGGTGC
GACAGCGACG GCGACGCCAT GCGCGTGGTG GTCGTGGTCA ACCGCGTCGA GGCCTACAAA
GAGCCACGCG ACGACGAGGA TATCGCGTTC TTGGCCTCGC AGGACCAGGA GCTCGTGGTC
CTCGGCCATC ACAGCCCCGA TTGGATGTAC GTCCAGACCC TGGACGGCAA GCTGGGGTGG
ATTCACGACG ACACGGTGCG CCAGAAGGGC AACCTGGTGA GCGCGCGCGG CACGGCGATC
GACGCCCCGC GCACGCAGGG CTCGGGGACC GAGTCCGGCA GCGATGGTGA TACCGGCGCT
GAGACCGGCA CGGGAGCGGA CGACGCGGCG CTCTCAGCTT CCGCTTCGGC TTCGGCGAGC
GCGGCCGGCG ACGCCACCCG GGTCACGGCT CGGCGCGGCG AGGGCGAGAG CCTCGAGCCC
GAAGGGCCTT CGCGCTTCGA CGTGCGTCTG TCGATGGGGG CGGGCGCGGC GCTGAGCGGA
CGCGCGCTCA CGGCGACCAG CGGCGAGGCC GGCAACTACG AGACCCGCTC GACCGGGCTG
GTGACCTCGC TGGCCGGCGA TATTCGCTAC GAACTCTCCG GCCCCTGGCA CGTCGGCGTC
GACGGCAACT TCGCGCTGGT TAGCGGACTG GCCGGGCTCG AGTACAACCC CGCGGCGGCC
GGTTCGGTGG AGGCGGGCAA TTATCTGCAC CACCGCACCG AGCTGAGCGC ACAGGTCGGC
TACGAGACCG ATAGCTGGAA TAGCTATCTG CACGTCGGCG GCGCCGTGGG CATCTTCTAT
ATTCAGGACC TGTTCAATGA GGCGGCGCTG CCGCGCGAGC GTCTGCTCAG CCCGCTTGTC
GGGATCAACG CCGATTTCGC GCTGAGTCCC AGCTTCGAGC TCGGCGTGCG AGCCGATGCC
CTGGTGCTGG GCGCGCTGGC GCAGACGCGG GGCCGCGAGG ACGGCACCTT CTCGTCGATG
CTGGCGGTGG CGGCGCAGGC CGAGGGCACC TACAGCCTCT CGGACACGCT CGGGCTGTTG
GCCGCGTTCC GCTTCGATCG CGTGTTCCCC GAGTGGACCG GGGCGTCGGT GCGTGAGGCC
GGGGTCGAAG GCGCCGCTCG CACCGACCAG ATGCTGCGTT TTCTGGTCGG CGTTCAGGCG
CGCTTTCAGT AG
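
The GC content listed in the Gene Information table can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here (blocks of 10 bases separated by spaces and line breaks); `gc_fraction` is a hypothetical helper name, not part of any database tool.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a sequence, ignoring spaces and newlines."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


# First line of the gene sequence above; paste all lines for the whole-gene value.
first_line = "ATGATCGCTT TCAAACACAT CTTCTTCTGC GTCGGTCTGG TCTTGGTTCT CGCGACCGGG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` gives the whole-gene figure, which can then be compared with the GC content listed in the table.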
 
Protein sequence
MIAFKHIFFC VGLVLVLATG QAHAEKVWTK SETSLRADPD DASSRVARIQ GDRELSVIEK 
KGNWYRVKVG VKTGWIRRSD LRERPSSNSA TAESSSESAA RATGSGGGSG ARANKNNKSN
KNNRRKRRSR RNCREGSAWC DSDGDAMRVV VVVNRVEAYK EPRDDEDIAF LASQDQELVV
LGHHSPDWMY VQTLDGKLGW IHDDTVRQKG NLVSARGTAI DAPRTQGSGT ESGSDGDTGA
ETGTGADDAA LSASASASAS AAGDATRVTA RRGEGESLEP EGPSRFDVRL SMGAGAALSG
RALTATSGEA GNYETRSTGL VTSLAGDIRY ELSGPWHVGV DGNFALVSGL AGLEYNPAAA
GSVEAGNYLH HRTELSAQVG YETDSWNSYL HVGGAVGIFY IQDLFNEAAL PRERLLSPLV
GINADFALSP SFELGVRADA LVLGALAQTR GREDGTFSSM LAVAAQAEGT YSLSDTLGLL
AAFRFDRVFP EWTGASVREA GVEGAARTDQ MLRFLVGVQA RFQ
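
The protein sequence is the translation of the gene sequence under translation table 11. As a rough check, the sketch below translates a DNA string with a plain Python codon table. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; because this gene starts with ATG, alternative starts are not handled. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate DNA up to the first stop codon, ignoring spaces and newlines."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
print(translate("ATGATCGCTT TCAAACACAT CTTCTTCTGC GTCGGTCTGG TCTTGGTTCT CGCGACCGGG"))
# MIAFKHIFFCVGLVLVLATG
print(translate("CGCTTTCAGT AG"))
# RFQ
```

Both outputs agree with the start and end of the protein sequence above; translating the full gene sequence the same way allows a residue-by-residue comparison.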