Gene Hoch_1379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1379 
Symbol 
ID8543761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1846233 
End bp1847843 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content73% 
IMG OID646386091 
Producthypothetical protein 
Protein accessionYP_003265826 
Protein GI262194617 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCGC TCACGCGGAT CACAAAGGGC GCGCTGAGCT GGCGAGCCTC GGCGCGCTTG 
TCCGAACCGA AGCCGGGCGC GCAGTGCGTG GTGCTCAGCC GCTATGACCG GTGGCATTTC
GAGCGGCTGG GCGCGCACGA CCAGGCGCTC ATGCAGCGGC TGCAGCGCAC GCCCGCGCTG
GTGCCGGTGG CCTTCGATCT CTACGGGTTT CTGTACAAGC CGCGGCCACA GCGGGTGCCG
GGCGCCGCGC TGACTCTGGC CGTGAAGGTG CTGGCCAAGC TGGCAGAGCT CGGCGAGCTG
GCTGCGATTC GCCAGCGCAC GGTGCTGAGC GAGTGGGCCA CGGTGATGCA CCTGAACGCG
CTGCTCGGGC CGGTCGAGGA GTTTTTGCAG CGCGGCGGCG GGCAGAGCGG CGCCGGCGGG
CACGGCGACG CCAACGCCGA GGGCGCGGGG GGCGCGGGGG GCGGTGACGC CGAGGTGCTC
GAAGCGTTCG AGCGCGCCCA CGGCCTGGGC CAGCAGCCGG CGCGCGCGTC GCAGCCCGGG
CGTTCGCAGC CCGAGCAGTC GGACGAGCAC TCGGACGAGC AGACGGGCGA GCACCTCGCG
CCCGGGAAGC ACGGGCTGGC GGATACCTCG TCCGAGACCG ACGCGCGCGA TGCCGATCTG
CGGGCGGCCT TGCAGCAGGC GTGCGCGGAG CTGGCCGAGG CCTTGCGGGC GGTGGACGAA
GTCGCCGAGG CCATGCAGGA GTGCGACACC TGGGGCCGCG AGCCCGGGGA TTTCGGACGC
CTGCCCATCG AAGAGTTTCA GCGCCTGAGC CAGGTCTTGC GCGAGACTCC CTCGGTGCGC
AAGATCGTCG AGCTGGCCGG GCGCTGGAGT GAGCTGTTGA AGCCGCGGCT CAAGCGCGGG
CACTCGCCGC GCGGTCGCAG CGAGCTGGTC GGCGTGACGC TGGGCGGCGG GCTCGAGCGG
CTGTGTGCCA CGGAGCTGAT CAAGCTGCGC CATCCGGCGC TCAGGCGCGT GCTTCTCGGT
CAGCTCGCCG AGCGGCGGGC GCTGGTGCAC GAGCTGCGCG GCCCCGATGT GCTGGGTCGC
GGGCCCATGA TCTTGGTGGT CGATACCTCC GGGTCCATGC ACGGCGCGCG CATGACCATG
GCCAAGTCGC TGATGCTGGC GCTGGCGCTG CATTGCTGGG AGCAGCGGCG GCCGCTGCGC
GTGCTCACCT TTGGCGCGCC GGGCGAGATG CACGAGAGCG AGGTGGCCGT GGACGAGCCC
TTCTGGACCC GCCTCGAGCA GTGTCTGAGC GTGGCCTTCG GCGGCGGCAC GGACTTCGAC
GGGCCGCTGC TGCGCGTGTG CGAGATCGTC GGCGAGCGGC CCTGGCGGCG GGCGGACGCC
GTGTTTTTGA CCGACGGCGA GTGCTGCGTC GCCGAGGCCA CGCGCGCGCA GCTCGCCCGC
ACCCGGGCAC GCGTCGCGCT CAACATCATC GGCGTGCTGG TCGGCCGTGG GCGCGGACTC
GACGGCGTGG CCGACATCGC GTACCGCGCT CGCGACGGCC GCGGCTGGCG CGGTGATGTC
GATCCCCGGA CCTGGGAGGT GCTCGAGCGC GATATCGTTG TTGACGTGTG A
 
Protein sequence
MGALTRITKG ALSWRASARL SEPKPGAQCV VLSRYDRWHF ERLGAHDQAL MQRLQRTPAL 
VPVAFDLYGF LYKPRPQRVP GAALTLAVKV LAKLAELGEL AAIRQRTVLS EWATVMHLNA
LLGPVEEFLQ RGGGQSGAGG HGDANAEGAG GAGGGDAEVL EAFERAHGLG QQPARASQPG
RSQPEQSDEH SDEQTGEHLA PGKHGLADTS SETDARDADL RAALQQACAE LAEALRAVDE
VAEAMQECDT WGREPGDFGR LPIEEFQRLS QVLRETPSVR KIVELAGRWS ELLKPRLKRG
HSPRGRSELV GVTLGGGLER LCATELIKLR HPALRRVLLG QLAERRALVH ELRGPDVLGR
GPMILVVDTS GSMHGARMTM AKSLMLALAL HCWEQRRPLR VLTFGAPGEM HESEVAVDEP
FWTRLEQCLS VAFGGGTDFD GPLLRVCEIV GERPWRRADA VFLTDGECCV AEATRAQLAR
TRARVALNII GVLVGRGRGL DGVADIAYRA RDGRGWRGDV DPRTWEVLER DIVVDV