Gene Hoch_0231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0231 
Symbol 
ID8542610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp344278 
End bp345807 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content72% 
IMG OID646385027 
Productvon Willebrand factor type A 
Protein accessionYP_003264765 
Protein GI262193556 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCGA GCGAGCTGCA CGTCATCGTC TGCGACGGCT ACGACCGCGA GGTCTTCGCC 
CGTCTGCTGC GCGAGAAGCG CTCCATGGGC GAGTGCCGCG AGCGCCTGGG CCGACTGCTG
CCGCACCCCG AGCCGCTGCT GTGCGACCTG TTCAGCGTGC TGTTCAAGCT CAACGTTGTC
GTACGAGCGG CCGAAGAACT CGCCGCCGCG GTGCAGATCC ATCACCGCCT GGTCACGGCC
GTGAGCCAGG CCCGCGACCT GGCCGCCCTG CGCGCGCGTA CCGAGCTGCG CGAGAACGAG
TGCGCGGCGC TGCTGCCCGG TCTGGTCGAG CGTATCCTCA CGGCCATGAA GCGCGACTTC
TACATCGGCC CGCAGGAGCT TCTCGAGGCC GCCGAGGTGG CCCACGACGA GGACACCCTG
GCGCAGCGCG AGGCCGAGCG CGAGCATCTA CGCGAGCTGC CCGAGGACGC CTTTGACGAC
GACGAGCGCG AGCGCCTCGA GGGCGATCTC GACGGCGAGA TCGACGCCCT GCGCGAGCGC
ATCGACGAGG CCCGCGCCCG CCAGGCGCGC GTCGCCGACA AGATCACGAG CGACCTCGAC
GACACCATCG GCCGCAAGGT CTCGGTGCTG CCCGATCAGC TCGAGCAGGG CGAGGATCTG
CGCCGCAGCA TGGGCCTGGG CAGCGGCCGC GAGGGCCAGG TGGGCGCGGC CGAGCGGCTC
GAGCTGGGCG AGCGCCTGAT GCGCAGCCGC AAGCTCAAGC TGCTGGCCAA GCTGGTGGGC
GCGTTCCGCG AGGTCGCGTT CGAGGCCCGG CGCCGGCGCG TCGTCCGAAC TCCCCAGGTG
ATGCACGAGG TCGGCCGCGG CGCGCATCTC GACCGCCTGC TGCCCTCGGA GCTGCTCGGC
CTGCCGCGCC ACCGCGGCGC CCTGCACCGC GAGTTCGTGC GCCGCCTGGT CGAGGGCGAG
CTGCTCGAGT ACGAGCTGCG CGGGGCCTCG TCGCGCGGGC CGATGGTGGT GTGCGTCGAC
GGCAGCGGCT CGATGCAGGG CACCAAGGAG ATCTGGGCCA AGGCCGTGGC GCTCACGCTC
ACCGAGATCG CCCGGCGCGA GCGCCGCCGC TGCCTGGCCA TCGTGTTCTC GTCGGGGCAC
GCGCTGTTCG AGGTCGAGCT GCTCGGCGCC AAGGGCCGCT CGAACGTGCG CGCGCCCATG
CTCGACGACA ACGTGCTGGC CTTTGCCGAG CACTTCCCCG GCGGCGGTAC CGACTTCGAG
CCGCCCATGC GGCGCGCGCT CGCGGCCGTG AGCGAGGGCA ACTACCGGCG CGGCGATATC
GTGTTCATCA CCGACGGCCA GGCCCAGGTG TCCGAGAACC TGATCGCCGA CATCACCAAG
GCGCGCAAGA AGCACCGCTT TCGCGTGCGC GGCATCTTGG TGGACGTCGC CGACAGCGAC
CGCGGCAGCC TGCTGCGCTT CTGCGACGAG GTCCGCGAGG TCACCGACCT GGTCGCCGAT
TCGCTCGGCG ATCTCTTCGC CAGCGTGTGA
 
Protein sequence
MPPSELHVIV CDGYDREVFA RLLREKRSMG ECRERLGRLL PHPEPLLCDL FSVLFKLNVV 
VRAAEELAAA VQIHHRLVTA VSQARDLAAL RARTELRENE CAALLPGLVE RILTAMKRDF
YIGPQELLEA AEVAHDEDTL AQREAEREHL RELPEDAFDD DERERLEGDL DGEIDALRER
IDEARARQAR VADKITSDLD DTIGRKVSVL PDQLEQGEDL RRSMGLGSGR EGQVGAAERL
ELGERLMRSR KLKLLAKLVG AFREVAFEAR RRRVVRTPQV MHEVGRGAHL DRLLPSELLG
LPRHRGALHR EFVRRLVEGE LLEYELRGAS SRGPMVVCVD GSGSMQGTKE IWAKAVALTL
TEIARRERRR CLAIVFSSGH ALFEVELLGA KGRSNVRAPM LDDNVLAFAE HFPGGGTDFE
PPMRRALAAV SEGNYRRGDI VFITDGQAQV SENLIADITK ARKKHRFRVR GILVDVADSD
RGSLLRFCDE VREVTDLVAD SLGDLFASV