Gene Hoch_5121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5121 
Symbol 
ID8547532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7058512 
End bp7059771 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content73% 
IMG OID646389797 
Productvon Willebrand factor type A 
Protein accessionYP_003269502 
Protein GI262198293 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.104188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTGG AGCTGCTCAC AGACGTCGAA GACGCCGGCG CCTCGGTCTT TCTGCTGGTC 
CGTATCGAGG CGCAGGCGAC CGAGAGTTCA GCGCGTATGC CGGTCAATCT GGCCCTGGTC
ATCGACCGCT CGTCGTCGAT GCGCGGGCCG CGGCTGGCCA GCGCCATCGT GGCCGCGCGC
CAGGTCGTCG AGCAGCTCGA CGAGCGCGAC CGGCTCTCGG TCATCGCCTT CGACGCCACG
GCGCGAACCA TCTTCGGTCC CATGAGCGTG ACCGACGAGG CCCGCCAAAC CCTCGAACAG
GCCCTGGCCG GCCTGCGCAC CGGCGTCGGC ACCAACCTCG CCGCGGGCAT GAAAAAAGGC
GCCGAGGCGG TGCGCTCGGG CTTTGTGCGC GGCGCCCTCT CCCGCCTGGT GCTGCTCACC
GACGGCCAGC CCTCGCTGGG CATCACCGAC AACGACCGGC TGTGCGCGCT GGCGCAGAAA
GAGGCCGATC GCGGGGTCAC CATCACGACC ATGGGCCTGG GCCAGGGCTT CGACGACGAG
CTGCTCGCCG ACCTCGCCCA CAGCGGCCGC GGCGGCTTTC ACTATCTGGC CAGCGCGGCC
GACATCCCGG GCGCCTTCGG CCGCGAGCTG AGCGGCGTGT TCGCCATCGC CGCCACCCAG
ACCGAGATCG GCCTGCGCCC GGCGCAGCAG ATCGACGCCG CCGAGGTGCT GCACCGCCTG
CCCTCGCGGC CGCTCGACGA CGGACTGGCG GTCGAACTCG GCGAGCTGGC CGCGGGCACG
CCGCGCCAGG TGCTGTTCCG CCTCAGCCGT CGCAGCGGCG ACATCGAAGC CCGCTGCGGC
ACCCTCACCG TCACCTACCG CAGCTCCGAG GGCACCCCGG GCGATGCCCA CCTGCTCGGC
ATCGAGGTCC CGGCCCAGCC CGACCCGGCC CACCGGCGCA TCATCGCGCT CGAGCGCATG
CGCCTGGCCG TGGCCAGCGC CGTGGACGTG GCCTGGGCGC GCCGGGCCAG CGGCGACAGC
CTGCGCGCGC TGGGCGCCCT GAGCGAGATC AAGCTCGAGG TGTCGCAGCT CAAAGAGTCC
GAGGGGGCCG ATCCCGACGC CCTCGACGTG CTCTTGCGCG ACATCGGCGA AGCCGAGTCA
GCCGTGGTCA AGAGTTCGGC CGAACGCGAG CGCGCCCGCC GCAGCATGCG CGAGCGCAGC
CATATCACCC TGCTCGGCCA ATCCCAGACC CAGGCGGCGC CGCCCCGCGA TGACGACTGA
 
Protein sequence
MRVELLTDVE DAGASVFLLV RIEAQATESS ARMPVNLALV IDRSSSMRGP RLASAIVAAR 
QVVEQLDERD RLSVIAFDAT ARTIFGPMSV TDEARQTLEQ ALAGLRTGVG TNLAAGMKKG
AEAVRSGFVR GALSRLVLLT DGQPSLGITD NDRLCALAQK EADRGVTITT MGLGQGFDDE
LLADLAHSGR GGFHYLASAA DIPGAFGREL SGVFAIAATQ TEIGLRPAQQ IDAAEVLHRL
PSRPLDDGLA VELGELAAGT PRQVLFRLSR RSGDIEARCG TLTVTYRSSE GTPGDAHLLG
IEVPAQPDPA HRRIIALERM RLAVASAVDV AWARRASGDS LRALGALSEI KLEVSQLKES
EGADPDALDV LLRDIGEAES AVVKSSAERE RARRSMRERS HITLLGQSQT QAAPPRDDD