Gene Hoch_5567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5567 
Symbol 
ID8547981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7643309 
End bp7645363 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content69% 
IMG OID646390240 
Productvon Willebrand factor type A 
Protein accessionYP_003269942 
Protein GI262198733 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.634157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATTC GTCAGCGAAC CAACCTGCGC ACGAAGTCAG TAGCGCGCCG CCCCGCAGCG 
AGGAGTTCGC TGCGTGTAGG CAACATCGCC CTCGCCGGCG TCTTCGCCGG TGCCGCCCTG
CTCGGCGCCT GCGACATGAG CGTCGGCCAG AGCACGAGCA GCTTGGTCAC CGTCACCGCG
GTGCCCAACC CGCCGCTGGA TCCGCGCTGC GGTCTCGACG CAGTGATCGT GCTCGACGCA
TCGTCCTCGG TGCGCAACTA CAACAACCCG CCCGACGCCA ACGGCGCCGT CGACCTGATC
GCCGGCGCCG GCAACGCGTT CCTGGGCGCG TTCGCCGACA CCAACAGCCG CGTGGCCGTG
GTGTCCTACA ACGCCGACCC GCGCCTGCAG CTCGACCTCA CCGCGGTGAC CACCGACTCG
CTCGCCGCCG GCGGCGCCCA CGGCATCGCG ATGGGCGATC CCGGTGGCCC CCAGGGCCCG
ATGTCGCCGA CCACGGGTTA CAGCGAGCAC GCGCGCAACG GCTCGGGCAC CAACTGGGAA
GCCGGCCTGG TGTACGCCCA GAACGTGCTC GAGAACAACG GGCGCGCCGA CGTGCCCAAG
CTGGTGATCC ACGTGACCGA CGGCCGGCCC ACGCGTCACC TGACCCCGGA CGGCACGGTC
ACCGATGAGG GCGGCATGGC CGTCCACGTG GCCGAAGCCG CCGAGGTCGC CGATCAGCTC
AAGGCCTCGG GCGTGCACAT CTTCGCCGTC GGCGTCGGCC GCGCGCCGCA GTTCTCGGAA
GAGCTGCAGG CTACCTCCGG CCCCGACGTA TTCGACCAGA CCCAGCCCGG CGACGCCTTC
GACGTGGTCA ACGACGACGT CATCCTGGCC GCCGACTTCG ACCAGCTCGA GGAGCTGCTG
CGCGGCGTCG CCGATCAGAT CTGCGGCGCT TCGCTGACCA TCACCAAGCT GTCCTCGACC
CCCGAGGCGC CCAACAGCTT CGCGCCGGCC GAAGGCTGGG CCTTCAGCGC CAGCGTCGAC
GCCGCCGCGG GCGGCTACAA CTGGACGCTG CCCGACGCCG CGCCGGCCAC CGAGAAGACC
GCGATCAGCG ACGCCGAGGG CAACGCCGAC TTCCAGTGGC AGATCTTCGA CGACGCCGCC
TGGGGCGCGG GTACGGTCAC CGTGACCGAA AGCCAGCAGA GCGGCTACGC CATGCAGAAC
CGGGCCCTAT GCGTGCGCAC ACGAGCTGGC GCCGTGGACT TCTTCTTCGT GAACGTGTCC
CTGCCAGCGG GTAGCTTCGA CGTCGAGCTG CAGGCCGGCG ACGACGTCAA CTGCGTGGTC
CGCAACCGCG CCGATCCCAA CGCGACCGTG CCCGCCGACA TCGAGGTGGT CAAGACCGCG
AGCACCAACC TGCTGCCCGA GGAGGGTGGC GACGTGACCT TCACCTTCAC GGTCACCGAG
AGCACGGGCA ACGGCTCGGT CGAGCTGCAC ACCCTCACCG ACTCCATCTA CGGCGACCTC
AACGGTCAGG GCGACTGCTC CGTGCCGCAG ACCCTGGCGC CGAGCGGCTC CTACACCTGC
TCGTTCACGA CCACGCTCAC CGGCTTCAAC GCCGGCTTCA CCGAGACCAA CGTGGTCACG
GCCGAGGGCA CTGACGAGAA CGGCGTCGAG GTCTCGGACA CCGACGACGA GACCGTGGTC
ATCGGCGACA ACCCGCCCGA CATGCACATG TGCAAAGTCA TCAACCCGGA GGAGGCGCCG
GCCTCGGGCG GCTACATTGA GCTGCTGGTG TACATCTTCA ACGACAGCAC CTGGAGCGAT
CGCATCACCA TCCAGTCGGT GACCGACGAG CGCTTCGGCG ACCTGCTCGA CCCCGCCAAC
CCGCTGGTCG CGGACGGCTA CTGCACCGCG GGCCTCGACC CGGGCGGCTC GTACATGTGC
GCCTACACCG TGTTCTTTGC CGGTGGCACC CCGGGCGACG TGTACTACGA CACCGCCACG
GTGGTGGCGA CCGACGACGA GGGCAACTAC GTGTCCTCGT CGTACGAGAC CGAGCTGCGC
GTCGTCGCCG ACTGA
 
Protein sequence
MPIRQRTNLR TKSVARRPAA RSSLRVGNIA LAGVFAGAAL LGACDMSVGQ STSSLVTVTA 
VPNPPLDPRC GLDAVIVLDA SSSVRNYNNP PDANGAVDLI AGAGNAFLGA FADTNSRVAV
VSYNADPRLQ LDLTAVTTDS LAAGGAHGIA MGDPGGPQGP MSPTTGYSEH ARNGSGTNWE
AGLVYAQNVL ENNGRADVPK LVIHVTDGRP TRHLTPDGTV TDEGGMAVHV AEAAEVADQL
KASGVHIFAV GVGRAPQFSE ELQATSGPDV FDQTQPGDAF DVVNDDVILA ADFDQLEELL
RGVADQICGA SLTITKLSST PEAPNSFAPA EGWAFSASVD AAAGGYNWTL PDAAPATEKT
AISDAEGNAD FQWQIFDDAA WGAGTVTVTE SQQSGYAMQN RALCVRTRAG AVDFFFVNVS
LPAGSFDVEL QAGDDVNCVV RNRADPNATV PADIEVVKTA STNLLPEEGG DVTFTFTVTE
STGNGSVELH TLTDSIYGDL NGQGDCSVPQ TLAPSGSYTC SFTTTLTGFN AGFTETNVVT
AEGTDENGVE VSDTDDETVV IGDNPPDMHM CKVINPEEAP ASGGYIELLV YIFNDSTWSD
RITIQSVTDE RFGDLLDPAN PLVADGYCTA GLDPGGSYMC AYTVFFAGGT PGDVYYDTAT
VVATDDEGNY VSSSYETELR VVAD