Gene Hoch_3261 details


Gene Information

Locus tag: Hoch_3261
Symbol:
ID: 8545649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4495915
End bp: 4498056
Gene Length: 2142 bp
Protein Length: 713 aa
Translation table: 11
GC content: 72%
IMG OID: 646387928
Product: von Willebrand factor type A
Protein accession: YP_003267656
Protein GI: 262196447
COG category:
COG ID:
TIGRFAM ID:
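
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end coordinates are inclusive and that the gene length includes one terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 4495915
end_bp = 4498056
gene_length_bp = 2142
protein_length_aa = 713

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True: whole number of codons
print(coding_codons == protein_length_aa)  # True
```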


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00897171
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGAGCGCCA GCTTCCCCAT CGCGCTCTCG AGCGTGCTCG CGCTCGCGGC CGCGCTCGGC 
GGCCTGGCCG TCGTCGCCTA CATCCTCAAG ATGCGCCGGC GCCGCTTCGA GGTGCCGTTC
TCGGCGCTGT GGCACCGGGT GCTGGCCGAG AAGGACAGCC AGAGCCTGTT CAAGAAGCTC
AAGCGGCTGC TGTCGCTGCT GCTCCAGCTC GTCATCCTGG CGCTGCTGCT CTTCGCCGTC
ATCGACCCGC GCCTGGGCGA GGACGAGCGC GAGGCGCGCA GCGTGGTCAT CATCCTCGAC
GCCTCGGCGT CGATGAAGGC CACGGACGAG CGCGACGGCG GGCCCTCGGT GTTCGCCGAG
CCCGCCGGCG ATGGCGACGC CAGTGACGAC GATAGTGCGG GCGGCGACGG CAACAGCGGC
GGCGCGGATT CCGTCCAGGC GGACGACGAC GGCCCGCCGC GCACGCGCAT GGCCATCGCC
AAGGCGCGGG CGCGCGAGCT GCTCGACGCC ATGGGCGGCG GTGACGCGGC CATGATCATC
CGCATGGACG GGCAGACCAC GCCGCTGAGC CGCTTCGACA GCGATATGGC GCTGCTCAAG
CGCACGGTCT CGGGCATCGA GGCCAGCGAC ACCCCGGCGG ATCTGTCGCG GGCGCTGAGC
GCGGCCGCCG ACGCCCTGCG CGGGCGTGCC CAGCCGATGA TCGTGCTCAT CGGCGACGGC
GCCTACCCCG ACGAGGTGCG CGAGCGCGTC ATCTGGGAGC CGCTGCCCGA GGGCGCGGAC
TCCGAGGCGC GGCTCGACGC CATCGATCTC TCGGGCATCG ACGTGCGCTT CGTCCCGGTC
GGCCGCCGCG GCGACAATGT CGGCATCGTG GCCTTCAACG CCCGCCGCTA CCTCACCGAC
AAGACCAGCT ACGCGGTCTT CGTCGAGGTG CAGAACTTCG GCCAGGAGCC GGCCGCGCGC
AAGCTCGTGG TCTACAGCGG CAGCGATCCC ATCGATGTGC AAACCGTCGA GCTGGCCGCG
GGCGAGCGCC TGCGCAAGCT GTACCCCAAC CTCGGCGGCG GCCAGGGCAA CCGCCTGCGC
GCGGTGCTGC AGCCGGTCGA GGCCGGCGAG AATGGCGCCG ACATCTTCCC GCTCGACGAC
GAGGCCTTTG CGCTGCTGCC GGCGCGCAAG CGGCAAGAGG TCCTGCTGGT CACCGAGGAC
AACCTGTATC TCGAGGGCGC CATGCTGGTC TACGACAGCA TCCAGGTCGA CAAGCTGGTG
CCCGCGGAGT ACGAGCAGGC GCTGGCCGAG GAGCGCTTGC CCGCGTACGA CGCCGTGGTC
TTCGACGACT TCGCGCCGAG CGAGCTGCCG CCCGCGCCCA CCAACCTGAT GTATTTCGGA
CCGCGGGGCG AAGACAGCCC CTTCCCCATC CGCCGCACGG TGAGCGGGCC GCGCATCACC
GAGGTCAACG ACAGCCACCC GGTCATGCGC TGGGTGGTGC TGGCCGACGT CAACTTCGAC
GAGTCGGCGG TGTTCGCCGT GGACGCGGCC GCGGGCGAAT CGATGCTGGC CGCCTACGTG
CGCGATCCCC TCATCGCGGC CAGACGCGAG GGCGCGCGCA AGATCGTCGC CTTTGGCTTC
TCGCTCACCG GCACCGACCT CACCCTGCGC GTGGCCTTTC CGCTCATCCT GGTCAACGCC
CTGGACTGGT TCGCGGGCGA CGACGCCGAC CTCATCACCA CCTACCGCAC CGGGCAGCGC
TTCCGGGTGC CGCTCGACGG CGTCTTCGAC GTACCCGAGG TCGAGGTCGT GCTGCCCGAG
GGCCGGCGCA CGCGCGCGCC CGTGAGCGAG GGCCACGCCT CGTTCTACGG CCACCGCATC
GGCGTCCATC AGCTCACCGC GCGCGCCGCT CCCGGCCCCG ACGGCAGCGA CGGCGCCGCG
GGCCCGGTGA TCGCCCAGCT CGAGCTGGCC GCCAACCTGG CCAACCCCGC CGAGTCGCAG
GTCGCGCCCG CGCCCGCGCT GTCGCTGGGC GGCCGCGCGC TGCCCGCGCC CGAGGGTTTT
CGCGTGTCGG TGCGGCGCTC GCTGTGGCTG TATCTGGCCC TGGCGGCGCT GGCCCTGCTC
ATGCTCGAGT GGATCACCTA CCACCGCCGG ATCACGGTCT GA
 
Protein sequence
MSASFPIALS SVLALAAALG GLAVVAYILK MRRRRFEVPF SALWHRVLAE KDSQSLFKKL 
KRLLSLLLQL VILALLLFAV IDPRLGEDER EARSVVIILD ASASMKATDE RDGGPSVFAE
PAGDGDASDD DSAGGDGNSG GADSVQADDD GPPRTRMAIA KARARELLDA MGGGDAAMII
RMDGQTTPLS RFDSDMALLK RTVSGIEASD TPADLSRALS AAADALRGRA QPMIVLIGDG
AYPDEVRERV IWEPLPEGAD SEARLDAIDL SGIDVRFVPV GRRGDNVGIV AFNARRYLTD
KTSYAVFVEV QNFGQEPAAR KLVVYSGSDP IDVQTVELAA GERLRKLYPN LGGGQGNRLR
AVLQPVEAGE NGADIFPLDD EAFALLPARK RQEVLLVTED NLYLEGAMLV YDSIQVDKLV
PAEYEQALAE ERLPAYDAVV FDDFAPSELP PAPTNLMYFG PRGEDSPFPI RRTVSGPRIT
EVNDSHPVMR WVVLADVNFD ESAVFAVDAA AGESMLAAYV RDPLIAARRE GARKIVAFGF
SLTGTDLTLR VAFPLILVNA LDWFAGDDAD LITTYRTGQR FRVPLDGVFD VPEVEVVLPE
GRRTRAPVSE GHASFYGHRI GVHQLTARAA PGPDGSDGAA GPVIAQLELA ANLANPAESQ
VAPAPALSLG GRALPAPEGF RVSVRRSLWL YLALAALALL MLEWITYHRR ITV