Gene Hoch_3071 details


Gene Information

Locus tag: Hoch_3071
Symbol:
ID: 8545459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4235489
End bp: 4238692
Gene Length: 3204 bp
Protein Length: 1067 aa
Translation table: 11
GC content: 69%
IMG OID: 646387742
Product: PKD domain containing protein
Protein accession: YP_003267470
Protein GI: 262196261
COG category:
COG ID:
TIGRFAM ID:
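
The start, end, and length values listed above agree with each other, and the check is plain arithmetic. The sketch below assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the record states neither convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4235489, 4238692)
print(length, protein_length(length))  # 3204 1067
```

Under these assumptions, 3204 bp corresponds to 1068 codons, of which 1067 encode residues.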


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.192813
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.491387
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACCA CTCGATCAAT ACGATATCTG GCGTTGGCGC TGCTCGTCAG CGTGCCCGCG 
GGCTGTAGCA CGCCGCGCGA CAGCGGCGAC GACGGCGACA GCGACGGCGT CGAACTCGCC
GACAAGCGCG GCACCGAGGC CAGCAAACGC GGCCAGGAGG TGCACGGCAC CATCATCGCC
GACCTGCCGG CCATAGACGG CTCGCAGAAC GGACCGCTCG TACCCGTCCC CGACGTCCGC
GTATGGGTCA AGAACGTGTC GAGCGGCGTG CAGAGCGACC CGGTCGAGAG CGATCTCGCC
GGCCGCTACA TCATCCCGCG CCAGTACGAG GGCGATTACC TGCTGTGCTG GGACAAAGAC
GGCTGGGAGG CCGCGTGCAC GCGCGAGCCC TTCCGCCTGA GCAAGACCAC CTACTTCCCC
GGCCTCACCA AGATCGCGCC CGAGCGCGAC GACGGCAGCA GCGACACCGG CGTGATCCGC
GGCCGCGTGC AGCTCGCCGA CGAGACCTCG TGCTGGTACG AGAGCGAGTT TTTCGCGCGC
GAGGAGACCG CGCGCGTCGA GCTGCTCAAC ATTTTCGGCC AGGTGGTCCA GGACATCCGC
GCCAACGACC ACGGCGACTT CGTGCTCACG CACGTGCCCT ACAGCTACTT CCAGGTGCGC
AGCACCTGCG GCGAGGAGGT CCGGCGCGCC GGCCTCAGTT ACAGCGGCGT GGACATGTCC
GGGGCCACGC CCATCGCTGC CCAGGAGCTG CCCAACCGAC GCCCGTCGGT GCACACCGCG
GTGGCCTACG CCGACGGCGA GGGCGTGCGT CACGTGCGCG CCGGTGAGAT CGTCGAGGTC
GTGGCCGAGG CCAGCGACCC CGACGGCGAC GATCTGCGCT TCGAGTGGAA GGTGCAGGAG
GGCTCGGGCG AGCTGAGCAG CTCGAGCAGC GACGCGGTCG GCTGGCGGCT GCCCGAGGCG
CCCGGCCTGC ACAGCCTGTA CGTGGTCGCC AGCGACGATC GCGGCGGCGT CCACCAGCAC
AAGATCACGC TCGAGGTCGA GTCGCGCGAG GTGCTGTTCT CGGGCAAGGT CCTGGGCAGC
GACGGCGCCG AGCTCGCGGG CGCCGTGGTC ACGGTCAACG GCAGGACGGC CGAGGCCGGG
CAGGGCGGCG CGTTCGCGCT CTCGCTCGAG CGCAGCGACT CCTACACCCT GCACATCGAG
GCCGAGGGCT ACGCCGAGCT GGGCAAGCGC GTGGACCTCG CGCTCAGCGG CAACCGCTGG
GTGCTCACCC GCGCCCACCG CCAGACCATC GATCCCACCA TCGAGAACGT CATCATCGAC
AAGCGCAAGG ACTGGCTCAA CCCGCGCGAC GACAAGGAGT ACCGGCGTCG CCCGGCGCGC
GTGACCCTGC CCGCCGATGC CCTGGTCGAC GCCGAGGGTC GCCGCGCCGA TCCGGCCAAC
GGCCCGTACT CGGCGTACAT CGCCACCATC GACCCCACCG GCGAGATGAT GGCCGGCGAC
TTTGGCGCGC GCAATCTCGA CGGTGAGAGC CCCTACCTGG TGTCCTTTGG CGCCTTGTTC
ATCGAGGTGC GCGACGCCGC CGGTCGCACC TACAACCTGG CCGAGGGCGA GCGCGCGCTG
CTCGAGAGCC CCATCCAGGA TCCGCTGCTG CAGGAGGACG TGCCGTCCGA AATCGACATG
TGGACCTTCG ACCCGGACAG CGGCGACTGG CTGCAGGACG AGATCAACGC GCGCCGCGAC
GGCGACGTCT ACGTGACCGA GCTGGCCAGC TTCTCCACCC ACAACGCCGA CCTGCAGAAG
ACCGATCCGG CGTGCGTGCG CGTCGTCGCC AGCCCGGCGC TGCTGGCGCT CGGCGACCTG
GTGGCGCGCA TCGACGTGGC CACCGGCCCC AGCAGCACGC GCCGCTACGC GGTCAACATC
GACGACCAGA ACAACGTGCT CTACAACCTG CCCGACAACG CTCCCTTCAC CCTCGAGCTG
TTCCAGGGCG TGGCGCCGAA CGACGTGCTC ATCCACGCCG AGCCGGGCAA CACCGGCAGC
CCGTGGTCGG CCACAGCCGG CGCGCCGCCG TATCCCTACT CGGCCTGCGA CGCCACCGTG
ACCCTCGACG TGCCCGCGCT GCCGCCGGCG TTCCTGCAGT ACAGCAAGGG CACCGGCTCC
GCGGCCCAGG CCGCCGGCTA CTATGCGGCC ATCGACCCGC TGGACGAGCG CACCACGCTC
GGCGACTGGT GGGCCATCAA CGGCTTCGAC CCGAGCACGG GCGCCGGCGG CGTGCGCGCC
TCCTTCGGCA ATGACAACGA CCTCGGCTTT GGCCGCGACA TGCACTGCCT GAGCAGCGGC
GCCGACGTGG CCTGCTTTGT GACCAATTAC GGACAGCCGG ACCAAGACCC CGGCAACGCC
GACCAGGCCC TGCTCGCCGA CACCACCCAG GCCGTGGCCA CCGTGACCAT GGAGTACTCG
GCCGTGCCCG GGTATTCGTC CTCCGATCGC ATCGTCAAGT TCTACGTCTA CCAGGGCGGC
GGACCCAGCG GCGTGCGTCT CGACAGCGCC GACCTCGACA AAACCGGGCC CAAGTTCGTG
CCCAACCTGT GCCTGGTGTG CCACGGCGGC AACTACAACC CCGTGGATCC CGCCAACCCG
AGCTTCAGCG AGATCAACGC CGGCGCCAGC TTCCGCGAGT TCGACACCCA CTCGTTCACC
TATCCGGGCG CGTCGCCGCA AGCCGACCAG GAGGACGAAT TCTACGACCT CAACCAGCTC
GTGCTGCTGA GCAACCCGGC GCCCGCGATC GTCGAATTGG TCGGCCAGTT CTACGCCAAT
GGCCAGACCG TTGATGCCAA CGCTGTGCCC GCGGACTGGC AGGCGGCGCA GACCAGCGGC
TCGAACCTGC CCGCGGGTCT GTACCTGGAC GTGGTCGGCA AATCCTGCCG CACCTGCCAC
GTGGCCCAGC CCGACTACAA CCCGCTCGCG CTCAATAACA GCGCCTACCC CGACTGGAAC
AGCTACAGCA TGTTCCGCGA TGTCCGCCAG TTCTCGCACT TCCTGGTGTG CGACGCCAAG
ATCATGCCCA ACGCGCTGGT GACCTTCAAA AACTTCTGGC TCAGCCTGGG GCCGCATCGG
CCCGAGCGCT TCGCCGACTT TGTCGACCCG GCGTCCGGCT GGCCGAGCTC GTTGAGCAAC
GACATCGGCC CCTGCGCGCC CTGA
 
Protein sequence
MSTTRSIRYL ALALLVSVPA GCSTPRDSGD DGDSDGVELA DKRGTEASKR GQEVHGTIIA 
DLPAIDGSQN GPLVPVPDVR VWVKNVSSGV QSDPVESDLA GRYIIPRQYE GDYLLCWDKD
GWEAACTREP FRLSKTTYFP GLTKIAPERD DGSSDTGVIR GRVQLADETS CWYESEFFAR
EETARVELLN IFGQVVQDIR ANDHGDFVLT HVPYSYFQVR STCGEEVRRA GLSYSGVDMS
GATPIAAQEL PNRRPSVHTA VAYADGEGVR HVRAGEIVEV VAEASDPDGD DLRFEWKVQE
GSGELSSSSS DAVGWRLPEA PGLHSLYVVA SDDRGGVHQH KITLEVESRE VLFSGKVLGS
DGAELAGAVV TVNGRTAEAG QGGAFALSLE RSDSYTLHIE AEGYAELGKR VDLALSGNRW
VLTRAHRQTI DPTIENVIID KRKDWLNPRD DKEYRRRPAR VTLPADALVD AEGRRADPAN
GPYSAYIATI DPTGEMMAGD FGARNLDGES PYLVSFGALF IEVRDAAGRT YNLAEGERAL
LESPIQDPLL QEDVPSEIDM WTFDPDSGDW LQDEINARRD GDVYVTELAS FSTHNADLQK
TDPACVRVVA SPALLALGDL VARIDVATGP SSTRRYAVNI DDQNNVLYNL PDNAPFTLEL
FQGVAPNDVL IHAEPGNTGS PWSATAGAPP YPYSACDATV TLDVPALPPA FLQYSKGTGS
AAQAAGYYAA IDPLDERTTL GDWWAINGFD PSTGAGGVRA SFGNDNDLGF GRDMHCLSSG
ADVACFVTNY GQPDQDPGNA DQALLADTTQ AVATVTMEYS AVPGYSSSDR IVKFYVYQGG
GPSGVRLDSA DLDKTGPKFV PNLCLVCHGG NYNPVDPANP SFSEINAGAS FREFDTHSFT
YPGASPQADQ EDEFYDLNQL VLLSNPAPAI VELVGQFYAN GQTVDANAVP ADWQAAQTSG
SNLPAGLYLD VVGKSCRTCH VAQPDYNPLA LNNSAYPDWN SYSMFRDVRQ FSHFLVCDAK
IMPNALVTFK NFWLSLGPHR PERFADFVDP ASGWPSSLSN DIGPCAP
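
The sequences above are displayed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below is one possible way to do that in plain Python and then compute a GC fraction and translate the gene sequence. It is an illustration, not the method the database uses. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the amino-acid assignments shared by NCBI translation tables 1 and 11; the alternative start codons that table 11 permits are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G).
# Tables 1 and 11 share these assignments; they differ only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the display spaces and line breaks and upper-case the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = clean("ATGTCTACCA CTCGATCAAT ACGATATCTG")
print(translate(first_blocks))  # MSTTRSIRYL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` would allow a comparison with the GC content reported in the record.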