Gene Information |
Locus tag | Hoch_3071 |
Symbol | |
ID | 8545459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 4235489 |
End bp | 4238692 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646387742 |
Product | PKD domain containing protein |
Protein accession | YP_003267470 |
Protein GI | 262196261 |
COG category | |
COG ID | |
TIGRFAM ID | |
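The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. Neither convention is stated in the record itself, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 4235489
END_BP = 4238692
GENE_LENGTH_BP = 3204
PROTEIN_LENGTH_AA = 1067


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for Hoch_3071, which is what one would expect for a gene that is neither spliced nor a pseudogene.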
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.192813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.491387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACCA CTCGATCAAT ACGATATCTG GCGTTGGCGC TGCTCGTCAG CGTGCCCGCG GGCTGTAGCA CGCCGCGCGA CAGCGGCGAC GACGGCGACA GCGACGGCGT CGAACTCGCC GACAAGCGCG GCACCGAGGC CAGCAAACGC GGCCAGGAGG TGCACGGCAC CATCATCGCC GACCTGCCGG CCATAGACGG CTCGCAGAAC GGACCGCTCG TACCCGTCCC CGACGTCCGC GTATGGGTCA AGAACGTGTC GAGCGGCGTG CAGAGCGACC CGGTCGAGAG CGATCTCGCC GGCCGCTACA TCATCCCGCG CCAGTACGAG GGCGATTACC TGCTGTGCTG GGACAAAGAC GGCTGGGAGG CCGCGTGCAC GCGCGAGCCC TTCCGCCTGA GCAAGACCAC CTACTTCCCC GGCCTCACCA AGATCGCGCC CGAGCGCGAC GACGGCAGCA GCGACACCGG CGTGATCCGC GGCCGCGTGC AGCTCGCCGA CGAGACCTCG TGCTGGTACG AGAGCGAGTT TTTCGCGCGC GAGGAGACCG CGCGCGTCGA GCTGCTCAAC ATTTTCGGCC AGGTGGTCCA GGACATCCGC GCCAACGACC ACGGCGACTT CGTGCTCACG CACGTGCCCT ACAGCTACTT CCAGGTGCGC AGCACCTGCG GCGAGGAGGT CCGGCGCGCC GGCCTCAGTT ACAGCGGCGT GGACATGTCC GGGGCCACGC CCATCGCTGC CCAGGAGCTG CCCAACCGAC GCCCGTCGGT GCACACCGCG GTGGCCTACG CCGACGGCGA GGGCGTGCGT CACGTGCGCG CCGGTGAGAT CGTCGAGGTC GTGGCCGAGG CCAGCGACCC CGACGGCGAC GATCTGCGCT TCGAGTGGAA GGTGCAGGAG GGCTCGGGCG AGCTGAGCAG CTCGAGCAGC GACGCGGTCG GCTGGCGGCT GCCCGAGGCG CCCGGCCTGC ACAGCCTGTA CGTGGTCGCC AGCGACGATC GCGGCGGCGT CCACCAGCAC AAGATCACGC TCGAGGTCGA GTCGCGCGAG GTGCTGTTCT CGGGCAAGGT CCTGGGCAGC GACGGCGCCG AGCTCGCGGG CGCCGTGGTC ACGGTCAACG GCAGGACGGC CGAGGCCGGG CAGGGCGGCG CGTTCGCGCT CTCGCTCGAG CGCAGCGACT CCTACACCCT GCACATCGAG GCCGAGGGCT ACGCCGAGCT GGGCAAGCGC GTGGACCTCG CGCTCAGCGG CAACCGCTGG GTGCTCACCC GCGCCCACCG CCAGACCATC GATCCCACCA TCGAGAACGT CATCATCGAC AAGCGCAAGG ACTGGCTCAA CCCGCGCGAC GACAAGGAGT ACCGGCGTCG CCCGGCGCGC GTGACCCTGC CCGCCGATGC CCTGGTCGAC GCCGAGGGTC GCCGCGCCGA TCCGGCCAAC GGCCCGTACT CGGCGTACAT CGCCACCATC GACCCCACCG GCGAGATGAT GGCCGGCGAC TTTGGCGCGC GCAATCTCGA CGGTGAGAGC CCCTACCTGG TGTCCTTTGG CGCCTTGTTC ATCGAGGTGC GCGACGCCGC CGGTCGCACC TACAACCTGG CCGAGGGCGA GCGCGCGCTG CTCGAGAGCC CCATCCAGGA TCCGCTGCTG CAGGAGGACG TGCCGTCCGA AATCGACATG TGGACCTTCG ACCCGGACAG CGGCGACTGG CTGCAGGACG AGATCAACGC GCGCCGCGAC GGCGACGTCT ACGTGACCGA GCTGGCCAGC TTCTCCACCC ACAACGCCGA CCTGCAGAAG 
ACCGATCCGG CGTGCGTGCG CGTCGTCGCC AGCCCGGCGC TGCTGGCGCT CGGCGACCTG GTGGCGCGCA TCGACGTGGC CACCGGCCCC AGCAGCACGC GCCGCTACGC GGTCAACATC GACGACCAGA ACAACGTGCT CTACAACCTG CCCGACAACG CTCCCTTCAC CCTCGAGCTG TTCCAGGGCG TGGCGCCGAA CGACGTGCTC ATCCACGCCG AGCCGGGCAA CACCGGCAGC CCGTGGTCGG CCACAGCCGG CGCGCCGCCG TATCCCTACT CGGCCTGCGA CGCCACCGTG ACCCTCGACG TGCCCGCGCT GCCGCCGGCG TTCCTGCAGT ACAGCAAGGG CACCGGCTCC GCGGCCCAGG CCGCCGGCTA CTATGCGGCC ATCGACCCGC TGGACGAGCG CACCACGCTC GGCGACTGGT GGGCCATCAA CGGCTTCGAC CCGAGCACGG GCGCCGGCGG CGTGCGCGCC TCCTTCGGCA ATGACAACGA CCTCGGCTTT GGCCGCGACA TGCACTGCCT GAGCAGCGGC GCCGACGTGG CCTGCTTTGT GACCAATTAC GGACAGCCGG ACCAAGACCC CGGCAACGCC GACCAGGCCC TGCTCGCCGA CACCACCCAG GCCGTGGCCA CCGTGACCAT GGAGTACTCG GCCGTGCCCG GGTATTCGTC CTCCGATCGC ATCGTCAAGT TCTACGTCTA CCAGGGCGGC GGACCCAGCG GCGTGCGTCT CGACAGCGCC GACCTCGACA AAACCGGGCC CAAGTTCGTG CCCAACCTGT GCCTGGTGTG CCACGGCGGC AACTACAACC CCGTGGATCC CGCCAACCCG AGCTTCAGCG AGATCAACGC CGGCGCCAGC TTCCGCGAGT TCGACACCCA CTCGTTCACC TATCCGGGCG CGTCGCCGCA AGCCGACCAG GAGGACGAAT TCTACGACCT CAACCAGCTC GTGCTGCTGA GCAACCCGGC GCCCGCGATC GTCGAATTGG TCGGCCAGTT CTACGCCAAT GGCCAGACCG TTGATGCCAA CGCTGTGCCC GCGGACTGGC AGGCGGCGCA GACCAGCGGC TCGAACCTGC CCGCGGGTCT GTACCTGGAC GTGGTCGGCA AATCCTGCCG CACCTGCCAC GTGGCCCAGC CCGACTACAA CCCGCTCGCG CTCAATAACA GCGCCTACCC CGACTGGAAC AGCTACAGCA TGTTCCGCGA TGTCCGCCAG TTCTCGCACT TCCTGGTGTG CGACGCCAAG ATCATGCCCA ACGCGCTGGT GACCTTCAAA AACTTCTGGC TCAGCCTGGG GCCGCATCGG CCCGAGCGCT TCGCCGACTT TGTCGACCCG GCGTCCGGCT GGCCGAGCTC GTTGAGCAAC GACATCGGCC CCTGCGCGCC CTGA
|
Protein sequence | MSTTRSIRYL ALALLVSVPA GCSTPRDSGD DGDSDGVELA DKRGTEASKR GQEVHGTIIA DLPAIDGSQN GPLVPVPDVR VWVKNVSSGV QSDPVESDLA GRYIIPRQYE GDYLLCWDKD GWEAACTREP FRLSKTTYFP GLTKIAPERD DGSSDTGVIR GRVQLADETS CWYESEFFAR EETARVELLN IFGQVVQDIR ANDHGDFVLT HVPYSYFQVR STCGEEVRRA GLSYSGVDMS GATPIAAQEL PNRRPSVHTA VAYADGEGVR HVRAGEIVEV VAEASDPDGD DLRFEWKVQE GSGELSSSSS DAVGWRLPEA PGLHSLYVVA SDDRGGVHQH KITLEVESRE VLFSGKVLGS DGAELAGAVV TVNGRTAEAG QGGAFALSLE RSDSYTLHIE AEGYAELGKR VDLALSGNRW VLTRAHRQTI DPTIENVIID KRKDWLNPRD DKEYRRRPAR VTLPADALVD AEGRRADPAN GPYSAYIATI DPTGEMMAGD FGARNLDGES PYLVSFGALF IEVRDAAGRT YNLAEGERAL LESPIQDPLL QEDVPSEIDM WTFDPDSGDW LQDEINARRD GDVYVTELAS FSTHNADLQK TDPACVRVVA SPALLALGDL VARIDVATGP SSTRRYAVNI DDQNNVLYNL PDNAPFTLEL FQGVAPNDVL IHAEPGNTGS PWSATAGAPP YPYSACDATV TLDVPALPPA FLQYSKGTGS AAQAAGYYAA IDPLDERTTL GDWWAINGFD PSTGAGGVRA SFGNDNDLGF GRDMHCLSSG ADVACFVTNY GQPDQDPGNA DQALLADTTQ AVATVTMEYS AVPGYSSSDR IVKFYVYQGG GPSGVRLDSA DLDKTGPKFV PNLCLVCHGG NYNPVDPANP SFSEINAGAS FREFDTHSFT YPGASPQADQ EDEFYDLNQL VLLSNPAPAI VELVGQFYAN GQTVDANAVP ADWQAAQTSG SNLPAGLYLD VVGKSCRTCH VAQPDYNPLA LNNSAYPDWN SYSMFRDVRQ FSHFLVCDAK IMPNALVTFK NFWLSLGPHR PERFADFVDP ASGWPSSLSN DIGPCAP
|
| |
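The sequences in this record are printed in space-separated blocks of ten residues, so they need to be cleaned before any downstream use. The following is a minimal sketch of that step, along with a GC-fraction helper and a stop-codon check. The helper names are hypothetical, the stop codon set is the one used by translation table 11, and only short excerpts of the gene sequence are used here rather than the full 3204 bp.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked):
    """Remove the spaces and line breaks that group residues into blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna):
    """True if the last three bases form a stop codon."""
    return dna[-3:] in STOP_CODONS


# Excerpts only: the first three and the last two blocks of the gene sequence.
head = clean_sequence("ATGTCTACCA CTCGATCAAT ACGATATCTG")
tail = clean_sequence("CCTGCGCGCC CTGA")

print(head.startswith("ATG"))
print(ends_with_stop(tail))
```

Applying `gc_fraction` to the complete cleaned gene sequence should give a value comparable to the GC content field, although the record does not say how that percentage was rounded.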