Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_6627 |
Symbol | |
ID | 8549044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 9083893 |
End bp | 9087198 |
Gene Length | 3306 bp |
Protein Length | 1101 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646391287 |
Product | NLP/P60 protein |
Protein accession | YP_003270986 |
Protein GI | 262199777 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.813247 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000190725 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCACTCTG ATGAGCGCGG CGGCGTCGGC TTCAACCGCA GCCGCAGCCG CAGCCGCAGC CGTAGCCGTA GCCGTGGTCG CGGTATCGGC GTGCTCGTCG CCCTCGGGGC GGCGTCGCTG GTGGCCCTGC TCGGCTGCGG CGGCAAAGCC GAGCAGACGC CGCGGTCCGA AGCGCCGGTC GCCAGCGCCG AGGACGAGCA GGCGATGGAA TCGCCCACGT GTCCGGCCAA CGCGTCGGCG CCGGCGTTGC CGCCCGGTAC CGAGCCCGCC CAGGTGCAGC TCGACTACTG GCTCGAGCGC GTCGGCGCGG CTCACGATCT CGATCAGGTG CTGCTGTCGC CGGCCGAGAT CGCGCGCCTC AACCAGGCCC AGCGCGTGCC CCGCGAGCAC TTCCACGCCC AGCGCGATCT GCTCGAGCCG CTCGCCGAGG ACGAGATCGC GCGCGACATC GACGAGCGCG TGCTCTGGTA CCGCGAGCGC TTCGCCAGCG GCCGTTACGA GAGCGCGGCC GGCGGCGCGC TGCCGGACGA GCTGCAGGCG GATGTGAAAC CGTCCATCCG TCCGAGCTTG CGCGTGGCCC TGGGCCAGGT GCCGTTTCGC TGCGCGCCCG TGGATACGGC CTTCCTGGCC CCGGGCGGCA ATCCCAACAT CGATCACAAC CGCTGCAGCA CCGCGCACGC CCAGGAGCCG GTGGAGGTGC TGGCCGACTG GCCAGGGCCG ATGCAGCTCG CGCGCACGCG CTACACCTGG GGCTTCATCG CCGACGACGC GCCGCTGTCA CCGCCGCTGC CGGCGGCCCA GGCGCAGCGC TTCGTGCGCG GTCCCTCGGT GACCCTGGCG TCCGATGCCC AGCTCGACGA CGCCCTGCCG CTCGGCCGGG TGCTGCCGCG CGGTCGCGGC GATACCGTGC TCGTGGCCAC CGCGGACGGC GTGCGCGAGC TCGCCGTGCC CGCGCAGGCG CTGCGTTCGA CGCCGCGTCC GCTCACCCGC CGCGCCTTCC TCGAGGAGGC GTTTCGCTAC CTCGACACGC CGTACGGTTA TGGCGGCACC GGCGGCGGCC GCGACTGCTC GCGCCTGATG CTCGACGTGT TCGAGAGCTT TGGCATCGCG CTGCCGCGGC ACAGCGCCTG GCAGGCGCGC GCCGGTTCCT ACAGCATCGA CGTAGCCAGC GCCAGCGAGA CCGAGCGGCT ATTCCTCATC GACGCCGCGG CCGAGCGCGG CATCGTGCTC CTGCATCTGC CCGGCCACAT CATGCTGTAC CTCGGCCGCG ACCAGGGCGA GCGCCCCATG GCCATGCACG CGCTGGCCGA ATACAAGGCG CGCTGTCCGG CCGACGAGGG CGAGACCCTG TTCTACGCCG ATCGCGTGCT GGTGAGCGAT CTCGAGCTCG GCCGCGACAC CGAGAAGACC GCGTTCATCG AGCGTATCGA CCGCATCACG GTGTTTGGCG AGGCGCCCGG CCCGGAGCTG GCCGGCTCGG CCGAGCTGCG ACCGGCCGCG CCCATGAGCG CGCCCGCGAA GCGCGCGTGT CGGCGCAGCG GCGGCGCCGA ATTGTTCGTG ACCCCGGCGC AGCCCGACAG CGGCCGCCCG CTGCGCGTGG TCGCCACCGC GTCCGAAAAC CCCGGCCCGG CCGCCATCAC CCTGATCGAC CCCAGCGGCA CCGCGCACAC GCCCGCGATG GTGCAGCTCG GCGGTCCGCC CTACGGCTAC GTGGCCGCGA TCGAGGCGCC GGCGGCCGGC ACCTGGACCG CCATGTTCGG CGACGGCGAC GAGCTGCGCG CGTGTGAGCG CATCCGCGTG CGCAGCCGGC AGCGCGCGGC CCCGGGTGAC GACGACGCCG CGGGCGTGTG GGCGATTCGC CGCGCTTGGG ACCAGAACGC CGAGGATCTC TACTCGGTGT TCGTCGAGCG CTTGTTCGAC TATCCGCTCG ACGAAGATCT CACCTGGGAC GGCTTCCATC ACCTCGTGCG CGATCGCGAT CGCAACATCC TCTACGATCA TCTCGGCCAC GGCGAGGATG CCGATCTGGT GCTGGTGCCC GACTGCGCCG ATCTGCCCTA CACCCTGCGC GCCTACTTCG CCTGGAAGCT GGGCTTGCCC TTTGGCTTCC ACGATTGCAA TCGCGCCCGT CCGGGCCGGC CGCCGCGCTG CGAGCCGCAC GGCGAGAACC TGATGTCGCG CGCCGAGCTC AGCACCCGCT CGCTGAGCGA ACGCAACAGC TCGGACCTGC ACCCCGACGT GGTCGCCTTC GCCCGCTTCC TCGACCGCGA GCTGCGCCGC GAGGTGCACT CGTCGAGCGG GCGCACGCAT CCCGACGACG ACGAGACCGA CTTCTACCCG GTGCCGCTCA CGCGCTCGGC GCTGCGCCCG GGCACGCTGT TCACCGATCC CTACGGCCAC CTGCTGGTCA TCGCCGATTG GGTGCCGCAG GGCGCGAGCA GCTACGGCGT GCTCATCGGC GCCGACGCGC AGCCCGACGG CACCGTCGGC CGACGCCGCT TCTGGCGCGG CTCCTTCCTC TTCGACCCCG ACACCGGCAG CGGCGGAGCC GGCTTCAAGG CCGTGCGCCC CTGGTCTCGC GGCGACGACG GCGAGCGCCT GGTCACCAGC GACAACCGCT CGCTGCGCCG ACGCAGCCCG ACGCCGTTCA GCAAGCAGCA GTACGAGGGC TCGATCGACG ACTTCTACGA CGCCATGGCC GCCCTGATCA ATCCGCGGCC GCTCGATCCC GCGGCCATGC AGGGATCGCT GGTCGACGCG CTCGAGGAGA CCGTGTCGCG GCGGGTCACC TCGGTCGAAA ACGGCGAGGC CTTCATGCGC GCGCGCGGCT TCTCGACCAT CGACATGCCC GATGGCTCGC GCATCTTCCT CACCACCGGG CCGTGGGAGG ACTACGCCAC GCCCTCGCGC GACCTGCGCT TGCTGATCTC GATCGACACC GTGGTCGAGT TCCCCGACGC CGTGGCGCGC GCGCCCGAGC GCTACGGCAT CCGCGGCAGC GAAGCCGAAA TCGCCGAGCA GATCGCGGCG CTGCGCGAGG CGCTGGCGGC GGCGCTGGCG GCGCGGCGGT TTTCGTACAC GCGTTCCGAC GGCAGCGCCT TCGAGCTGAG CCTGGGCGAC GTGGTCGAGC GCGCCAAGCG CCTGGAGATG GCCTACAACC CCAACGACTG CATCGAGACC CGCTGGGGGG CGCCCGCCGG CAGCGAGGAA GCCAAGACCT GCAAGCGCCA GGCGCCAGCC GAGCAGCGCG CGCGCATGAC GCGCTACCGC GATTGGTTCT CGAGTCGCAA GCGGCCGGCC ACCTGA
|
Protein sequence | MHSDERGGVG FNRSRSRSRS RSRSRGRGIG VLVALGAASL VALLGCGGKA EQTPRSEAPV ASAEDEQAME SPTCPANASA PALPPGTEPA QVQLDYWLER VGAAHDLDQV LLSPAEIARL NQAQRVPREH FHAQRDLLEP LAEDEIARDI DERVLWYRER FASGRYESAA GGALPDELQA DVKPSIRPSL RVALGQVPFR CAPVDTAFLA PGGNPNIDHN RCSTAHAQEP VEVLADWPGP MQLARTRYTW GFIADDAPLS PPLPAAQAQR FVRGPSVTLA SDAQLDDALP LGRVLPRGRG DTVLVATADG VRELAVPAQA LRSTPRPLTR RAFLEEAFRY LDTPYGYGGT GGGRDCSRLM LDVFESFGIA LPRHSAWQAR AGSYSIDVAS ASETERLFLI DAAAERGIVL LHLPGHIMLY LGRDQGERPM AMHALAEYKA RCPADEGETL FYADRVLVSD LELGRDTEKT AFIERIDRIT VFGEAPGPEL AGSAELRPAA PMSAPAKRAC RRSGGAELFV TPAQPDSGRP LRVVATASEN PGPAAITLID PSGTAHTPAM VQLGGPPYGY VAAIEAPAAG TWTAMFGDGD ELRACERIRV RSRQRAAPGD DDAAGVWAIR RAWDQNAEDL YSVFVERLFD YPLDEDLTWD GFHHLVRDRD RNILYDHLGH GEDADLVLVP DCADLPYTLR AYFAWKLGLP FGFHDCNRAR PGRPPRCEPH GENLMSRAEL STRSLSERNS SDLHPDVVAF ARFLDRELRR EVHSSSGRTH PDDDETDFYP VPLTRSALRP GTLFTDPYGH LLVIADWVPQ GASSYGVLIG ADAQPDGTVG RRRFWRGSFL FDPDTGSGGA GFKAVRPWSR GDDGERLVTS DNRSLRRRSP TPFSKQQYEG SIDDFYDAMA ALINPRPLDP AAMQGSLVDA LEETVSRRVT SVENGEAFMR ARGFSTIDMP DGSRIFLTTG PWEDYATPSR DLRLLISIDT VVEFPDAVAR APERYGIRGS EAEIAEQIAA LREALAAALA ARRFSYTRSD GSAFELSLGD VVERAKRLEM AYNPNDCIET RWGAPAGSEE AKTCKRQAPA EQRARMTRYR DWFSSRKRPA T
|
| |