Gene Hoch_6627 details


Gene Information

Locus tag: Hoch_6627
Symbol:
ID: 8549044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 9083893
End bp: 9087198
Gene Length: 3306 bp
Protein Length: 1101 aa
Translation table: 11
GC content: 72%
IMG OID: 646391287
Product: NLP/P60 protein
Protein accession: YP_003270986
Protein GI: 262199777
COG category:
COG ID:
TIGRFAM ID:
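
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 9083893
END_BP = 9087198
GENE_LENGTH_BP = 3306
PROTEIN_LENGTH_AA = 1101


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, End bp minus Start bp plus one gives the 3306 bp gene length, and 3306 / 3 codons minus the stop codon gives the 1101 aa protein length.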


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.813247
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000190725
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCACTCTG ATGAGCGCGG CGGCGTCGGC TTCAACCGCA GCCGCAGCCG CAGCCGCAGC 
CGTAGCCGTA GCCGTGGTCG CGGTATCGGC GTGCTCGTCG CCCTCGGGGC GGCGTCGCTG
GTGGCCCTGC TCGGCTGCGG CGGCAAAGCC GAGCAGACGC CGCGGTCCGA AGCGCCGGTC
GCCAGCGCCG AGGACGAGCA GGCGATGGAA TCGCCCACGT GTCCGGCCAA CGCGTCGGCG
CCGGCGTTGC CGCCCGGTAC CGAGCCCGCC CAGGTGCAGC TCGACTACTG GCTCGAGCGC
GTCGGCGCGG CTCACGATCT CGATCAGGTG CTGCTGTCGC CGGCCGAGAT CGCGCGCCTC
AACCAGGCCC AGCGCGTGCC CCGCGAGCAC TTCCACGCCC AGCGCGATCT GCTCGAGCCG
CTCGCCGAGG ACGAGATCGC GCGCGACATC GACGAGCGCG TGCTCTGGTA CCGCGAGCGC
TTCGCCAGCG GCCGTTACGA GAGCGCGGCC GGCGGCGCGC TGCCGGACGA GCTGCAGGCG
GATGTGAAAC CGTCCATCCG TCCGAGCTTG CGCGTGGCCC TGGGCCAGGT GCCGTTTCGC
TGCGCGCCCG TGGATACGGC CTTCCTGGCC CCGGGCGGCA ATCCCAACAT CGATCACAAC
CGCTGCAGCA CCGCGCACGC CCAGGAGCCG GTGGAGGTGC TGGCCGACTG GCCAGGGCCG
ATGCAGCTCG CGCGCACGCG CTACACCTGG GGCTTCATCG CCGACGACGC GCCGCTGTCA
CCGCCGCTGC CGGCGGCCCA GGCGCAGCGC TTCGTGCGCG GTCCCTCGGT GACCCTGGCG
TCCGATGCCC AGCTCGACGA CGCCCTGCCG CTCGGCCGGG TGCTGCCGCG CGGTCGCGGC
GATACCGTGC TCGTGGCCAC CGCGGACGGC GTGCGCGAGC TCGCCGTGCC CGCGCAGGCG
CTGCGTTCGA CGCCGCGTCC GCTCACCCGC CGCGCCTTCC TCGAGGAGGC GTTTCGCTAC
CTCGACACGC CGTACGGTTA TGGCGGCACC GGCGGCGGCC GCGACTGCTC GCGCCTGATG
CTCGACGTGT TCGAGAGCTT TGGCATCGCG CTGCCGCGGC ACAGCGCCTG GCAGGCGCGC
GCCGGTTCCT ACAGCATCGA CGTAGCCAGC GCCAGCGAGA CCGAGCGGCT ATTCCTCATC
GACGCCGCGG CCGAGCGCGG CATCGTGCTC CTGCATCTGC CCGGCCACAT CATGCTGTAC
CTCGGCCGCG ACCAGGGCGA GCGCCCCATG GCCATGCACG CGCTGGCCGA ATACAAGGCG
CGCTGTCCGG CCGACGAGGG CGAGACCCTG TTCTACGCCG ATCGCGTGCT GGTGAGCGAT
CTCGAGCTCG GCCGCGACAC CGAGAAGACC GCGTTCATCG AGCGTATCGA CCGCATCACG
GTGTTTGGCG AGGCGCCCGG CCCGGAGCTG GCCGGCTCGG CCGAGCTGCG ACCGGCCGCG
CCCATGAGCG CGCCCGCGAA GCGCGCGTGT CGGCGCAGCG GCGGCGCCGA ATTGTTCGTG
ACCCCGGCGC AGCCCGACAG CGGCCGCCCG CTGCGCGTGG TCGCCACCGC GTCCGAAAAC
CCCGGCCCGG CCGCCATCAC CCTGATCGAC CCCAGCGGCA CCGCGCACAC GCCCGCGATG
GTGCAGCTCG GCGGTCCGCC CTACGGCTAC GTGGCCGCGA TCGAGGCGCC GGCGGCCGGC
ACCTGGACCG CCATGTTCGG CGACGGCGAC GAGCTGCGCG CGTGTGAGCG CATCCGCGTG
CGCAGCCGGC AGCGCGCGGC CCCGGGTGAC GACGACGCCG CGGGCGTGTG GGCGATTCGC
CGCGCTTGGG ACCAGAACGC CGAGGATCTC TACTCGGTGT TCGTCGAGCG CTTGTTCGAC
TATCCGCTCG ACGAAGATCT CACCTGGGAC GGCTTCCATC ACCTCGTGCG CGATCGCGAT
CGCAACATCC TCTACGATCA TCTCGGCCAC GGCGAGGATG CCGATCTGGT GCTGGTGCCC
GACTGCGCCG ATCTGCCCTA CACCCTGCGC GCCTACTTCG CCTGGAAGCT GGGCTTGCCC
TTTGGCTTCC ACGATTGCAA TCGCGCCCGT CCGGGCCGGC CGCCGCGCTG CGAGCCGCAC
GGCGAGAACC TGATGTCGCG CGCCGAGCTC AGCACCCGCT CGCTGAGCGA ACGCAACAGC
TCGGACCTGC ACCCCGACGT GGTCGCCTTC GCCCGCTTCC TCGACCGCGA GCTGCGCCGC
GAGGTGCACT CGTCGAGCGG GCGCACGCAT CCCGACGACG ACGAGACCGA CTTCTACCCG
GTGCCGCTCA CGCGCTCGGC GCTGCGCCCG GGCACGCTGT TCACCGATCC CTACGGCCAC
CTGCTGGTCA TCGCCGATTG GGTGCCGCAG GGCGCGAGCA GCTACGGCGT GCTCATCGGC
GCCGACGCGC AGCCCGACGG CACCGTCGGC CGACGCCGCT TCTGGCGCGG CTCCTTCCTC
TTCGACCCCG ACACCGGCAG CGGCGGAGCC GGCTTCAAGG CCGTGCGCCC CTGGTCTCGC
GGCGACGACG GCGAGCGCCT GGTCACCAGC GACAACCGCT CGCTGCGCCG ACGCAGCCCG
ACGCCGTTCA GCAAGCAGCA GTACGAGGGC TCGATCGACG ACTTCTACGA CGCCATGGCC
GCCCTGATCA ATCCGCGGCC GCTCGATCCC GCGGCCATGC AGGGATCGCT GGTCGACGCG
CTCGAGGAGA CCGTGTCGCG GCGGGTCACC TCGGTCGAAA ACGGCGAGGC CTTCATGCGC
GCGCGCGGCT TCTCGACCAT CGACATGCCC GATGGCTCGC GCATCTTCCT CACCACCGGG
CCGTGGGAGG ACTACGCCAC GCCCTCGCGC GACCTGCGCT TGCTGATCTC GATCGACACC
GTGGTCGAGT TCCCCGACGC CGTGGCGCGC GCGCCCGAGC GCTACGGCAT CCGCGGCAGC
GAAGCCGAAA TCGCCGAGCA GATCGCGGCG CTGCGCGAGG CGCTGGCGGC GGCGCTGGCG
GCGCGGCGGT TTTCGTACAC GCGTTCCGAC GGCAGCGCCT TCGAGCTGAG CCTGGGCGAC
GTGGTCGAGC GCGCCAAGCG CCTGGAGATG GCCTACAACC CCAACGACTG CATCGAGACC
CGCTGGGGGG CGCCCGCCGG CAGCGAGGAA GCCAAGACCT GCAAGCGCCA GGCGCCAGCC
GAGCAGCGCG CGCGCATGAC GCGCTACCGC GATTGGTTCT CGAGTCGCAA GCGGCCGGCC
ACCTGA
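
The GC content field reported for this gene (72%) can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as printed, in space-separated blocks of ten bases. The helper names are hypothetical, and the example uses only the first printed line of the gene, so its result need not match the whole-gene figure exactly.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First printed line of the gene sequence in this record.
first_line = (
    "ATGCACTCTG ATGAGCGCGG CGGCGTCGGC "
    "TTCAACCGCA GCCGCAGCCG CAGCCGCAGC"
)
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full gene sequence to `gc_fraction` instead of `first_line` gives the value to compare against the reported GC content.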
 
Protein sequence
MHSDERGGVG FNRSRSRSRS RSRSRGRGIG VLVALGAASL VALLGCGGKA EQTPRSEAPV 
ASAEDEQAME SPTCPANASA PALPPGTEPA QVQLDYWLER VGAAHDLDQV LLSPAEIARL
NQAQRVPREH FHAQRDLLEP LAEDEIARDI DERVLWYRER FASGRYESAA GGALPDELQA
DVKPSIRPSL RVALGQVPFR CAPVDTAFLA PGGNPNIDHN RCSTAHAQEP VEVLADWPGP
MQLARTRYTW GFIADDAPLS PPLPAAQAQR FVRGPSVTLA SDAQLDDALP LGRVLPRGRG
DTVLVATADG VRELAVPAQA LRSTPRPLTR RAFLEEAFRY LDTPYGYGGT GGGRDCSRLM
LDVFESFGIA LPRHSAWQAR AGSYSIDVAS ASETERLFLI DAAAERGIVL LHLPGHIMLY
LGRDQGERPM AMHALAEYKA RCPADEGETL FYADRVLVSD LELGRDTEKT AFIERIDRIT
VFGEAPGPEL AGSAELRPAA PMSAPAKRAC RRSGGAELFV TPAQPDSGRP LRVVATASEN
PGPAAITLID PSGTAHTPAM VQLGGPPYGY VAAIEAPAAG TWTAMFGDGD ELRACERIRV
RSRQRAAPGD DDAAGVWAIR RAWDQNAEDL YSVFVERLFD YPLDEDLTWD GFHHLVRDRD
RNILYDHLGH GEDADLVLVP DCADLPYTLR AYFAWKLGLP FGFHDCNRAR PGRPPRCEPH
GENLMSRAEL STRSLSERNS SDLHPDVVAF ARFLDRELRR EVHSSSGRTH PDDDETDFYP
VPLTRSALRP GTLFTDPYGH LLVIADWVPQ GASSYGVLIG ADAQPDGTVG RRRFWRGSFL
FDPDTGSGGA GFKAVRPWSR GDDGERLVTS DNRSLRRRSP TPFSKQQYEG SIDDFYDAMA
ALINPRPLDP AAMQGSLVDA LEETVSRRVT SVENGEAFMR ARGFSTIDMP DGSRIFLTTG
PWEDYATPSR DLRLLISIDT VVEFPDAVAR APERYGIRGS EAEIAEQIAA LREALAAALA
ARRFSYTRSD GSAFELSLGD VVERAKRLEM AYNPNDCIET RWGAPAGSEE AKTCKRQAPA
EQRARMTRYR DWFSSRKRPA T