Gene Hoch_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1904 
Symbol 
ID8544286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2615300 
End bp2616979 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content70% 
IMG OID646386609 
Producthypothetical protein 
Protein accessionYP_003266344 
Protein GI262195135 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.966187 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCTGGT CGAAGATCTG GTTTTTCCTC ACCGCCCTGG TCGCCGGTGC GGCGATCACC 
GTGGCGCTCA TTTTGCCCCG GCCGGCCGAG CGCCAGCGCC TGGCCGATGA GCGCGAGCGC
GTGGTCACTG CCTGCGACGT GCTCAATATT CTGCTCGAGT CGAACGCGCG CAGCCGCGTC
GACCTGGCCG GGACCTTTGC GCGTTCGGAG ATCGACGTGG CCCAGGTGCT GGCGCCGGCG
TCGCTCGAGA GCGACGTGAG CGCCGACGCC AACAAGACGG CGCGCACGCT GGCCAATCAG
CTCGTCGAGA GCACGGCCGG CGACAAGCCC GAGTTCGTGA TCCTCATCGA CGGCCGCGGC
CGCGCGGTGG GGCGTGTCGG CATCCACGAG GACCGCTACG GCGACACCAT GGCCGGCTAC
CACCTGGTCG ATGACGCGCT CGACGGCTAC ATGCGCGACG ATTTGTGGCT GATCGACGAT
AAGCTGTACC TGGTGGCCGG CGCGCCCGTG ATCGCCAACC GCTGGGCCGG CGCCGTGGTC
ATCGGACACG AGGTCGACAA AGAGCTTGCC GATCGCCTGG TCGGCCAGCT CGGCGTGGAT
TTCGTGGTCT ACGCTGGCGG CCAACCGGTG GCGACCACCA ACCCGGTCGA GATCCACAGC
GAGGTGCAGC GCGCCTACGC CGAGGAGGGC GGCAAAGAGA CGCCGGTGAT TCAGGACTGC
CGCCAGAACA CGCCCTTCGA CGTGGTAACC GGCAGCGAGA CCTACACCGC CCTGGTCGCC
CGTCTGCCCG GCGAGGCCGG CACCCAGGGG GCCTTCTTCG CGGTGTTCGT GGCGCGGCCC
GCGTCGCTCG GGCTGATGGG CACGCTGGAC GCGGTCAACA AGGAAGACAT CGCGTTTGGC
AGCTTCCCGT GGATCCTGCT CGGTATCGGC CTGATCGCGG CCTTGGGGCT GGGCATCTTC
CTGATGATCT TCGAGGTCGA TCGGCCGCTG CGCCGGCTGT CCAAGGATTC GGTACTGCTC
GCCCAGGGCG ACGCCAAGCG GCTCGACGAG GAGCGCCACC GCAGTCACTA CGGCTCCATC
GCGCGCTCGG TGAATATCTT CATCGACAAG TCCAAGCGCG AGGCCCGCAG CGGCACGCAG
CCGAGCGTGA ATCCGCTGCC GCCGGTGGGG CCGGGCGGAC CGAGCGGCGG CGCGGGCAAG
CCGCCGCCGC CCTCGGAGTT CAAGTTTTCG GACACCAGTC CGGGAAACCC GCGCAAGCCG
TCGTCGGGCG GCCCGCCGCG GCGCGCGCCC ACGGCTCCGG GCGAGCGTCC GGCGCTGCCG
GGAGCGCCGC CCGCGCGTGC GCCGACCAAC CCGCCACCGG GAGCGCCGCC GCCGCTGCGT
GCCATGCGCA CGGAGTCGGG CATCACGGCC ATCGATGACA TCTTCGCGCC CGGCGCGGCG
GAGAGCGGCG AGATCCGCCT CGGCGACGAG TCGCATCGCG GACTGTACGA GGAGTTTCTG
GCGCTCAAGC GGCAGTGCGG CGAGCCCACG GCCAACCTCA CCTACGAGAA GTTTGCGGGC
AAGCTGCGCG CCAGCCGCGA TGCCCTCATC GCCAAGCACA ACTGCCGCGA CGTCAAGTTC
CAGGTCTACA TCCGCGACGG CGCGGCCGCG CTCAAGGCCA AGCCCGTGGG CCTGCCCTGA
 
Protein sequence
MFWSKIWFFL TALVAGAAIT VALILPRPAE RQRLADERER VVTACDVLNI LLESNARSRV 
DLAGTFARSE IDVAQVLAPA SLESDVSADA NKTARTLANQ LVESTAGDKP EFVILIDGRG
RAVGRVGIHE DRYGDTMAGY HLVDDALDGY MRDDLWLIDD KLYLVAGAPV IANRWAGAVV
IGHEVDKELA DRLVGQLGVD FVVYAGGQPV ATTNPVEIHS EVQRAYAEEG GKETPVIQDC
RQNTPFDVVT GSETYTALVA RLPGEAGTQG AFFAVFVARP ASLGLMGTLD AVNKEDIAFG
SFPWILLGIG LIAALGLGIF LMIFEVDRPL RRLSKDSVLL AQGDAKRLDE ERHRSHYGSI
ARSVNIFIDK SKREARSGTQ PSVNPLPPVG PGGPSGGAGK PPPPSEFKFS DTSPGNPRKP
SSGGPPRRAP TAPGERPALP GAPPARAPTN PPPGAPPPLR AMRTESGITA IDDIFAPGAA
ESGEIRLGDE SHRGLYEEFL ALKRQCGEPT ANLTYEKFAG KLRASRDALI AKHNCRDVKF
QVYIRDGAAA LKAKPVGLP