Gene Hoch_0099 details

Gene Information

Locus tag: Hoch_0099
Symbol:
ID: 8542470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 150550
End bp: 152856
Gene Length: 2307 bp
Protein Length: 768 aa
Translation table: 11
GC content: 72%
IMG OID: 646384887
Product: General secretory system II protein E domain protein
Protein accession: YP_003264633
Protein GI: 262193424
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
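
The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (the convention under which the reported numbers match) and a CDS that ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(150550, 152856)
print(length)                     # 2307
print(protein_length_aa(length))  # 768
```

Both values match the Gene Length and Protein Length fields reported in the record.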


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTCTC GCCTGAGCTC TCTCCTCGTC AGTGACGGGG TGGTCAGCGT CAAACGCATG 
GAACATGCGT TTCAGCGGCA GGTGATCTAC GGCGGTTCGC TCGACACCAT CCTGCTCGAG
ATGAACTTGG TCTCGGAGGC GCGGCTGGTG CAGTACCTGT CCAAGGCCAC CAGCCTGCCG
CCGGCGACGC CGCAGGAGAC CGCGGGCACC GACACCGCCG CGGCCGAGCG CTGCCTGGCG
GCGGTGGCCG ATCGCTACCG GGTGGTGCCG CTGTGTTTCT CGGACAGCGC GCTGCGCCTG
CTGGTGCACG ATCCCGTGGA GATGGCCGGG CTCGAGGAGC TGGCCAACGA GCTGGGCGTG
GCCGTGCAGC CGCTGGTGGT GCCCGAGTAT CGCTTTCACC TCACCCACGT GCGCGTCTTC
GGCGGCACCC CGGACGCGCG CTACGAGACC CTGGCGCGGC GCGCGGACGA GAGCCGGCCG
CTGCAGCCGG TGGGCAAGGC GCGCAGCGTG ATCGTCGATG CCGGGGGCGG CGGCGGCGGC
GACACCGAGC GGACTGCGGC GGCGCCGACG ACGATCGAGG CGGCCGCCGA GGTGTTCGAG
CTGCCGGCGC CGAAACGCCG CGAAAAGCGC ACCATGGAGA TGGCGAGCGC GGCCCTGGCG
CGGCGCAGTG AACTCGACCT GGGCAACGCC AGCCCGGTCG CGCCGACCCG CGAGCAGCGC
CTCGGCCACC TCGACACCGC GCCGGTGGCG GCGACCGACT CAGGCCGGGA ATCGCCCGCG
AACGCGCGCG CTCGCGCGGC GGGCCCGGAT ACCGAGGCGC TGTCGGCCGA GCAGGCCATC
GCCGCGCTGC CCGAGGCCGA GGATCGCGAC ATCATCTTCA ACCTGCTGCT GCGCGCTGTG
CGGCACTGGA GTGAGTACGC GGCCCTGTTC ACCGTGCAGG GCCAGTCGGC GATCGGCCGC
ATCGCCATCG ACGGCGATCG CGTCGACCGC ATGGCCATCG CCCGCGCGGT GCTGCCGCTC
GACATGGCCT CGCCCTTCCG CACCGTGGCG CAGTCGCTCA CGCCCTACGC GGGCCCGCTG
CGCATCGAGC TGCCGGGCAT GAACAGCATG CTCGCCGATC TCGGTATCGC GCCGACCACC
ACGGTGGTGC TGGTGCCCGT GATCCTGCGC GGCCGCGTGG TCGCCATCGC GCTCGGCCAC
GGCGGCGCCG AGCCCGTGAG CGACGAGGCC AGCGGCGCGC TGATGCCGGT GACCGTGGCC
GCGGCCGACG CCATCTCGCG CCTGATCGTC AAAGCCAAAT CGCAGCGCCA GACCGCGGTC
ACCGCGGTCG CGCCGCCGCC CGGACAGGAG CCCGCGGCTG CGCCCGCGCC GGACGCGGAG
GCGGACACGG ACAGGTACGA GGGGCCGCGG CGACGGCCCG AGCGCAGCAC CCAGGTGATG
GCGCATCCGC CCATCGACAG CGTGCTCAGC GCGGTGCAGA GCGAGAACCC GGAAGAGGCC
GATTACGGAC GCGCGTCCGC CTTGGCGCGC CCCGACGAGA CCCTGAGCGC GCTGGCTGCG
CGTTTCCCGG GCACGCTGTG GGTGGAGCGC TTCGAACTCG AGGGCCAGCC GCTGCCGCCG
GCCGAGCACG GGCCGCTGCT CGCGCTCACC ATCGAGCTGG GGCCGCTGGC CACCGAGCTG
CTGATCGAGA AGCTCGCGGA TCCCGACCGC GAGACCCGCT ACTACGCGGC GCTGTGCCTG
GCCGAGACGC GGCCGCGCGA GGCGCTCGAG CCCTTTGTCG AGCACCTCTT CGACAGCGAC
TACGGCATCC GCTCGCTGGT GATCGACGCG CTGTCCGACT ATCCGGCCGC GCAGCTAGAG
AAGGCACTGG CCCGGGTGCG GCAGGCGCTG CACAGCGACC AGGGCGGACG CGTGCAGGCC
GCCGGCAACG CCCTGGCCAA GCTCGGCGAC ACCCACGCCG TGCCGGTGCT CATCGACGTC
ATGGCCGAGG GCGGCAGCGG CGCCGAACAC GCGCGTCGCA CCCTGATTTC GCTCACCCGC
CAGGATTTTG CCAGCAGCGT GCGCAAGTGG CGCTCGTGGT GGAACAAGCA CCAGGAGCAG
CACATGATCG AGTGGCTGAT CGAGGCGCTC GGACACAAGG ACGAGAACTT GCGCGGCGCC
GCGGCCGAAG ATCTGCGCCA GCGCACGGGC GAGTACTTCG GTTTCCACCA CGACCTGTCG
AAGAGAGAGC GCGAGCAGGC CCAGCAGCGC TGGCGCGAGT GGTGGCGGCA GACCGGCTCG
ATGAAGTTCG CGGGTCCGCG GGCCTGA
 
Protein sequence
MPSRLSSLLV SDGVVSVKRM EHAFQRQVIY GGSLDTILLE MNLVSEARLV QYLSKATSLP 
PATPQETAGT DTAAAERCLA AVADRYRVVP LCFSDSALRL LVHDPVEMAG LEELANELGV
AVQPLVVPEY RFHLTHVRVF GGTPDARYET LARRADESRP LQPVGKARSV IVDAGGGGGG
DTERTAAAPT TIEAAAEVFE LPAPKRREKR TMEMASAALA RRSELDLGNA SPVAPTREQR
LGHLDTAPVA ATDSGRESPA NARARAAGPD TEALSAEQAI AALPEAEDRD IIFNLLLRAV
RHWSEYAALF TVQGQSAIGR IAIDGDRVDR MAIARAVLPL DMASPFRTVA QSLTPYAGPL
RIELPGMNSM LADLGIAPTT TVVLVPVILR GRVVAIALGH GGAEPVSDEA SGALMPVTVA
AADAISRLIV KAKSQRQTAV TAVAPPPGQE PAAAPAPDAE ADTDRYEGPR RRPERSTQVM
AHPPIDSVLS AVQSENPEEA DYGRASALAR PDETLSALAA RFPGTLWVER FELEGQPLPP
AEHGPLLALT IELGPLATEL LIEKLADPDR ETRYYAALCL AETRPREALE PFVEHLFDSD
YGIRSLVIDA LSDYPAAQLE KALARVRQAL HSDQGGRVQA AGNALAKLGD THAVPVLIDV
MAEGGSGAEH ARRTLISLTR QDFASSVRKW RSWWNKHQEQ HMIEWLIEAL GHKDENLRGA
AAEDLRQRTG EYFGFHHDLS KREREQAQQR WREWWRQTGS MKFAGPRA
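
The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the blocks might be cleaned up and how a GC percentage could be computed for comparison with the GC content field; it is demonstrated only on the final line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a sequence printed in 10-letter blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Last line of the gene sequence above, used as a small demonstration.
fragment = clean_sequence("ATGAAGTTCG CGGGTCCGCG GGCCTGA")
print(len(fragment))  # 27
print(round(gc_percent(fragment), 1))
```

Passing the full gene sequence through `clean_sequence` should yield a string whose length can be compared with the reported 2307 bp, and `gc_percent` of that string can be compared with the reported 72%.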