Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_0099 |
Symbol | |
ID | 8542470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 150550 |
End bp | 152856 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646384887 |
Product | General secretory system II protein E domain protein |
Protein accession | YP_003264633 |
Protein GI | 262193424 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCTC GCCTGAGCTC TCTCCTCGTC AGTGACGGGG TGGTCAGCGT CAAACGCATG GAACATGCGT TTCAGCGGCA GGTGATCTAC GGCGGTTCGC TCGACACCAT CCTGCTCGAG ATGAACTTGG TCTCGGAGGC GCGGCTGGTG CAGTACCTGT CCAAGGCCAC CAGCCTGCCG CCGGCGACGC CGCAGGAGAC CGCGGGCACC GACACCGCCG CGGCCGAGCG CTGCCTGGCG GCGGTGGCCG ATCGCTACCG GGTGGTGCCG CTGTGTTTCT CGGACAGCGC GCTGCGCCTG CTGGTGCACG ATCCCGTGGA GATGGCCGGG CTCGAGGAGC TGGCCAACGA GCTGGGCGTG GCCGTGCAGC CGCTGGTGGT GCCCGAGTAT CGCTTTCACC TCACCCACGT GCGCGTCTTC GGCGGCACCC CGGACGCGCG CTACGAGACC CTGGCGCGGC GCGCGGACGA GAGCCGGCCG CTGCAGCCGG TGGGCAAGGC GCGCAGCGTG ATCGTCGATG CCGGGGGCGG CGGCGGCGGC GACACCGAGC GGACTGCGGC GGCGCCGACG ACGATCGAGG CGGCCGCCGA GGTGTTCGAG CTGCCGGCGC CGAAACGCCG CGAAAAGCGC ACCATGGAGA TGGCGAGCGC GGCCCTGGCG CGGCGCAGTG AACTCGACCT GGGCAACGCC AGCCCGGTCG CGCCGACCCG CGAGCAGCGC CTCGGCCACC TCGACACCGC GCCGGTGGCG GCGACCGACT CAGGCCGGGA ATCGCCCGCG AACGCGCGCG CTCGCGCGGC GGGCCCGGAT ACCGAGGCGC TGTCGGCCGA GCAGGCCATC GCCGCGCTGC CCGAGGCCGA GGATCGCGAC ATCATCTTCA ACCTGCTGCT GCGCGCTGTG CGGCACTGGA GTGAGTACGC GGCCCTGTTC ACCGTGCAGG GCCAGTCGGC GATCGGCCGC ATCGCCATCG ACGGCGATCG CGTCGACCGC ATGGCCATCG CCCGCGCGGT GCTGCCGCTC GACATGGCCT CGCCCTTCCG CACCGTGGCG CAGTCGCTCA CGCCCTACGC GGGCCCGCTG CGCATCGAGC TGCCGGGCAT GAACAGCATG CTCGCCGATC TCGGTATCGC GCCGACCACC ACGGTGGTGC TGGTGCCCGT GATCCTGCGC GGCCGCGTGG TCGCCATCGC GCTCGGCCAC GGCGGCGCCG AGCCCGTGAG CGACGAGGCC AGCGGCGCGC TGATGCCGGT GACCGTGGCC GCGGCCGACG CCATCTCGCG CCTGATCGTC AAAGCCAAAT CGCAGCGCCA GACCGCGGTC ACCGCGGTCG CGCCGCCGCC CGGACAGGAG CCCGCGGCTG CGCCCGCGCC GGACGCGGAG GCGGACACGG ACAGGTACGA GGGGCCGCGG CGACGGCCCG AGCGCAGCAC CCAGGTGATG GCGCATCCGC CCATCGACAG CGTGCTCAGC GCGGTGCAGA GCGAGAACCC GGAAGAGGCC GATTACGGAC GCGCGTCCGC CTTGGCGCGC CCCGACGAGA CCCTGAGCGC GCTGGCTGCG CGTTTCCCGG GCACGCTGTG GGTGGAGCGC TTCGAACTCG AGGGCCAGCC GCTGCCGCCG GCCGAGCACG GGCCGCTGCT CGCGCTCACC ATCGAGCTGG GGCCGCTGGC CACCGAGCTG CTGATCGAGA AGCTCGCGGA TCCCGACCGC GAGACCCGCT ACTACGCGGC GCTGTGCCTG GCCGAGACGC GGCCGCGCGA GGCGCTCGAG CCCTTTGTCG AGCACCTCTT CGACAGCGAC TACGGCATCC GCTCGCTGGT GATCGACGCG CTGTCCGACT ATCCGGCCGC GCAGCTAGAG AAGGCACTGG CCCGGGTGCG GCAGGCGCTG CACAGCGACC AGGGCGGACG CGTGCAGGCC GCCGGCAACG CCCTGGCCAA GCTCGGCGAC ACCCACGCCG TGCCGGTGCT CATCGACGTC ATGGCCGAGG GCGGCAGCGG CGCCGAACAC GCGCGTCGCA CCCTGATTTC GCTCACCCGC CAGGATTTTG CCAGCAGCGT GCGCAAGTGG CGCTCGTGGT GGAACAAGCA CCAGGAGCAG CACATGATCG AGTGGCTGAT CGAGGCGCTC GGACACAAGG ACGAGAACTT GCGCGGCGCC GCGGCCGAAG ATCTGCGCCA GCGCACGGGC GAGTACTTCG GTTTCCACCA CGACCTGTCG AAGAGAGAGC GCGAGCAGGC CCAGCAGCGC TGGCGCGAGT GGTGGCGGCA GACCGGCTCG ATGAAGTTCG CGGGTCCGCG GGCCTGA
|
Protein sequence | MPSRLSSLLV SDGVVSVKRM EHAFQRQVIY GGSLDTILLE MNLVSEARLV QYLSKATSLP PATPQETAGT DTAAAERCLA AVADRYRVVP LCFSDSALRL LVHDPVEMAG LEELANELGV AVQPLVVPEY RFHLTHVRVF GGTPDARYET LARRADESRP LQPVGKARSV IVDAGGGGGG DTERTAAAPT TIEAAAEVFE LPAPKRREKR TMEMASAALA RRSELDLGNA SPVAPTREQR LGHLDTAPVA ATDSGRESPA NARARAAGPD TEALSAEQAI AALPEAEDRD IIFNLLLRAV RHWSEYAALF TVQGQSAIGR IAIDGDRVDR MAIARAVLPL DMASPFRTVA QSLTPYAGPL RIELPGMNSM LADLGIAPTT TVVLVPVILR GRVVAIALGH GGAEPVSDEA SGALMPVTVA AADAISRLIV KAKSQRQTAV TAVAPPPGQE PAAAPAPDAE ADTDRYEGPR RRPERSTQVM AHPPIDSVLS AVQSENPEEA DYGRASALAR PDETLSALAA RFPGTLWVER FELEGQPLPP AEHGPLLALT IELGPLATEL LIEKLADPDR ETRYYAALCL AETRPREALE PFVEHLFDSD YGIRSLVIDA LSDYPAAQLE KALARVRQAL HSDQGGRVQA AGNALAKLGD THAVPVLIDV MAEGGSGAEH ARRTLISLTR QDFASSVRKW RSWWNKHQEQ HMIEWLIEAL GHKDENLRGA AAEDLRQRTG EYFGFHHDLS KREREQAQQR WREWWRQTGS MKFAGPRA
|
| |