Gene Information |
Locus tag | Hoch_3765 |
Symbol | |
ID | 8546158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5175653 |
End bp | 5177791 |
Gene Length | 2139 bp |
Protein Length | 712 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646388435 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003268158 |
Protein GI | 262196949 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
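The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that Protein Length excludes the terminal stop codon; the record does not state either convention, but both are consistent with the values it reports. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5175653
end_bp = 5177791
gene_length_bp = 2139
protein_length_aa = 712


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # coordinates vs. Gene Length
print(computed_protein_length == protein_length_aa)  # Gene Length vs. Protein Length
```

Under these assumptions, 5177791 - 5175653 + 1 gives 2139 bp, and 2139 / 3 - 1 gives 712 aa, matching the table.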
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0441363 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.184331 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACGG TCATCTTCAT CATCCTCTTG GCCGCGGCGG GCGGTTTCTT CGTGCGCACC TTGAGCCGGA TGACGCGCGC TGCCGCACGC GGCACCGCCG ACCCGCGCCC CCGCATGGAT CAGCTCGGCG AGCGCGTCGC CTCGATCCTG GTGTTCTTCT TCGGCCAGAA GAAAGTCGTC GCCAACCGCG GGCCCTCGCA ATCGCGTCTC ACCTGGAGCT GGCACCACCT GATCATCTTC TGGGGCTTTT TGGTGATCAC CATCGCCAGC GTCGAGATCC TGGTCAACGG CGTGATTCCC GCCCTGAGCC TCGAGCTTCT GCCCGAGCCG CTGTTCATGC CGCTCAAGGT GCTCATCGAC GTGCTCAACC TGCTGGTCCT GCTCATCGTC GGCTGGGCCT TCTTCCGGCG CATCGTGCTG CAGCCGCGCC TGATCCCGAT GAACCTCGAC GCCGGCGTCA TCCTCGGCGC CATCGCGTCG CTGATGCTCA CCCACTTCCT GTTTCACGGC TTCCACTTCG TGGCCGAGGG CATGAGCGAG TTCCCGGCCT ACGCGCCGGT CTCGGGCGTG GTCGCCGCGC TGCTGAGCGA CACGCCGCAG CCGGTCGCCC ACTTCGGCGC CGAGGCCGCC TATTGGCTGC ACGTCCTGCT GCTGCTCACC TTCCTCAACT ACCTGCCCTA CTCCAAGCAC ATCCACCTGC TGGGCGCGTT CCCCAACATC TTCACGCGCA ACCTCAGCGA GCGCAAACTC GACATGCCCA AGCTCGACCT CGAGGACGAG AACCAGTGGG GCGTCGGTCG CATCGAGCAG TTTAGCTGGA AGTCGCTGCT CGATAACTAC GCGTGCACCG AGTGCGCGCG CTGCACCAAC AACTGCCCGG CCTACGCCAC CGGCAAGAAC CTGTCGCCGA TGCAGCTCGT CCACGATCTG CGCTACGAGA TGATCGATCG CGACGCCCTG CTGGCCCAGC GCGACGGCCT CGATCGCGAG ATCGAAGCCT TCGAGCACAA AGAAGAGGAC GGCCACAAGC ACCCGGACTT CGAGCACCTC GAGCAGCAGC GCGCCGAGGT CGAGGAGCAG CTCGAGGCCA TGCCCGAGCT GGTCGGCGGT CGCATCGCCG ACGAGACCCT GTGGGCCTGC ACCACCTGCG GCGCCTGCCA GGCCGTGTGC CCGGTGTTCA TCGAGCACCC GCTCAAGATC CTGCAGATGC GCCAGAACCT GGTGCTCGAG CAGGAGCGCG TGCCCGGCGA GCTAGGCCGC ACCTTCCGCA ACATCGAGCG CCAGTCCAAC CCGTGGGGCA TCGCCAGCGA CCAGCGCATG GACTGGGCCG AGGGCCTCAA CGTGCCCACC ATCGAGGAGA ACCCCAACCC CGAGTACATC CTCTGGGTCG GCTGCGCGGG CGCCTTCGAC AACCGCATCA TCAAGCAGAC CAAGGCGATG ATTCAGATCC TCGGCGCCGC GCACGTCAAC TACGCCGTGC TCGGTCACCA GGAGGGCTGC ACGGGCGATC CGGCGCGGCG CGCGGGCAAC GAGCTGCTGT TCCAGATGCA GGCCGAGACC AACATCGAGG TGCTCAACGA GACCGGCACC AAGAAGGTCA TCACCTCGTG CCCCCACTGC CTGCACACGC TGCGGCACGA CTACCCGCAG TTTGGCGGTG ACTACGAGGT CATCCACCAC ACCCAGCTCA TCGCCCACCT CATCGACGCC GGCAAGCTGC AGACCGGCAA CAGCTCGAGC ATCGAGCGCA TCACCTACCA CGATAGCTGC TACCTCGGCC GCTGGAACCA GGAGTTCGAG 
GCGCCGCGCG AGATCCTGCG CTCGCTGCCC ATCGCCGGCG GCGTCACCGA GCTCGAGCGC AACCGCATGC ACGGCTTCTG CTGCGGCGCC GGTGGCGCGC GCATGTTCAT GGAAGAGGAA GAGCCGCGGG TCAACGTCAA CCGCGCCGAC GAGGTCATCG CCACCGGCGT CGACGCCGTC GCCGTGGCCT GCCCGTTCTG CAACATCATG CTCACCGACG GCATGAAGCA CCGCGACAAG GACGAGGACA TCCAGGTCCT CGACATCGCC GAGGTGGTCG CCAGCAGCCT CATCCCGGCC AGCTCGCTGG TGCGCAAGAA GGACGAAGCC GCCAACTGA
|
Protein sequence | MKTVIFIILL AAAGGFFVRT LSRMTRAAAR GTADPRPRMD QLGERVASIL VFFFGQKKVV ANRGPSQSRL TWSWHHLIIF WGFLVITIAS VEILVNGVIP ALSLELLPEP LFMPLKVLID VLNLLVLLIV GWAFFRRIVL QPRLIPMNLD AGVILGAIAS LMLTHFLFHG FHFVAEGMSE FPAYAPVSGV VAALLSDTPQ PVAHFGAEAA YWLHVLLLLT FLNYLPYSKH IHLLGAFPNI FTRNLSERKL DMPKLDLEDE NQWGVGRIEQ FSWKSLLDNY ACTECARCTN NCPAYATGKN LSPMQLVHDL RYEMIDRDAL LAQRDGLDRE IEAFEHKEED GHKHPDFEHL EQQRAEVEEQ LEAMPELVGG RIADETLWAC TTCGACQAVC PVFIEHPLKI LQMRQNLVLE QERVPGELGR TFRNIERQSN PWGIASDQRM DWAEGLNVPT IEENPNPEYI LWVGCAGAFD NRIIKQTKAM IQILGAAHVN YAVLGHQEGC TGDPARRAGN ELLFQMQAET NIEVLNETGT KKVITSCPHC LHTLRHDYPQ FGGDYEVIHH TQLIAHLIDA GKLQTGNSSS IERITYHDSC YLGRWNQEFE APREILRSLP IAGGVTELER NRMHGFCCGA GGARMFMEEE EPRVNVNRAD EVIATGVDAV AVACPFCNIM LTDGMKHRDK DEDIQVLDIA EVVASSLIPA SSLVRKKDEA AN
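Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling: it cleans an excerpt of the gene sequence (the first 30 bases and the last 9), translates it, and compares the result with the matching ends of the protein sequence. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is the standard NCBI amino-acid assignment, which translation table 11 shares for every codon; table 11 differs only in which codons may initiate translation.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks between ten-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Excerpts copied from the record: start and end of the gene sequence.
gene_start = clean("ATGAAAACGG TCATCTTCAT CATCCTCTTG")
gene_end = clean("GCCAACTGA")

# Matching excerpts of the protein sequence.
protein_start = clean("MKTVIFIILL")
protein_end = "AN"

print(translate(gene_start) == protein_start)    # first ten residues agree
print(translate(gene_end) == protein_end + "*")  # last two residues plus stop
print(f"GC fraction of the 30-base excerpt: {gc_fraction(gene_start):.2f}")
```

To check the whole record, paste the full Gene sequence and Protein sequence fields into `clean` in place of the excerpts. The GC fraction of the short excerpt is not expected to match the 66% reported for the full gene.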