Gene Information |
Locus tag | Hoch_3938 |
Symbol | |
ID | 8546334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5431540 |
End bp | 5432610 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646388610 |
Product | pyrimidine utilization protein A |
Protein accession | YP_003268330 |
Protein GI | 262197121 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03612] pyrimidine utilization protein A |
|
|
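The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS ending in exactly one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


# Values taken from the record for Hoch_3938.
length_bp = gene_length(5431540, 5432610)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1071 bp) and Protein Length (356 aa).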
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.252871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0210815 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGTCG GAGTGTTCAT CCCCATCGGC AACAACGGCT GGCTGATCTC GGAGAATTCG CCCCAGTACC AGCCGAGCTT CGACCTGAAC AAGGAGATCG CGCAGCGGGC CGAGGCCCAC GGTCTGGATT TCTTGCTGTC GATGATCAAG CTGCGCGGCT TCGGCGGCAA GACCGAGTTC TGGGAGTACA ACCTCGAGAG CTTCACGCTG ATGGCGGGTC TGGCCTCGGT CACCGAGCGC ATCAAGCTGT TCGCGACCTG TGCCACGCTG CTCATCCCGC CGGCCTACGC GGCGCGCATG TGCAACACCA TCGACTCGAT CAGCCACGGC CGCTTCGGGC TCAACCTGAT CACGGGCTGG CAGCGGCCCG AGTACTCGCA GATGGGTATG TGGCCGGGCG ACGAGCACTT CGCGCGGCGT TACGACATGC TGTCCGAATA CGCGCATATC CTGCGCGAGC TGTGGGAGAA GGGCGAGAGC AGCTTCCAGG GCGAGTTCTA CCAGATGGAG GATTGTCGGG TGCGGCCGCA GCCGCAGGGC GACATGAAGC TCATCTGCGC GGGCAGCTCG GACGCGGGCC TGGCGTTCTC GGCCAAGTAC GCGGACTACG CGTTCTGTCT GGGCAAGGGC GTGAACACGC CGACGGCGTT CGCGGGCAAC AACGAGCGCC TGGCGGCGGC GACGGCCAAG ACCGGGCGCG ATGTGTCGAT CTATGTGCTG ATGATGGTGA TCGCGGCGGA GACCGACGAC GAGGCCATGG ACAAATGGAT GCACTACCGC GCGGGCGTGG ACGAGGAGGC GGTGGCGTGG CTGGGCAACC AGGGCGCGGC CGACAAGACC TCGGACACGA CCAACGTGCG CCAGCTCGCG GCCCGCGAAT CGGCCGTGAA CCTCAACATG GGCACGCTGG TGGGCTCGTA CGAGAACATC GCGCGCATGC TCGACGAGGT CGCGACCGTG CCCAACACCG GCGGCGTGCT GCTGGTGTTC GATGATTTCG TCGCGGGCGT GGAGACCTTT GGCACGCGCA TTCAGCCGCT GATGAAGACC CGCCAGCACA CGGCGGGCTG A
|
Protein sequence | MEVGVFIPIG NNGWLISENS PQYQPSFDLN KEIAQRAEAH GLDFLLSMIK LRGFGGKTEF WEYNLESFTL MAGLASVTER IKLFATCATL LIPPAYAARM CNTIDSISHG RFGLNLITGW QRPEYSQMGM WPGDEHFARR YDMLSEYAHI LRELWEKGES SFQGEFYQME DCRVRPQPQG DMKLICAGSS DAGLAFSAKY ADYAFCLGKG VNTPTAFAGN NERLAAATAK TGRDVSIYVL MMVIAAETDD EAMDKWMHYR AGVDEEAVAW LGNQGAADKT SDTTNVRQLA ARESAVNLNM GTLVGSYENI ARMLDEVATV PNTGGVLLVF DDFVAGVETF GTRIQPLMKT RQHTAG
|
| |
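The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short Python sketch. It is a hedged example, not the database's own method: the helper names are hypothetical, the codon table is built from the standard codon assignments (which translation table 11 shares for elongation codons, differing in its set of permitted start codons), and the 10-base grouping with spaces is assumed to be display formatting only.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
prefix = "ATGGAAGTCG GAGTGTTCAT CCCCATCGGC"
print(translate(prefix))  # MEVGVFIPIG, the first ten residues of the protein
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the listed protein sequence and GC content to be checked against the nucleotide sequence.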