Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_2049 |
Symbol | |
ID | 8544431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 2831523 |
End bp | 2832518 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646386752 |
Product | protein of unknown function DUF403 |
Protein accession | YP_003266487 |
Protein GI | 262195278 |
COG category | [S] Function unknown |
COG ID | [COG2307] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.715011 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATCGC GCGTCGCAGA GCACTGTTTC TGGATGCATC GCTACGTCGA GCGGGCCGAG AACATGGCCC GACTGCTCCA GGTCAATCGC AACTTCGTGC TCGACGTCAA CGTGCCGCGG GCCGAGCGCT GGCTGCCGGT GGTGGTGGTC TCGGGTGAGC AGGAGCGCTT CGGCGAGCTG TACACCGAGG AGATGACCAA CGACGGCGAG CTGGTCGAGG AGTACCTCAC CTGGAACGAG GACAACCCGG TGTCGCTGCT CAGCTCGCTG CGCTGGGCGC GCGAGAACGC GCGCACCATC CGCGAGGTCA TCAGCCTCGA GATGTGGCGC GCGCTCAACA GCACCTACCG CTGGATGGTC GACGGCGAGG GCCGACGGCT CTACCAGTCG GAACGCGACG AGTTCTACGA CCGGGTGCGC TCGTCGGCCG CGCTGTTCGA CGGGGTGTTC CACAACACCA TCCTGCACAC CGAGCCCTAC TACTTCATGT TGCTCGGCAT CTACCTCGAG CGCGCCGGCC AGACCGCGCG CATCCTCGAC GTCAAGCATC ACCAGGTGAA CCGCGACGGC GCGGCCTCGG CCGTGGACTC GGCGCGCAGC CTGGCGCTGT TGCGCTCGTG CTCGGCGACC GAGCCGTTCT TCAAGCACGT GCGCGCGGCG CCCAGCGGCA AGACGATCGC GCCCTTCCTC GTGCTCGAGG AGCGCTTTCC GCGCTCGGTG CTGCACTGCC TGGCGCGCGC GCGCGCGTGC CTCACCGACA TCCGCGGCTT CACGCAGCGC GCCGCGCCCA CGCGTTCGGC CCAGCTCCTC GACGCCGTGG TCGGCAGCCT GCGCGCGCAC ACGGCCGGCA GCTTGTTCGA GACCGGCCTG CACACCGAGC TCACCCGGGT GATCGATACG GCAGCCGAGA TCTGCAGCGC GTTCCGCGAG GACTACTTCG ACCCCAGCTT TACGAGCGCA AGCCCAACAC TCGGGGTACA GTCCCAAAAA CAATGA
|
Protein sequence | MISRVAEHCF WMHRYVERAE NMARLLQVNR NFVLDVNVPR AERWLPVVVV SGEQERFGEL YTEEMTNDGE LVEEYLTWNE DNPVSLLSSL RWARENARTI REVISLEMWR ALNSTYRWMV DGEGRRLYQS ERDEFYDRVR SSAALFDGVF HNTILHTEPY YFMLLGIYLE RAGQTARILD VKHHQVNRDG AASAVDSARS LALLRSCSAT EPFFKHVRAA PSGKTIAPFL VLEERFPRSV LHCLARARAC LTDIRGFTQR AAPTRSAQLL DAVVGSLRAH TAGSLFETGL HTELTRVIDT AAEICSAFRE DYFDPSFTSA SPTLGVQSQK Q
|
| |