Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3176 |
Symbol | |
ID | 8545564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 4375417 |
End bp | 4376337 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646387843 |
Product | protein of unknown function DUF58 |
Protein accession | YP_003267571 |
Protein GI | 262196362 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.410481 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.342999 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTTTCC TCGACCCCAG CGCGCTCACC CAGGTGAGCG GCATGCTGCT GCGCGCCCGC TTGATCGTGG AGGGCGCGCT CACGGGCGCG CACAGGGCGC GGCTGCGGGG CTCGTCGGTG GAGTTCGCCG AGCACAAGGA GTACGCGCCC GGCGACGAGA TTCGCCACAT CGACTGGAAG GCCTACGCCA AGGTCGACCG CTACTACGTC AAGCAGTTCG AGCAGGAGTC GCAGCTCACG GCGTACCTGG TGCTCGACAC CTCGGCGTCG ATGGACTACG CGGGCGAGGG GCTGAGCAAG CTGCGCTACG CGGCCTATCT GAGCGCGGCG CTGGCGTATC TGCTGGTGCA GCAGCGCGAT CGCGTGGGGC TGCTGCCCTT TGGCCAGCTC GACAGCGGCG GCTACGTACC GCCCCGGGCC CAGCCCGCGC ATCTGCGCAC GCTGCTCGGG TCGCTCGAGG AGTTGTGCGA GCGCGGCGGC GCCGGCGACG CCTCGGCCGC GGCCGCTCTC GACCGGGTGG CCGAAATCGC CGGTCGGCGG CGCGCGCTGA TCGCGGTGTG CTCGGATCTC TTTGCCGCCG AGGGCGACGG CCTGGCGGTG CTGCGGCGGC TGCAGGCGCG CGGCCACGAC GTGGTGGTGT TTCACGTGCT CGACCCCGAT GAGCTGTCGT TTCCCTTCCG CGGCCTCACG CGCTTCGAAT CGCTCGAGGA TGAGCGCGTG CTGCTGGCCG AGCCGGAGTC GCTGCGGCGC GCGTATCTGC GCCGGCTCGA GGCGTTTCTG GCGCGGGTCG AGCGCGGCTG TGCCGACAGC GGCGTGGGCT ATCACCGGGT GCCGACCTCG CAGCCGGTCG AGCGCACGCT GCTCGAGTTT CTCGAGACGC GCGCGCGGCT GCGGGGGGGA GGGCGAACGT GGAGTTCCTA G
|
Protein sequence | MSFLDPSALT QVSGMLLRAR LIVEGALTGA HRARLRGSSV EFAEHKEYAP GDEIRHIDWK AYAKVDRYYV KQFEQESQLT AYLVLDTSAS MDYAGEGLSK LRYAAYLSAA LAYLLVQQRD RVGLLPFGQL DSGGYVPPRA QPAHLRTLLG SLEELCERGG AGDASAAAAL DRVAEIAGRR RALIAVCSDL FAAEGDGLAV LRRLQARGHD VVVFHVLDPD ELSFPFRGLT RFESLEDERV LLAEPESLRR AYLRRLEAFL ARVERGCADS GVGYHRVPTS QPVERTLLEF LETRARLRGG GRTWSS
|
| |