Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_2102 |
Symbol | |
ID | 8544488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 2916224 |
End bp | 2917630 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646386809 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003266540 |
Protein GI | 262195331 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.573488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.797611 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGT TCTCGATCGT CGCCATCACC GTGCTCATCG TGCTCAACGC GCTGTACGTG GCCGCCGAGT TCGCCATCGT CGCGGCCCGG CGCTCGATCA TGGATGAGAA GGCGCGCGCG GGTAATCGGG TGGCCGTGCA GGTCGCGCGG GTGCTCGACG ACGACCAGCT CAAAGACCGC TACATCGCGG CCGCTCAGCT CGGCATCACC CTGGCCAGCC TGGGGCTCGG CATGTACGGC GAGAAGGTCT TGGCCCACAG CATCGAAACT TGGCTCGGCA GCAGCGATGT GCCGGCGTGG CTGGCCTCGC ACGCGATCGC CGGTCTGTGC GCGGTGCTCA TCCTCACCTA TTTTCACATC GTCCTGGGCG AGATGGTGCC CAAGGCCATC GCCCTGGCCA CGCCCGAGCG CGCGGCGCTG TGGCTCTCGC CCGTCATCCG CACCACCCAG TTCGTGGCCT ATCCGCTGAT CGCCGGGCTC AACGGCCTGG GTAACCTGCT GCTGCGCGGC CTGCGTCTGT CCGACGACAA CAGCAAGCAC TACTACACGC CGGAAGAGCT GCAGTACGTC GTGCGCGAGA GCGAGGAGCA GGGCCTGCTC GGCCACGAGG CCGCCGAGGT GGTCGACGAG CTGCTCGAGT TCGGCACGCT CACGGCCCGC GAGGTCATGG TCCCGCGGGT CAAGATCCGC GGCGTCGAGC TCGGCAGCGT GCTCTCGGCC GCGCGCGAGA CCCTGCGCGC CGCGCCGCAC ACGCGCTATC CCGTGTTTGC GGGTGATCTC GACCATATCG TCGGCACCGT GCACATCAAA GACGTGCTGC GCCGCACCCG GCAGTCGCGC TCGACCATCA CCCAGGCCGA CATCCACCCG GTGCCCTTCA TCCCCGAGAC CTCGACCCTC GACAAGGTGC TGGCCGCCAT GCGCCAGTGG CGCACGCAGA TGGCCATCGT CATGGATGAG CACGGCGGCA CCGCCGGTCT GCTCACCATC GAGGACCTGT TCGAGGAGAT CATCGGCGAC ATCGACGAGA CCACGGCGGC CGCGCGTCTG CCCGACATCT TCCGCGAGGA GGACGGCAGC CTGCGCGTCG TCGGCACCGT CCGCGTGGAC GAAGTCGGCG AGTCCCTCGA GCCCGAGCGC GAGCTCGAGC ACGAGGAGGT CGACACCGTG AGCGGTCTGG TGCTGGCGCT GCTCGACCGG CCCCCCGAGG TCGGCGACGC CGTGAGCTTC GAAGGCCTGC GCTTCGAGGT GTGCGCGGTC GAGGGCCACG GCGTGGCCGA GTGCCGGGTC ATCCCCCTCG AGCCTGACCC GACCTCCGAA GGCGAGGACG CGGGCGAGGG CGAGCCGCAC GCGAGCGGCG CGGAGCAGGG CGCTAACGCC GAGCGAGGCG ACGAGCGCGA CGCATAG
|
Protein sequence | MSLFSIVAIT VLIVLNALYV AAEFAIVAAR RSIMDEKARA GNRVAVQVAR VLDDDQLKDR YIAAAQLGIT LASLGLGMYG EKVLAHSIET WLGSSDVPAW LASHAIAGLC AVLILTYFHI VLGEMVPKAI ALATPERAAL WLSPVIRTTQ FVAYPLIAGL NGLGNLLLRG LRLSDDNSKH YYTPEELQYV VRESEEQGLL GHEAAEVVDE LLEFGTLTAR EVMVPRVKIR GVELGSVLSA ARETLRAAPH TRYPVFAGDL DHIVGTVHIK DVLRRTRQSR STITQADIHP VPFIPETSTL DKVLAAMRQW RTQMAIVMDE HGGTAGLLTI EDLFEEIIGD IDETTAAARL PDIFREEDGS LRVVGTVRVD EVGESLEPER ELEHEEVDTV SGLVLALLDR PPEVGDAVSF EGLRFEVCAV EGHGVAECRV IPLEPDPTSE GEDAGEGEPH ASGAEQGANA ERGDERDA
|
| |