Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_0535 |
Symbol | |
ID | 8542915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 722859 |
End bp | 724688 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646385329 |
Product | RNA binding S1 domain protein |
Protein accession | YP_003265066 |
Protein GI | 262193857 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1098] Predicted RNA binding protein (contains ribosomal protein S1 domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAACG TGTCCACTAG AGACGAAGAA CAAAACCAAA GCGAGTCGGC GCAAGCTGCG CCGGCGTCGA CCGAGTCGCC CGCAGCGGAG TCCGCCACGT CGGAATCCGC TGATGCGGCG TCATTGCAAG CGGATGAGAG CGCGGCCGAA GCCGCCGAGG GCGGCGAGGC GGGCGCATCA TCGGCAGAGA ACGAGAGCCA AGCTGCCGAG GGCGGTGAAG CTGCCGAGGG CGGTGACGCT GCCGAGGGCG GCAAGGCGCC CCTGCGCAAG AGTCGCTCGC GTGGTGGCCG TCGCGGCGGC CGCGGCTCGG GCGGGGGCGG GCAGAACGAT CCCACGCCCG AGCTGCTCAA GCTGGTCGAT CTCAGCGTCA AGTTCCCCGA CATCGGGCCG GCTCTGGCCG AGCTGGCCTT CAAGCTGGGC AACAGCGAGA TCGGCGAGCG CGTGCTGAGC ATGGGCCTGT CGAGCGACCG GCCGGGGCTG GAGTTCTACT TCGTGGCCGC GCACGCGGCC CGTCGCGAGC GCCGCTACGA GGACGCCCTG CGGGCCTCGA TCGATGCGGT GCGCGCCTAC GCCGACGCCG CCGACGACGC CATCGCTGCG GACGACGCTC AGCGCCTGCT GCACCTGGTG CGCCTGGGCT TCAACACGCT GATGTTCGAG ATCGGCGACG TCGAGAAGCA TCCGTGGTTC ACGCGCGATC TGATCGCCGA GCTCACGCGC GCCGAGCCGC GGCTGGGCGA GGACTCGTTC TTCCGCTCGC TGCTGGCGCA GGCGCTGTGG TTCGAGGACC GCGAGCGCAG CGAGGCCGAG TGGCAGCGCG CGCACGAGCT GGGCGCGCCC GAGACCACGT GGAACGCGCG CGGGACCTGG TACCGCGAGG CCGAGAACGA CCTGGTCAAG GCCGAGGAGG CGTACCGCAA GGGCCTGGAG GCGGCGTCCG ATAGCGCGCT GCTCAACCAC AACCTGGCCC AGGTCCTGAT CGATCGCGCC CGCGGTCTCG AGGACGACAA GGAGCAGGTC AAGGCGCTGC TGCAGGAGGC CGAGACCCTG CTGCGCGACG CCCTGCGCGA GGACGGCCCC AAGGGTCTGC GCCGGCACAT CCACGCCACC CGCGACCGGC TGTTCGAGCT GCGCCGCGCG GTGGTGCCGC GCTCGCGGCG CCGGGGCGGC AAGAAGGAGG CCGACGCGCC GGTCAACAAA GAGCCGCCCG CGGTCGGCGA TACGGTGCAG GGCCGGGTTC GCTCGGTGGC CGCGTTCGGC GCCTTCGTGG TGCTGCCCAG CGGCCACGTC GGCCTGGTGC ACAAGAGCGA GCTGCGCACC GGTCAGGTCA ACGACGCGAC CGAAGAGGTC AATGTCGGCG ACGAGTTCGA GGTCAAAGTG CTCGATGTCT CGCCTGACGG CGACTCCAAT CGGCTGCGCA TCGGCCTGTC GCGGCGCGTG CTGATGCCGG GCGGCGACAA GGAGCGCGAG CGCGAGCGCG CCAAAGGCGA TGAGCGCGGC AAGGGCGGCG GACGTGGTCG CGGCGGCGGC GGCGGTCGCG GCGGTGGCGA GCGCGGCAAG AGTGGTGGCG GCGGCGGCGG TGGCGGCGGC GGCGGTCGCG GTCGCGGCGC TGGCGAGCGC GGCAAAGGCG GCGGCGGTCG CGGACCCCGG CGCGGTGGCG GCGACGAGCC TCGCGGCGCC CGCGACGACG GCGATCGCCG GCCGCGGCGC GACGGCGACG ACCGGCGCTC GCCGCGCGAG AGCGAGGCCG ATCGCAAGAA GCAGGAGAAG CTCGCCAGCC TGGGCGAGAT GCTCCTGGCC AAGATGAACG AGGACGACAG CAAGAGCTGA
|
Protein sequence | MENVSTRDEE QNQSESAQAA PASTESPAAE SATSESADAA SLQADESAAE AAEGGEAGAS SAENESQAAE GGEAAEGGDA AEGGKAPLRK SRSRGGRRGG RGSGGGGQND PTPELLKLVD LSVKFPDIGP ALAELAFKLG NSEIGERVLS MGLSSDRPGL EFYFVAAHAA RRERRYEDAL RASIDAVRAY ADAADDAIAA DDAQRLLHLV RLGFNTLMFE IGDVEKHPWF TRDLIAELTR AEPRLGEDSF FRSLLAQALW FEDRERSEAE WQRAHELGAP ETTWNARGTW YREAENDLVK AEEAYRKGLE AASDSALLNH NLAQVLIDRA RGLEDDKEQV KALLQEAETL LRDALREDGP KGLRRHIHAT RDRLFELRRA VVPRSRRRGG KKEADAPVNK EPPAVGDTVQ GRVRSVAAFG AFVVLPSGHV GLVHKSELRT GQVNDATEEV NVGDEFEVKV LDVSPDGDSN RLRIGLSRRV LMPGGDKERE RERAKGDERG KGGGRGRGGG GGRGGGERGK SGGGGGGGGG GGRGRGAGER GKGGGGRGPR RGGGDEPRGA RDDGDRRPRR DGDDRRSPRE SEADRKKQEK LASLGEMLLA KMNEDDSKS
|
| |