Gene Information |
Locus tag | Haur_1936 |
Symbol | |
ID | 5733825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2345455 |
End bp | 2346900 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279080 |
Product | NOL1/NOP2/sun family RNA methylase |
Protein accession | YP_001544707 |
Protein GI | 159898460 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00446] NOL1/NOP2/sun family putative RNA methylase |
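The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts one stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Start bp and End bp as listed in this record.
length = gene_length_bp(2345455, 2346900)
print(length, protein_length_aa(length))
```

With the values from this record the sketch gives 1446 bp and 481 aa, matching the Gene Length and Protein Length fields.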
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00306451 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACGTC AACTACCGCG ACAGCTTGAA GCCTATCGCG ATTTACTGAA TGAAACCGAG CTTAACCAAC TGTTGGAGAG CATCAACCAA CCATTGCCCA GCGGCCTGCG CACCAATCCG CTCAAGGCGA GTGCAAACGG CCCGCAAGCC TGGCAACAAC AGTATGGCTG GCAACTAGAG CAAGTGCCAT TTTGCCCAAC GGGATGGCAA CTCCGCAACG AGGTTGGCAA CTTAAGCCGC ACCGTCGAGC ATCAAATGGG CCATTACTAC ATCCAAGATG CCGCCTCGAT GTTGCCCGTT GAGCTATTTG AATTGCCCCC TGAGGCCGAA CCAAGTGTGC TGGATTTGAC CGCTGCTCCT GGCGGCAAAA CCACCCATAT CATCTCGCAA CTGCAAGATC GTGGGCTGGT GGTGGCCAAC GACAGCAACT TACAACGGAT TGCTGGCCTC AAAGGCAATA TTCAGCGCTG GGGCAGCACC TCGGCGGTGA TCACTAACCA ACCAGGCGAG CGTATGGGTC GTTGGTTGCC CGAACAATTC GATTATGTTT TATTGGATGC GCCGTGTAGT GGTGAAGCGC TGCGCACCAG CGAACGCCAC ACCAGCCGTC TGGTTTCGAG CCACGAACGC AACACCCTAC AACAACGCCA AATTAAGTTG CTTGAAAGTG CCTTGCAAGC GGCGCGACCG AATGGCCATG TGGTTTATTC GACCTGTAGC CTTGCGCCCG AAGAAGATGA GGCGGTGCTT GATGCGCTGC TCAAACGCTA CCCTGAGCAA ATTCAGATTA TGTCAATTCC AACCAGCGTG CCAATTGAAG CGCCTGGCTT GTTGGCGGCA GGCTCACAGC ATTACGATCC CAGCATCGCC AAGGCCTTGC GTTTATGGCC GCATCTCTAC AACACCGCCG GATTTTTCGC TGCGCTGATT ATCAAGCGTG ATCAGATTGC CACGCCCAGC CTAGAGCGCC CGCAACAAAC CCTAGCCAAA GCTGGCTACA AAGCAGTTAC TGCCGACGAA CAGCGCCAAA TTCTTGATAC ATTGCACGCT GTGTATGGCT TTGATCTGGG TAAAATTTTA GAAGCCAACG GCCTGAGTTT ATGGCGTAAC GGCAAAACAA TTCAGGCGAT TCCAGAGCGC TGGCTCAACA ATTTCGAGCA CTTTCCATTT GTCAGTGCAG GGATTCAGGT TGGCAAACTA TCGCGGCATG GATTCCAACC AGCCCATGAT TTGGCTTCGC GCTATGCCGA GCAATTCAGC CAACAATGGC TCACGATCAA CGATCAACAG ATTGACGATT GGCTCGCCCG CCGCGATCTA CCGCTGAAAA CCAGCACGTA TGCAGCTGGC AGCATCGTGG TGGTGCGCGA TCAGCAGCAG CGCTACCTTG GTTTAGGCCA AATCGACGGC AACAGCCTAG AAAATTTGCT GCCGCATTGG CAATAG
|
Protein sequence | MTRQLPRQLE AYRDLLNETE LNQLLESINQ PLPSGLRTNP LKASANGPQA WQQQYGWQLE QVPFCPTGWQ LRNEVGNLSR TVEHQMGHYY IQDAASMLPV ELFELPPEAE PSVLDLTAAP GGKTTHIISQ LQDRGLVVAN DSNLQRIAGL KGNIQRWGST SAVITNQPGE RMGRWLPEQF DYVLLDAPCS GEALRTSERH TSRLVSSHER NTLQQRQIKL LESALQAARP NGHVVYSTCS LAPEEDEAVL DALLKRYPEQ IQIMSIPTSV PIEAPGLLAA GSQHYDPSIA KALRLWPHLY NTAGFFAALI IKRDQIATPS LERPQQTLAK AGYKAVTADE QRQILDTLHA VYGFDLGKIL EANGLSLWRN GKTIQAIPER WLNNFEHFPF VSAGIQVGKL SRHGFQPAHD LASRYAEQFS QQWLTINDQQ IDDWLARRDL PLKTSTYAAG SIVVVRDQQQ RYLGLGQIDG NSLENLLPHW Q
|
| |
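The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a rough sketch of that cleanup plus a basic coding-sequence sanity check; the helper names and the set of accepted start codons are assumptions made for illustration, and the example input is a toy string rather than the full record sequence.

```python
START_CODONS = {"ATG", "GTG", "TTG"}  # common bacterial starts; assumed sufficient here
STOP_CODONS = {"TAA", "TAG", "TGA"}   # stop codons under translation table 11


def join_blocks(text: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(text.split()).upper()


def looks_like_cds(seq: str) -> bool:
    """True if seq has whole codons, a start codon, and a single terminal stop."""
    if len(seq) < 6 or len(seq) % 3 != 0:
        return False
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    return (codons[0] in START_CODONS
            and codons[-1] in STOP_CODONS
            and not any(c in STOP_CODONS for c in codons[1:-1]))


# Toy input built for illustration, not the full record sequence.
toy = join_blocks("ATGACACGTC AATAG")
print(len(toy), looks_like_cds(toy))
```

The same two calls could be applied to the full Gene sequence field after copying it out of the record.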