Gene Information |
Locus tag | Avin_11940 |
Symbol | rluD |
ID | 7760136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1145900 |
End bp | 1146910 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643804095 |
Product | Pseudouridine synthase |
Protein accession | YP_002798397 |
Protein GI | 226943324 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
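
The length fields above are consistent with inclusive coordinates: the gene length equals End bp minus Start bp plus one, and the protein length equals the number of codons minus the terminal stop codon. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1145900, 1146910)
print(length, protein_length_aa(length))  # 1011 336
```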
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGAAA CCAAAGGCCA TGCCCCGCCG TTGCCGAGCA TGTCAATACA CAACGATCAG GCTATCAGAC TTCGCGCCGA AGTCCCCGTC GAGCTGGGCG GCCAACGCCT GGACCAGGTC GCCGCACAAC TCTTCGCCGA GCATTCGCGC TCGCGCCTGG CCGCCTGGAT CAAGGACGGG CGACTGACCG TCGATGGCGC CGCGCTGCGC CCCCGCGACA TCGTGCATGG CGGCGCGCTG CTCGAACTGG ATGCCGAGCA GCAGGCGCAG GGCGAGTGGA TCGCCCAGGC CATCGAACTG GACATCGTCC ACGAGGACGA GCAGATCCTT GTGCTGAACA AGCCGGCCGG GCTGGTCGTG CATCCGGCTG CCGGGCATGC CGACGGCACC CTGCTCAATG CGTTGCTGCA CCATGTGCCG GGGCTGGTCA ATGTGCCGCG CGCGGGCATC GTCCACCGTC TGGACAAGGA CACGACCGGC CTGATGGTGG TGGCCAAGAC CCTGCAGGCG CAGACCCGGC TGGTCGAGCA ACTGCAAAAG CGCAGTGTCA GCCGCATCTA CGAGGCCATA GTGGTCGGTG TGATCACCGC CGGCGCCACC ATCGATGCGC CCATCGGCCG GCATGGCCAG CAGCGCCAGC GCATGGCGGT GGTCGAGGGC GGCAAGCCCG CGGTCAGCCA CTACCGTGTG CTCGAACGCT TCCGTGCGCA TACCCATGCC CGTATCAAGC TGGAAACCGG GCGAACCCAC CAGATCCGCG TGCACATGGC GCACGTCGGC TATTCGCTGG TGGGCGATCC GGTCTATGGC GGGCGTTTCC GCATTCCGCC GGCGGCCAGT CCGACCCTGG TGCAGGCGCT TCGGGAGTTT CCGCGCCAGG CCCTGCACGC CCGCTTCCTC GAACTGGATC ACCCGGCCAG CGGCGAGCGC CTGAAATGGG AGGCGCCGCT GCCGGACGAT TTCGTCTGGC TGTTGACACT GCTGCGCCAG GACAACGAGG CGTTCGTCTG A
Protein sequence | MVETKGHAPP LPSMSIHNDQ AIRLRAEVPV ELGGQRLDQV AAQLFAEHSR SRLAAWIKDG RLTVDGAALR PRDIVHGGAL LELDAEQQAQ GEWIAQAIEL DIVHEDEQIL VLNKPAGLVV HPAAGHADGT LLNALLHHVP GLVNVPRAGI VHRLDKDTTG LMVVAKTLQA QTRLVEQLQK RSVSRIYEAI VVGVITAGAT IDAPIGRHGQ QRQRMAVVEG GKPAVSHYRV LERFRAHTHA RIKLETGRTH QIRVHMAHVG YSLVGDPVYG GRFRIPPAAS PTLVQALREF PRQALHARFL ELDHPASGER LKWEAPLPDD FVWLLTLLRQ DNEAFV
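
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an assumption-laden illustration, not the database's own method: it builds the standard codon table in plain Python (translation table 11 assigns the same amino acids to codons as the standard table and differs only in the start codons it permits), and the names `CODON_TABLE` and `translate` are hypothetical. Only the first 20 bases of the gene sequence are used as input.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence above, including the grouping space.
print(translate("ATGGTCGAAA CCAAAGGCCA"))  # MVETKG
```

The output matches the first six residues of the protein sequence listed above.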