Gene Information |
Locus tag | EcHS_A2752 |
Symbol | rluD |
ID | 5595205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2774645 |
End bp | 2775625 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640921869 |
Product | 23S rRNA pseudouridine synthase D |
Protein accession | YP_001459388 |
Protein GI | 157162070 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
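
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted as a residue. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2774645
end_bp = 2775625


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon, so it adds no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints: 981 326
```

Both values agree with the Gene Length (981 bp) and Protein Length (326 aa) rows.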
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000000000325368 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACAAC GAGTACAGCT CACTGCAACG GTGTCCGAAA ACCAACTCGG TCAACGCTTA GATCAGGCTT TGGCCGAAAT GTTCCCGGAT TATTCACGTT CGCGAATAAA AGAATGGATC CTCGACCAGC GAGTGCTGGT TAACGGCAAA GTTTGTGATA AGCCGAAAGA AAAAGTATTG GGTGGCGAGC AGGTTGCCAT CAACGCTGAG ATTGAAGAAG AAGCGCGTTT TGAACCGCAG GATATCCCGC TGGATATCGT CTATGAAGAT GAAGACATTA TTATCATTAA TAAACCGCGC GACCTGGTGG TACATCCTGG CGCGGGTAAC CCGGATGGCA CGGTACTGAA TGCGTTGCTT CATTACTATC CACCCATTGC CGATGTACCG CGTGCGGGCA TCGTCCATCG TCTGGATAAA GACACCACTG GCCTGATGGT TGTGGCAAAA ACCGTTCCGG CTCAGACGCG TTTAGTCGAA TCTTTGCAAC GGCGTGAAAT TACTCGTGAG TATGAAGCGG TGGCGATTGG TCATATGACC GCAGGTGGCA CGGTGGACGA GCCAATCAGT CGCCACCCGA CCAAACGTAC CCATATGGCG GTGCATCCGA TGGGCAAACC AGCGGTGACT CACTATCGCA TCATGGAACA CTTCCGTGTG CACACGCGTC TGCGGTTGCG TCTGGAAACT GGACGTACGC ACCAGATCCG CGTGCATATG GCCCATATCA CTCATCCGCT GGTGGGCGAT CCGGTTTATG GTGGCCGTCC GCGTCCGCCA AAAGGTGCTT CGGAAGCATT TATCTCCACG CTGCGTAAGT TTGACCGCCA GGCGCTACAT GCAACCATGC TGCGTCTTTA TCACCCGATC TCCGGCATCG AAATGGAATG GCATGCGCCT ATTCCACAAG ATATGGTGGA GCTGATTGAG GTGATGCGCG CCGATTTCGA AGAACATAAG GATGAAGTGG ACTGGTTATG A
|
Protein sequence | MAQRVQLTAT VSENQLGQRL DQALAEMFPD YSRSRIKEWI LDQRVLVNGK VCDKPKEKVL GGEQVAINAE IEEEARFEPQ DIPLDIVYED EDIIIINKPR DLVVHPGAGN PDGTVLNALL HYYPPIADVP RAGIVHRLDK DTTGLMVVAK TVPAQTRLVE SLQRREITRE YEAVAIGHMT AGGTVDEPIS RHPTKRTHMA VHPMGKPAVT HYRIMEHFRV HTRLRLRLET GRTHQIRVHM AHITHPLVGD PVYGGRPRPP KGASEAFIST LRKFDRQALH ATMLRLYHPI SGIEMEWHAP IPQDMVELIE VMRADFEEHK DEVDWL
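
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus two simple sanity checks. The helper names are hypothetical, the example string is a toy sequence rather than the full gene, and the start-codon test is a simplification: translation table 11 also permits alternative start codons such as GTG and TTG. How the record's GC content value was computed is not stated, so `gc_percent` is only one plausible definition.

```python
def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq, stops=("TAA", "TAG", "TGA")):
    """Rough check: whole codons, ATG start (simplified), terminal stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# Toy input in the same block layout as the record (not the real gene).
toy = clean_sequence("ATGGCACAAC GATGGTTATG A")
print(len(toy), looks_like_cds(toy), round(gc_percent(toy), 1))
```

To check the real record, paste the full Gene sequence value into `clean_sequence` and compare the resulting length with the Gene Length row.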