Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcenmc03_1473 |
Symbol | |
ID | 6123151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia MC0-3 |
Kingdom | Bacteria |
Replicon accession | NC_010508 |
Strand | + |
Start bp | 1628127 |
End bp | 1629887 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641638051 |
Product | RNA-binding S4 domain-containing protein |
Protein accession | YP_001764770 |
Protein GI | 170732823 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases |
TIGRFAM ID | [TIGR00093] pseudouridine synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.780813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACAGATA TCCACGATAT TGAATCGTCC GCGCCTGCCG TGCAGGCAGC GCGTGCCGAC GATGCGCCGG AGCAGGATGC TTCGGCCGCG GGTGGCGACG AGCGTCCCCG CCGCGGTCTG CGGCGCGGCC CGCGTAGCCT GATCGCGCGG CGTCGCGCGG CGGCCAAGTC GAAGGGCGCC GAAGGCGAGC AGCCGGGTGC CGAGGGCGCG GATGCGCAGC CGGCCGAGGC TGCCGACGCG CAGCCGGCGC GTGCGCCGCG CAAGGAAGGC GCGGCCAAGG GCGGTCGCAA GCCGGCCGGC AAGCGCGAAG GCGCGGCGGC GGCCAAGGGC GGCCAGGCTG GCCAGGGCCG TCGCGGTGCG GCGAAGACTG AAGGTGGCCC GGCGAAGGTA GCGGAAGGCG ATGCGTCCCA GGACGAACTG TTCGCATACG TGACGTCGCC GGCCTTCGAC GCGGACAACT CCGCGGGCGG CAGCGGCGTG CGTGCGCCGA TGCTGCGCCG CGGCCGCGCC CAGCCGGCCA ACAAGCGCGT GCTGTCGCCG GACGACGACG CGCCGAAGCT CCACAAGGTG CTCGCGGAAG CGGGCATGGG GTCGCGCCGC GAAATGGAAG AGTTGATCGT CGCCGGCCGC GTGTCGGTGA ACGGCGAGCC CGCGCACATC GGCCAGCGGA TCATGCCGAC CGACCAAGTG CGGATCAACG GCAAGCCGGT CAAGCGCAAG CTGCCGAACA AGCCGCCGCG CGTGCTGCTG TATCACAAGC CGACGGGCGA AATTGTCAGC CACGCGGATC CGGAAGGCCG TCCGTCGGTG TTCGACCGTC TGCCGCCGAT GAAGACGGCG AAATGGCTTG CGGTGGGTCG TCTCGACTTC AATACCGAAG GTCTGCTGAT GTTGACGACT TCCGGTGATC TCGCGAACCG CTTCATGCAT CCGCGTTACA GCGTCGAGCG CGAGTATGCG GTTCGCGTCG TCGGCGAGCT GGCCGAAGGC ATGCGGCAGA AGCTGCTGCA CGGCGTCGAG CTCGACGACG GTCCAGCGAA TTTCCTGCGC ATCCGCGATG GCGGCGGCGA AGGTACGAAC CACTGGTATC ACGTCGCGCT GGCCGAGGGC CGCAACCGCG AAGTGCGCCG CATGTTCGAG GCGGTCGGCC TGATGGTGAG CCGGCTGATC CGTACGCGTC ACGGCCCGAT TCCGCTGCCG CGCGGCCTGA AGCGTGGTCG TTGGGAGGAG CTCGACGACG CACAGGTGCG CAAGCTGATG GCGACCGTCG GCCTGAAGGC GCCGTCCGAG GAGAAGGGCA AGCGCGGTGG CGCTGCCGGT CCGACCGAAC GTCGCCAGCC CGATCCGATG CAGACGTCGA TGGGCTTCAT CAGCCGCGAG CCGGTGCTGA CGACGCACGG GCAACTCGAC CAGCCGCGTC GCGGCGGTGG CCGACGCGGC GGGCCGGGCT TGCCGGGCCT GAGCGGCTAC GGCAGCTTGC CGGTTGCGCC GTCGGGTTAC GGCAACCGCG CGGGCGGCGG CCGTGACGGC AATCGTACCG GCGGTGGCCG CGACGGCAAC CGTGCAGGCG GCGGTCGCGA TGTCGACGGC AATCGCGCAT CGTACGGCGC AGGCCCCAAG CGCGAGGGCG CGGGCGGCAA GCGCGGCGGC AGCAAGGGCG GCGGCAATCG CAATCCGAAC GGCAATCGTG CCGATGGCGC CGGCAACGGT GGCCCGCGCG GCGGTCAGCG TCCGCAGCGC AGTCGTACAC GCAGCCGCTA A
|
Protein sequence | MTDIHDIESS APAVQAARAD DAPEQDASAA GGDERPRRGL RRGPRSLIAR RRAAAKSKGA EGEQPGAEGA DAQPAEAADA QPARAPRKEG AAKGGRKPAG KREGAAAAKG GQAGQGRRGA AKTEGGPAKV AEGDASQDEL FAYVTSPAFD ADNSAGGSGV RAPMLRRGRA QPANKRVLSP DDDAPKLHKV LAEAGMGSRR EMEELIVAGR VSVNGEPAHI GQRIMPTDQV RINGKPVKRK LPNKPPRVLL YHKPTGEIVS HADPEGRPSV FDRLPPMKTA KWLAVGRLDF NTEGLLMLTT SGDLANRFMH PRYSVEREYA VRVVGELAEG MRQKLLHGVE LDDGPANFLR IRDGGGEGTN HWYHVALAEG RNREVRRMFE AVGLMVSRLI RTRHGPIPLP RGLKRGRWEE LDDAQVRKLM ATVGLKAPSE EKGKRGGAAG PTERRQPDPM QTSMGFISRE PVLTTHGQLD QPRRGGGRRG GPGLPGLSGY GSLPVAPSGY GNRAGGGRDG NRTGGGRDGN RAGGGRDVDG NRASYGAGPK REGAGGKRGG SKGGGNRNPN GNRADGAGNG GPRGGQRPQR SRTRSR
|
| |