Gene Gdia_0473 details


Gene Information

Locus tag: Gdia_0473
Symbol:
ID: 6973868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 518134
End bp: 519102
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 69%
IMG OID: 643390006
Product: pseudouridine synthase, RluA family
Protein accession: YP_002274884
Protein GI: 209542655
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
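
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 518134
end_bp = 519102
gene_length_bp = 969
protein_length_aa = 322

# Assumption: coordinates are inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # 969 0 322
```

Under these assumptions the span matches the listed gene length, is a multiple of three, and yields the listed protein length.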


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.000500273
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGACG ACACCCAACC CATCCGCCTG ACGCCCGAGA CGGAGCATGC CGGCCAGCGT 
ACCGACCGTT TCCTGGCCGA TATGGTGGGG ACGCTGTCGC GCTCGCGCGT CAAGGCGCTG
ATGGAGGGCG GCCATGTCCT GCGCGACGGC CATGTCCTGC GCGAACCGGC CGACCCGGTC
CGGGCGGGTC TGTGTTATGA GATAAGGATG CCCCCGGCAA TCCCGGCGAC ACCCCGGGCG
CAGGCCATTC CCTTCGCCAT CCTGTACGAG GATTCGGACC TGATCGTGCT GGACAAGCCC
GCCGGGCTGG TCGTGCATCC CGCGCCGGGC AACGAGGACG GGACGCTGGT CAACGCCCTG
CTGGCGCATT GCGGCGACAG CCTGACCGGC ATCGGCGGCG AACGCCGGCC GGGCATCGTG
CACCGGCTGG ACAAGGATAC GTCGGGCGTC ATGGTGGTGG CCAAGACCGA GCAGGCGCAT
ACCGCCCTGT CGGACGCGTT CGCCGCGCGC GATATCGACC GCACCTATCT GGCGCTGGCC
TGGGGCATCC TGTCACCGGC CAGCGGCACG TTCGAGGGCG CGATCGGCCG CGACAGGCGC
GACCGCAAGC GCATGGCCGT GGTCACGCAC GGCGGCAAGC ACGCCATGAC GCACTACAGG
ACGCTGCACA GCTTCCATGG CGGGATCAGT TCCGTCGAAT GCCGGCTGGC GACGGGCCGC
ACGCACCAGA TCCGCGTGCA TTTTTCCACC AGCGGCCATC CGCTGGTCGG CGACCCGGTC
TATCTGCGCC GCATTCCCGC CGCCGCCCGC GCCCTGCCCG AGGATGCACG CCGCGCGGCG
CTGGATTTTC CGCGCCAGGC ATTGCATGCG GCGCGACTGG GCTTTACCCA CCCCCGCACC
GGCGAATCCC TGCTGTTCGA AACCGCGCCC CCGGACGATT TCAAGACGTT GCTGGCAAAG
ATTGCTTAG
 
Protein sequence
MTDDTQPIRL TPETEHAGQR TDRFLADMVG TLSRSRVKAL MEGGHVLRDG HVLREPADPV 
RAGLCYEIRM PPAIPATPRA QAIPFAILYE DSDLIVLDKP AGLVVHPAPG NEDGTLVNAL
LAHCGDSLTG IGGERRPGIV HRLDKDTSGV MVVAKTEQAH TALSDAFAAR DIDRTYLALA
WGILSPASGT FEGAIGRDRR DRKRMAVVTH GGKHAMTHYR TLHSFHGGIS SVECRLATGR
THQIRVHFST SGHPLVGDPV YLRRIPAAAR ALPEDARRAA LDFPRQALHA ARLGFTHPRT
GESLLFETAP PDDFKTLLAK IA
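
The sequences above are printed in blocks of ten residues separated by spaces, so they need to be joined before any analysis. The sketch below is one possible way to do that and to run a few basic checks; the helper names `clean_blocks` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as sample input. The stop codon set is the one used by translation table 11.

```python
def clean_blocks(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First and last lines of the gene sequence, copied from the record above.
first_line = clean_blocks(
    "ATGACCGACG ACACCCAACC CATCCGCCTG ACGCCCGAGA CGGAGCATGC CGGCCAGCGT "
)
last_line = clean_blocks("ATTGCTTAG")

print(len(first_line))                 # 60
print(first_line[:3])                  # ATG
print(last_line[-3:] in STOP_CODONS)   # True
print(gc_fraction("ATGACCGACG"))       # 0.6
```

Applying `clean_blocks` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare with the GC content listed in the record.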