Gene Daro_3468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3468 
Symbol 
ID3567336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3717609 
End bp3718748 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content63% 
IMG OID637681940 
Productpseudouridine synthase, Rsu 
Protein accessionYP_286667 
Protein GI71909080 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 
TIGRFAM ID[TIGR00093] pseudouridine synthase 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value0.242566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.43397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA ACGCCGGTCA AGACGATCCC TGGGCCAAAT GGCGCAAGCC GGAAGTCCCC 
GCTGCGGTCG AGGCTGAAAC CAAGCCGAAA GCCAAACCGG TTGAAGCGGT AGAGGCTGCG
GCGCCGAGCG AGGCTGCAGT CAAGCCGGAA GGAACCCTGG GGGTACGGAA GTCGGCGACG
GTAGCCGAGC GTCCGGGTCA CCGCAAGCCG ATCCGGGGTG GTGCCGTGCC GAAGCGGACG
ACCATGGCGG ACAAGGTGGC GATCGCCGAG CGCTTGGCAG ACAAGCCGGC GCGATCGGCG
ACTGCCAAGA CGCGCGCTGA TGCGCCACGT GAAAATCCGT GGAAGAAGGC CGCGCCTGTA
CCTCCGTCAC GAACCCCCGC GGTGGCCGCG AAGCCGCCTG GGCCAAGCCC GGAAGGCGTC
CGGCTTTCCA AGGTGATGTC CGAGCGCGGC ATGTGTTCGC GCCGCGAGGC CGATCTGTGG
ATTGAGCGTG GCTGGGTGTT TGTCAATGGC GAGCAGGTCA GCGAGCTGGG TTCGCGGATC
GACCCATCGG TATCCGAGAT CACCGTTTCG CAGGAAGCCA AGAAGGATCA GGCCAAGGCG
GTCACCATTC TGCTGCACAA GCCGGTCGGT TATGTGTCCG GACAGCCCGA GCCGGGTTGT
ATTCCGGCGG TGACCTTGAT TACGGCGGAA ACGCAGGTCG AGCAATCGGG TGGTCCGGAA
TTCAAGCCGT GGATGTTGCG TGGTCTGGCG CCGGCCGGTC GCTTGGACAT CGATTCGACC
GGCTTGCTGG TGTTGACCAG CGATGGTCGT GTCGCCAAGC GCCTGATTGG CGAGGATAGC
GAGGCAGAGA AGGAATATCT GGTTCGTGTT TCCGGCGAAA TGATCAAGGG TGGTCTGGAC
TTGCTGCGCC ATGGTTTGGA ACTTGATGGC AAGCCGTTGA AGCCGGCTTG GGTCAAGCAG
TTGAACGAAG ACCAGTTGCA CATCATCCTG AAGGAAGGCA AGAAGCGCCA GATTCGCCGC
ATGTGCGAAC TGGTTGGTTT GCAAGTGATC GGCCTGAAGC GGGTGCGCAT CGGGCGGATC
AGGCTGGGCG ATTTGCCGAT GGGGCAATGG CGCTTCCTGC GGGCTGACGA GGCGTTCTGA
 
Protein sequence
MSDNAGQDDP WAKWRKPEVP AAVEAETKPK AKPVEAVEAA APSEAAVKPE GTLGVRKSAT 
VAERPGHRKP IRGGAVPKRT TMADKVAIAE RLADKPARSA TAKTRADAPR ENPWKKAAPV
PPSRTPAVAA KPPGPSPEGV RLSKVMSERG MCSRREADLW IERGWVFVNG EQVSELGSRI
DPSVSEITVS QEAKKDQAKA VTILLHKPVG YVSGQPEPGC IPAVTLITAE TQVEQSGGPE
FKPWMLRGLA PAGRLDIDST GLLVLTSDGR VAKRLIGEDS EAEKEYLVRV SGEMIKGGLD
LLRHGLELDG KPLKPAWVKQ LNEDQLHIIL KEGKKRQIRR MCELVGLQVI GLKRVRIGRI
RLGDLPMGQW RFLRADEAF