Gene Daro_2455 details


Gene Information

Locus tag: Daro_2455
Symbol:
ID: 3568228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 2656524
End bp: 2657546
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 60%
IMG OID: 637680923
Product: ribosomal large subunit pseudouridine synthase B
Protein accession: YP_285660
Protein GI: 71908073
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases
TIGRFAM ID: [TIGR00093] pseudouridine synthase
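
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the reported 1023 bp) and that the CDS ends with a single stop codon. The function names are illustrative and are not part of the database.

```python
# Coordinates copied from the Gene Information fields above.
START_BP = 2656524
END_BP = 2657546


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)  # 1023 340
```

Both values agree with the Gene Length and Protein Length fields of the record.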


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value: 0.155456
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATA AACGCACACC ATTCCGTCAA TCGTCGAACA AATCCGAGGG GGCTGCCGAG 
CGCCGACCGG AAGATGCAGG TCCACGTGTC GGCGCACCGT CGCGTGGACG CGCGGCCGCG
CAGCGTGACC AGGTGCCAGG CGATGCTCAG GGGGCGAAAC CTGCGCCGCG CAGAAAGCCG
GCACCCAATA CCGGCGGAAG AGCCAATCGT GGTAGCGTCG CCCGCGACGG ACGGCCGCTT
GCTGAGGCCA AGCCAGTGCG TTTGCAGAAG GTGCTGGCCG AGGCTGGTGT CGGTTCGCGT
CGCGAAATGG AAGAGTGGAT TGCTGCAGGC AAGGTCAGCG TCAATGGCGT CGTGGCGACT
GTAGGGCAAT CGGTTGTGAA TTCCGACAAG GTCAAAATTG GTGGCCGCCT GATCAATATC
CGCTTTACGG GCAGTTCTCG TCCGCCGCGC GTCTTGATGT ATCACAAGCC GGAAGGCGAA
ATCGTTTCGC GCGACGACCC GGATGGTCGG CCGTCGGTGT TTGCTGCGCT GCCACGGATG
CGCGGTGGGC GGTGGATCAA TGTCGGTCGC CTCGACTTCA ATACCTCGGG TTTGCTGTTA
TTCACTACTT CTGGTGAGCT GGCCAACAAA CTGATGCATC CGAGTTCGGA ACTGGTTCGC
GAGTACGCCG TGCGTGTTCT TGGTGAACTG ACCCTGGATG CACAACAGAA GCTGTTGCAC
GGCGTCGAAC TGGAAGATGG TCGAGCGAAC TTTGGTTCGC TACACGACGG TGGTGGCGAG
GGGGCGAACC ACTGGTACCG AGTAACGATC TCCGAGGGGC GTAACCGTGA GGTCCGTCGC
ATGTTCGAGG CGGTCGGTTG CACGGTCAGC CGATTGATTC GCGTTCGCTA TGGCCCGTTC
ATCCTGCCGC CGCAACTGAA ACGAGGTATG GCCCGCGAGT TGAAAGAGGC GGAAATCAAA
ATGCTGATGC GAGAACTCGA AAACATGCCA TCGTCTCAGC GAAAAGGCCC TGAAGGCACG
TAA
 
Protein sequence
MKNKRTPFRQ SSNKSEGAAE RRPEDAGPRV GAPSRGRAAA QRDQVPGDAQ GAKPAPRRKP 
APNTGGRANR GSVARDGRPL AEAKPVRLQK VLAEAGVGSR REMEEWIAAG KVSVNGVVAT
VGQSVVNSDK VKIGGRLINI RFTGSSRPPR VLMYHKPEGE IVSRDDPDGR PSVFAALPRM
RGGRWINVGR LDFNTSGLLL FTTSGELANK LMHPSSELVR EYAVRVLGEL TLDAQQKLLH
GVELEDGRAN FGSLHDGGGE GANHWYRVTI SEGRNREVRR MFEAVGCTVS RLIRVRYGPF
ILPPQLKRGM ARELKEAEIK MLMRELENMP SSQRKGPEGT
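
The relationship between the gene sequence and the protein sequence can be spot-checked by translating excerpts of the gene. The sketch below is illustrative only: it uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which is not modelled here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last lines of the gene sequence are used rather than the full 1023 bp.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 63 bases of the gene sequence above.
FIRST_LINE = "ATGAAAAATA AACGCACACC ATTCCGTCAA TCGTCGAACA AATCCGAGGG GGCTGCCGAG"
LAST_LINES = "ATGCTGATGC GAGAACTCGA AAACATGCCA TCGTCTCAGC GAAAAGGCCC TGAAGGCACG TAA"

print(translate(FIRST_LINE))    # MKNKRTPFRQSSNKSEGAAE
print(translate(LAST_LINES))    # MLMRELENMPSSQRKGPEGT
print(gc_fraction(FIRST_LINE))  # 0.5
```

The two translated excerpts match the first and last 20 residues of the protein sequence. The GC fraction printed here applies only to the 60-base excerpt, not to the whole gene; passing the full gene sequence to `gc_fraction` would be the way to compare against the 60% reported in the record.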