Gene Nmul_A1976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1976 
Symbol 
ID3785000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2273070 
End bp2274026 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content58% 
IMG OID637812065 
Productpseudouridine synthase, RluD 
Protein accessionYP_412663 
Protein GI82703097 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAACTGA TCATTCCCGA CCATTGCGCC GGCCTCCGGC TGGATCAGGC CCTGGCGCAA 
CTATTGGCCG AATGGTCTCG CAGCCGGCTG CAATCTTGGA TTCTGGAAAA GAGGGTAAGC
GTAGATGGCG CATGCAGCCT TCCCCGGCAG AAAGTGTGGG GTGGGGAAAA GATCGTACTC
TCTCCCGCAC GCGATCCTGC TGAAACTGCG CATGAGCCGG AAGCCATTGC GCTGGATATT
GTCCATGAAG ATCACGCCAT CATTATCATC GATAAGCCCG CGGGGCTCGT CGTCCATCCC
GGCAGCGGAA ACTGGCAGGG CACCCTGCTC AATGCATTGC TGCACCATTC CCCTCAATTA
AGCGGCATAC CGCGCTCCGG TATCGTTCAC CGTCTGGACA AGGAAACCAG CGGTCTCCTG
GTAGTGGCAA AAACCCTGGA AGCCCAAACC AGCCTGGTGC GCCAGTTGCA AAAGCGCACG
GTCAAACGGG AGTATCTGGC GCTGGTCTGG GGCAGCGTTT CCTCCCACGG AAGGGTTGAC
GCTCCGGTCG GCCGCCATCC GGTACAGCGG ACCAGAATGG CAGTAGTCGC GAGTGGCAAG
GAAGCGCGCA CACGTTACGA GGTATTGGAG CAATTCACCG ATTGCACCTT GCTCCGGTGC
GGACTGGAGA CAGGGCGCAC CCATCAGATA CGCGTGCACA TGCAGTCTCT CGGCCATCCC
CTGGTGGGAG ATCCGTTGTA TGGCGGCAAA GCAAAAAAAG GCAGCAGTGC GACGATGCAG
TTGGCTGCTT TTCCCCGGCA GGCGTTGCAT GCCCACAAGC TGGAATTGAC GCATCCGCAG
AACGGCCAGA GAATGGGATG GGAAGCGCCA TTGCCGGAAG ACATGAGCAA CCTGCTGCTG
ATGCTTCAGA AAGCGCGTGA TAAAGAATCC CATGCAATTC CAGCCATGAT CAAATGA
 
Protein sequence
MELIIPDHCA GLRLDQALAQ LLAEWSRSRL QSWILEKRVS VDGACSLPRQ KVWGGEKIVL 
SPARDPAETA HEPEAIALDI VHEDHAIIII DKPAGLVVHP GSGNWQGTLL NALLHHSPQL
SGIPRSGIVH RLDKETSGLL VVAKTLEAQT SLVRQLQKRT VKREYLALVW GSVSSHGRVD
APVGRHPVQR TRMAVVASGK EARTRYEVLE QFTDCTLLRC GLETGRTHQI RVHMQSLGHP
LVGDPLYGGK AKKGSSATMQ LAAFPRQALH AHKLELTHPQ NGQRMGWEAP LPEDMSNLLL
MLQKARDKES HAIPAMIK