Gene Nmul_A1065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1065 
Symbol 
ID3784885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1232624 
End bp1233610 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content54% 
IMG OID637811149 
Productpseudouridine synthase, RluD 
Protein accessionYP_411760 
Protein GI82702194 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.901288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGGAAA TCATAGGGGA GGAGAGCAAA GGGCAGCGTA TTGATAATTT TTTAATCAAA 
CGCTTGAAAA ACGTGCCCAA AAGTCACGTC TACCGGTTGT TGCGCAGCGG GCAGGTGCGC
ATCAACAGCA AGCGCGCCCC CCCGGACTAC CATCTCCAAT CCGGAGATAT TGTTCGCATA
CCCCCGGTGA GAACGGTGGA AAAATCGGCG CTGCCCCCGA AGAAATTGAG CAAACCGGGT
TTTATTGCAT TCCAGGTTTT GTTCGAGGAT GATGCACTGA TTGCTGTCAA CAAGCCTCCG
GGAGTTGCGG TGCATGGTGG AAGCGGCATA AGTTTCGGCG TGATAGAGCA ATTGCGCGCT
CAACATCCTG ACTGGAGATT TCTGGAGCTC GCACATCGCC TGGACAGGGA AACTTCTGGC
GTGCTGCTCC TTGCCAAGAA CCGGGCGGCA CTTGTAGAGT TGCATCGGCA ACTCCGCATG
GGAGAGGTGG AAAAACACTA CCTGACCCTG GTCAAGGGCA GGTGGCGTAA TGGGCGGCAG
AGTGTCAGGC TGTCGCTCAG GAAATATCTG ACACCCGGCG GCGAACGGAG GGTAGCGGTG
GAAAAGGATG CAGATGAAAA AAAAGGTGGA ATGAGCGCTC ATACCGTTTT CATCCTGCGG
GAATCATGGC AGAGCTTCAG CCTGCTGGAG GCTGAACTGA AAACCGGGCG TACGCATCAG
ATTCGTGTGC ACCTTGCCTA TCTCGGTTTC CCTATAGCGG GAGACGACAA ATATGGCGAT
TTTGTCTTGA ATAAGGATAT TGCCCGGCGT GTTCCTGGTT TGGGACGGAT GTTTCTCCAT
GCTTGGGCGG TCGAATTCAC GCATCCCGTC ACGCATGAGA AACTTCGTCT TGAAGCGCCC
CTGCCGGACG ATCTGCAAAA ATTTCTGGAT GTGATGAATA ACCCCGATAA ACCACCGAAG
CTTCCGGCGG AAAGGACATT CTCCTGA
 
Protein sequence
MREIIGEESK GQRIDNFLIK RLKNVPKSHV YRLLRSGQVR INSKRAPPDY HLQSGDIVRI 
PPVRTVEKSA LPPKKLSKPG FIAFQVLFED DALIAVNKPP GVAVHGGSGI SFGVIEQLRA
QHPDWRFLEL AHRLDRETSG VLLLAKNRAA LVELHRQLRM GEVEKHYLTL VKGRWRNGRQ
SVRLSLRKYL TPGGERRVAV EKDADEKKGG MSAHTVFILR ESWQSFSLLE AELKTGRTHQ
IRVHLAYLGF PIAGDDKYGD FVLNKDIARR VPGLGRMFLH AWAVEFTHPV THEKLRLEAP
LPDDLQKFLD VMNNPDKPPK LPAERTFS