Gene Noc_2844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2844 
Symbol 
ID3705536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3221119 
End bp3222093 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content49% 
IMG OID637739320 
Productpseudouridine synthase, RluD 
Protein accessionYP_344820 
Protein GI77166295 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.546073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACTG CGAAAAAACC ACCAGCGGCC CAAGTCCGCA TCCTGAAAAT ACATCTAGAG 
CAGGCAGGCC AACGGATCGA CAATTTTTTG TTTTCCCAGC TCAAAGGAGT TCCCAAGAGT
AGGATTTACC GTTGCCTCCG CAAGGGAGAG GTCCGAGTTA ATAAATCCCG CATCGCTAGT
AGCTACCGCC TACAACAAGG AGACCAGGTC AGGGTTCCGC CACTGCGCGT ATCTCATTCC
CCCCTTAAGA GACCGATAGC TCACTCCCTC TTGCTCCTAA TTAAGAACAG CTTATTGTAT
GAGGATAAGG AGCTACTCAT GCTCAACAAA CCGGCCAGAA TCCCGGTCCA CGGGGGGAGT
GGGGTAAGTT ATGGAATTAT TGAGGCGCTG CGGATTTTGC GCCCAGAGGC CGCCTTTTTG
GAGCTAGTAC ACCGTTTAGA CCGAGAAACT TCCGGTTGCC TTATGGTAGC CAAAACCAGA
AGCGCCTTAC TGACGCTCCA AGCAATGCAG CAAAAACAAC TTATCCACAA ACGCTATCTT
GCCCTCGTCA AAGGGCGTTG GCGAAAAAAT GTCCAGCGGG TAGACTTGCC CCTACTAAAA
AATATATTAC GTTCTGAAGA ACGGATAGTT AAGGTGAATT CAGCTGGCAA GCCCGCTATT
AGTCACTTCT ACCCCAAACT TTTTTATGGA GGTGAAGCAA CCCTTATGGA GGTTACCTTA
GAAACCGGTC GCACACATCA GATACGGGTA CACGCCGCTC ACCTTGGACA CCCTCTCGGT
GGCGATGAAA AATATGGTGA TTCCAGCTTC AATAAGCAAT TACGATGTAT GGGATTACAT
CGACTATTTC TCCATGCTAG CCAGCTCACC TTTACTCTGC CGGAAGGGAA AAAGACAATA
AAAGCCGCCG CCCCTCTGCC CAAGGAATTA AATGCTGTGC TGCAAACATA CGCTGCCAAG
CTCAGAGCTA AATAA
 
Protein sequence
MNTAKKPPAA QVRILKIHLE QAGQRIDNFL FSQLKGVPKS RIYRCLRKGE VRVNKSRIAS 
SYRLQQGDQV RVPPLRVSHS PLKRPIAHSL LLLIKNSLLY EDKELLMLNK PARIPVHGGS
GVSYGIIEAL RILRPEAAFL ELVHRLDRET SGCLMVAKTR SALLTLQAMQ QKQLIHKRYL
ALVKGRWRKN VQRVDLPLLK NILRSEERIV KVNSAGKPAI SHFYPKLFYG GEATLMEVTL
ETGRTHQIRV HAAHLGHPLG GDEKYGDSSF NKQLRCMGLH RLFLHASQLT FTLPEGKKTI
KAAAPLPKEL NAVLQTYAAK LRAK