Gene Noc_2118 details


Gene Information

Locus tag: Noc_2118
Symbol: truB
ID: 3704428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2439503
End bp: 2440429
Gene Length: 927 bp
Protein Length: 308 aa
Translation table: 11
GC content: 51%
IMG OID: 637738593
Product: tRNA pseudouridine synthase B
Protein accession: YP_344108
Protein GI: 77165583
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0130] Pseudouridine synthase
TIGRFAM ID: [TIGR00431] tRNA pseudouridine 55 synthase
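
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2439503, 2440429)
print(length)                     # 927
print(protein_length_aa(length))  # 308
```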


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.206379
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAC AACGCCGCTT TCAAGGCCAG GATATTCACG GAATGCTCCT ATTGGATAAA 
CCCGTAGGCA TCAGCTCTAA TGGTGCTCTG CAGCGAGTTA AGCAGATTTA CCAAGCCCGA
AAAGCAGGAC ATACAGGCAG TCTAGACCCT CTTGCAAATG GATTGTTGCC AATTTGTTTG
GGGGAGGCAA CTAAACTGTC GGGGTTTTTG CTAGAAGCCG ATAAGCGTTA TCAGGTGATG
TGCCGTCTCG GCGTAGTGAC TACCACTGGG GACGCTGATG GCGAGGTGCT AGAAACCCAT
CCTGTCAACG AGTTAGATAG GGACGAGGTG GCGAAATTTT TGTCCGGCTT CTCTGGCCCG
CAAGAGCAAG TGCCTCCCAT GTATTCGGCG ATTAAACACC AGGGCCAGCG GCTCTATAAA
CTTGCCCGCC AAGGAATTGA AGTGGAGCGC AAATCTCGCC AGGTGACTAT CCATACCATT
AAATTGACGG AATTAGTTAA TAATGAGCTA GGGTTTGAGG TCTTTTGCTC TAAGGGAACT
TATATTCGTA CCTTGGCTGA AGACATCGGT CGGGCGCTAG GATGCGGCGC CCATGTTATA
GCGCTGCGCC GTACCCAAGT CGGTTCTTTT GGTGCTTCGG ATATGATCTC TCTGGAGGAG
CTGGAAATGC TAGCCGAGAC GAATGTTGAG GCACTCGGAA ATTTGCTGTT ACCCGTAGGG
CAAATTTTGG CGGATTGGCC TGCAGTAAAT CTAATCGCCG ATTTAGCCTA CTATCTGCGG
CAGGGGCAAT CGGTACGGGT TCCGCAAGCT CCAAGCGAAG GCTGGGTTCG TTTAATAGAG
TGTGGCAAGG GATTTTTTGG AGTAGGGCGG ATTACGGAGG ATGGACGTAT TGCTCCGCGC
CGTTTGATTT TCAGTCAGAG CGGTTAA
 
Protein sequence
MKKQRRFQGQ DIHGMLLLDK PVGISSNGAL QRVKQIYQAR KAGHTGSLDP LANGLLPICL 
GEATKLSGFL LEADKRYQVM CRLGVVTTTG DADGEVLETH PVNELDRDEV AKFLSGFSGP
QEQVPPMYSA IKHQGQRLYK LARQGIEVER KSRQVTIHTI KLTELVNNEL GFEVFCSKGT
YIRTLAEDIG RALGCGAHVI ALRRTQVGSF GASDMISLEE LEMLAETNVE ALGNLLLPVG
QILADWPAVN LIADLAYYLR QGQSVRVPQA PSEGWVRLIE CGKGFFGVGR ITEDGRIAPR
RLIFSQSG
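
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python with no external libraries; the function names are hypothetical. It uses the standard codon assignments, on the grounds that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. A small GC-fraction helper is included for comparison with the GC content field.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
first_block = "ATGAAGAAAC AACGCCGCTT TCAAGGCCAG"
print(translate(first_block))  # MKKQRRFQGQ
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing the spacing with `clean`) is one way to confirm that both were transcribed consistently.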