Gene Nmul_A0988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0988 
SymbolubiB 
ID3786588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1146418 
End bp1147932 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content56% 
IMG OID637811071 
Productputative ubiquinone biosynthesis protein UbiB 
Protein accessionYP_411683 
Protein GI82702117 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID[TIGR01982] 2-polyprenylphenol 6-hydroxylase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTCT TCCGTCTTCT CAAAATACTC GCCGTCTCAT TCCGTTTCGG CCTCGACGAA 
TTCTTCCTGA GCCACGAACG TCTGCGCCTA CTGCGGCCTG TCGTCAGAAC GGCAACGTTC
TGGCGTAAAC TGGATCGTGC GCGGGGCGAA CGTCTCCGCC TCGCCCTGGA GGCTCTGGGT
CCCATCTTCG TCAAGTTCGG CCAGATGCTC TCTACCCGGC GGGATCTTCT GCCGCCCGAT
ATCGCCGACG AACTGGCGAA ACTACAGGAT CAGGTTCCAC CATTTCCCTC CAGCATTGCC
CTCAAGACAC TGGAAGAGGT CTACGGAAAA CCCGTCAACG AAGTTTTTCT GCTGTTCGAT
GTGGAGCCGG TAGCAAGCGC GTCCATTGCC CAGGTTCATC TGGCAGTGCT GCATGATGGT
ACGGAAGTCG CAGTCAAGGT GCTGCGTCCC GGCATCGCGC CTGTGATCGC CCATGATATC
GCGCTGATGG ACACAGGCGC TCTTTTGCTC GAAATAGTCT GGCCGGATGC CAAGCGGCTC
AAGGTACGGG AAGTTGTGAC TGAATTCGCC CGTCATCTCG ATGACGAACT GGATCTCATG
CGCGAAGCCT CCAATTGCAG CCAGTTGCGG CGCAATTTCC TGGATTCTCC CCTGCTTCTG
GTTCCGGAAG TCTACTGGGA TTACTGCTAT TCGAGCGTCA TGGTGATGCA GCGCGTCAAA
GGCACGCCCA TCAGTCATGT CACAGCCTTG CGGGAGCAGG GCGTGGATAT TCCGCGACTT
GCCCGCGTCG GCGTGGAAAT CTTTTTCACC CAGGTATTCC GCGATGGCTA TTTTCATGCC
GACATGCATC CGGGAAACAT CTTTGTCGGC AAGGACGGCC GGTATATCGC TGTCGACTTC
GGCATAATGG GAACCCTTAC CGACGAAGAC AAGAATTATC TCGCGCAGAA TTTCCTGGCT
TTCTTCCGCC GTGACTACAA GCGTGTGGCG GAAGCGCACG TGGAGGCGGG ATGGGCGCCG
AAAAACACGC GAGTCAATGA TTTCGAAACC GCTATCCGGG CAGTATGCGA ACCTATTTTC
GACAAGCCCT TGAGCGAGAT TTCATTTGGG CGGGTATTGC TGCGGCTGTT TCAAACGTCG
CGCCAGTTCA ATGTCGAAAT CCAGCCGCAG CTCGTGATGC TGCAAAAAAC CCTGCTCAAT
ATCGAGGGGC TGGGGCGGGA CCTCGACCCT AATCTCGACC TGTGGACGAC AGCCAAGCCG
TACCTGGAAA ACTGGATGGC GGAGCAGTTG GGCTGGAGAG GGCTCAGCCG CCGCCTGCGC
AAGGAAGCCA CGAGTTGGGC GGTAATCATG CCCCAGTTTC CCCGCCTGAT GCATCATGCC
CTGACGGAAA TACGTACCAG CGCTCTGGAA GAAAAGATGG ACCAGTTCAT CCTGGAGAAA
AAACGTGAGA CCCGGCGCCT CACCATCTTC ATCGTATTAC TGATCATAGT GATACTGTGG
CATCTGGGAA AATAA
 
Protein sequence
MRFFRLLKIL AVSFRFGLDE FFLSHERLRL LRPVVRTATF WRKLDRARGE RLRLALEALG 
PIFVKFGQML STRRDLLPPD IADELAKLQD QVPPFPSSIA LKTLEEVYGK PVNEVFLLFD
VEPVASASIA QVHLAVLHDG TEVAVKVLRP GIAPVIAHDI ALMDTGALLL EIVWPDAKRL
KVREVVTEFA RHLDDELDLM REASNCSQLR RNFLDSPLLL VPEVYWDYCY SSVMVMQRVK
GTPISHVTAL REQGVDIPRL ARVGVEIFFT QVFRDGYFHA DMHPGNIFVG KDGRYIAVDF
GIMGTLTDED KNYLAQNFLA FFRRDYKRVA EAHVEAGWAP KNTRVNDFET AIRAVCEPIF
DKPLSEISFG RVLLRLFQTS RQFNVEIQPQ LVMLQKTLLN IEGLGRDLDP NLDLWTTAKP
YLENWMAEQL GWRGLSRRLR KEATSWAVIM PQFPRLMHHA LTEIRTSALE EKMDQFILEK
KRETRRLTIF IVLLIIVILW HLGK