Gene Nmul_A1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1107 
Symbol 
ID3785687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1275224 
End bp1276213 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content55% 
IMG OID637811192 
Productserine/threonine protein kinase 
Protein accessionYP_411802 
Protein GI82702236 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGAACAG CAGATAATTC GGCTCCAAAT GATTTCTCCA CGCTTTCTCC TGAACGCGTG 
TTGCATGCTT TGGAAAGCTT GGGTTTCCAC AGCGATGGAC GGCTGCTGGC ACTCAACAGC
TACGAGAACC GCGTCTACCA AATCGGCCTC GAAAACGGTG CGCCAGTCAT AGCAAAATTC
TACCGCCCGG AACGCTGGAC GAACAACGCC ATTCTCGAGG AACACGCGTT CGTGCGGGAA
CTGGCCGAGC ACGAAATTCC TGTAGTACCC CCATTGGTGC TGCAAGGAAT ATCGCTGCAT
TATTTTGAGG GATTTCGTTT CACTGTTTTT CCAAGGCATG GTGGTCGCGC GCCCGAACTG
GAAGATCCCC ATACCCTGGA ATGGATGGGG CGCTTCCTAG GACGTATCCA TGCGGTTGGC
GCACTGAATC CCTTTCTTGA ACGCCCGGAA TTGAATATCG CCAACTTCGG CGAACAACCC
CGCGACTATC TGTTGGCACA TGGATTCGTT CCGCCTGATA TTGAGGCTGC CTATCGCAGT
GCCGTGAATC AGGCGTTAGA CAGTGCACGG CACTGTTTTG GACGCGCAGG TAAAGTACGC
GCGTTACGCC TGCACGGGGA CTGTCATGCA GGCAATGTTT TGTGGACTGA CGATGGACCG
CACTTCGTCG ATTTTGACGA CAGCCGCATG GGACCAGCAG TACAGGACTT GTGGATGCTG
TTATCCGGCG AACGGGCCGA CATGAGGAAG CAGTTGGACA GCGTGCTGGC CGGGTATGAA
AACTTCTTCG ATTTCGACGA AAGGGAATTG CATCTGGTCG AGGCGTTACG CACTCTGCGC
TTGATCCACT ACGCGGCGTG GCTTGCACAG CGATGGGACG ACCCTGCTTT CAAGCGAGCG
TTTCCCTGGT TCAACACCCA ACGCTACTGG CAGGATCGCA TTCTCGAATT GCGGGAGCAG
ATCGCCCTTA TGGATGAACC GCCGCTATGA
 
Protein sequence
MGTADNSAPN DFSTLSPERV LHALESLGFH SDGRLLALNS YENRVYQIGL ENGAPVIAKF 
YRPERWTNNA ILEEHAFVRE LAEHEIPVVP PLVLQGISLH YFEGFRFTVF PRHGGRAPEL
EDPHTLEWMG RFLGRIHAVG ALNPFLERPE LNIANFGEQP RDYLLAHGFV PPDIEAAYRS
AVNQALDSAR HCFGRAGKVR ALRLHGDCHA GNVLWTDDGP HFVDFDDSRM GPAVQDLWML
LSGERADMRK QLDSVLAGYE NFFDFDEREL HLVEALRTLR LIHYAAWLAQ RWDDPAFKRA
FPWFNTQRYW QDRILELREQ IALMDEPPL