Gene Nmul_A1938 details

Gene Information

Locus tag: Nmul_A1938
Symbol:
ID: 3784234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2228720
End bp: 2229829
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 56%
IMG OID: 637812024
Product: galactokinase
Protein accession: YP_412625
Protein GI: 82703059
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID: [TIGR00131] galactokinase
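
The two length fields above are consistent with the coordinates: counting both end positions, the gene spans End bp - Start bp + 1 bases, and a complete CDS encodes one residue fewer than it has codons because the stop codon is not translated. A minimal Python sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length(2228720, 2229829)
print(length, protein_length(length))  # 1110 369
```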


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.491701
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCAGTT TCGAGTCTGT ATTTGGAGAT TCGCCTGAGT TTGTGGCACG CGCCCCAGGC 
CGCGTTAACC TGCTGGGAGA GCATACTGAC TACAACGGAG GTTATGTACT GCCGATTGCA
ATCGAGCAGC AGACGAGTGT CAGCATGAGC CATAGCAACA GCAAGCAATA CGCATTGTAT
TCGGAAATGC TCGATGACAT CGTTTACTTT ACTCTCGGCA AATCTCCAAC TGAGCACTTC
GCCACCTACG TATATGGATG TTTGATGGAA GCTCGCGCGG TTGGGATGAA GGTGCCGGTG
CTGGATATCT ACATTCAATC AGATGTGCCG ATGGGAGCAG GGCTATCCTC CAGTGCAGCG
CTTGAGGTAG CGACGTTGCG AGTCCTGCGT GCGCTGACCG GTTTCCCCCT GGATGATGTG
CAAATCGCAC AACTTGCTCA GCGCGCAGAA ATCCAGTACG CAGGAGTGCG CTGTGGCATC
ATGGATCAGA TGGCATCGAG CCTTGCGGGA ACCAGAGAGG CCTTGCTGCT GGATACCCTT
ACGCTTGAGC GGCGCCTGGT TCCCTTACCG CCTGCTTCGG CTGTGCTGGT ACTTGATTCA
GGCGTTGTCC GCACGCTTGC GACAAGCGGT TACAATCAGC GCCGGACAGA GTGTGAAGAG
GCGGCACATC AACTTGGAGT GCAATCACTT CGCGAAGTCC ATGACATTTC ACTGGCGGAC
GCTTTGCCTG AGCCGCTGGG CCGCCGGGTA CGCCATATCG TAAGCGAGAA CGCGCGCGTG
CTGCGGGCGG CGGAATGCAA CAACGCCGCG GAGTTCGGGA TGCTGATGAA CGCATCCCAT
ACCAGTTTGC GCGACGATTA TGAGGTCTCG GTTCCCCAAC TTGATCAATT GGTGACCCTG
CTTCAGGCTC ATCCTGATGT TTATGGAGCG CGCTTGACAG GAGCAGGCTT TGGAGGCGCG
TGCGTCGCCT TATGCAAGCC GGAGTCCTTG CATCAGATTT CCGAAGCGGT GCTGCAAGAT
TATTCAAGCA TGGGCTTGAA GGGACGCATC CTTGTGCCAC CACATTGGGT CGATAGAACT
GCTACTGTAG GGGGAGGTTC AGCATCTTGA
 
Protein sequence
MSSFESVFGD SPEFVARAPG RVNLLGEHTD YNGGYVLPIA IEQQTSVSMS HSNSKQYALY 
SEMLDDIVYF TLGKSPTEHF ATYVYGCLME ARAVGMKVPV LDIYIQSDVP MGAGLSSSAA
LEVATLRVLR ALTGFPLDDV QIAQLAQRAE IQYAGVRCGI MDQMASSLAG TREALLLDTL
TLERRLVPLP PASAVLVLDS GVVRTLATSG YNQRRTECEE AAHQLGVQSL REVHDISLAD
ALPEPLGRRV RHIVSENARV LRAAECNNAA EFGMLMNASH TSLRDDYEVS VPQLDQLVTL
LQAHPDVYGA RLTGAGFGGA CVALCKPESL HQISEAVLQD YSSMGLKGRI LVPPHWVDRT
ATVGGGSAS
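
Both sequences are displayed in space-separated blocks of 10 characters, so the spaces and line breaks have to be removed before the sequences are used elsewhere. The sketch below shows one way to do that in Python and to compute a GC fraction from the result. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first display line of the gene sequence is used as a short demonstration.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First display line of the gene sequence, copied from the record.
first_line = "GTGAGCAGTT TCGAGTCTGT ATTTGGAGAT TCGCCTGAGT TTGTGGCACG CGCCCCAGGC"
dna = join_blocks(first_line)
print(len(dna))   # 60
print(dna[:3])    # GTG
print(gc_fraction(dna))
```

The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is one of the permitted alternative initiation codons, and an initiation codon is translated as methionine.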