Gene Namu_5037 details

Gene Information

Locus tag: Namu_5037
Symbol:
ID: 8450668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5621503
End bp: 5622612
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 75%
IMG OID: 645044073
Product: folate-binding protein YgfZ
Protein accession: YP_003204297
Protein GI: 258655141
COG category: [R] General function prediction only
COG ID: [COG0354] Predicted aminomethyltransferase related to GcvT
TIGRFAM ID: [TIGR03317] folate-binding protein YgfZ
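
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5621503, 5622612)
print(length)                  # 1110
print(protein_length(length))  # 369
```

Both values agree with the Gene Length and Protein Length fields of this record.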


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGACA TGGCGACACC GCAGGTCCGA TCCTTCCCCC CAGTGCCCTC CCCGCTGCTC 
GGCCTACCCG GGGCGGTGGC CGCCCGGGGC GCGGACGAGG GCGTGGCCTG GCACTACGGC
GATCCGACGG CCGAGCAGCG GGCGGCCGAA TCCGGCGCCG CCCTGATCGA CCACACCAAC
CGGGACGTGC TGGCCGTCAC CGGTGAGGAC CGGCTGACCT GGCTGCACAC GCTGAGCAGC
CAGCACCTGA CCGACCTGGC CGACGGGGCC AGCACCGAAG CCCTCTTCCT GTCCCCCAAC
GGTCACGTCG AGCACCACGC CGTGCTCACC CACCAGGACG GGGTCGTCTA CCTGGACACC
GAGCCCGGGG CCGGCGCCGC CCTGCTGGCC TTCCTGGACG GCATGCGGTT CTGGTCCAAG
GTCGAGGTGG CCCCGGCCGA CCTGGCCGTG CTGGCGCTGG CCGGCCCGAC CGCCGCCGAC
GTGGCCGGCC GGGCGCGCAA TGCCGAGCCC GGCCGCTCCG GCCCGGACGG CGGGTTCACC
CGGCGGTCCG CCGAAGGCCT GGATCTGGTC CTGCCCCGCG CCGCGGTCGG CGCGGTCGCG
CAGGAACTGC GGGCGGCCGG GGCGGTGCCG GCCGGCAGCT GGGCGGCCGA CGCGCTGCGC
ATCCCCACCC GCCGCCCCCG CTGGGGCGTG GACACCGACG AGAAGACCAT CCCCAACGAG
GTGAGCTGGC TGAGCACCGC CGTGCACCTG CACAAGGGGT GCTACCGCGG TCAGGAGACG
GTCGCCCGCG TGCACAACCT GGGCCGCCCG CCCCGCCGGC TGGTCATGCT CAACCTGGAC
GGCTCTGTCG GCACCCTGCC CGAGCCGGGC GAACCGGTCA CCAGCGGCGC CGGCCGGGCC
GTCGGCCGCC TGGGCACCAT CGCCCAGCAC CACGAACTCG GCCCGATCGC GCTGGCCCTG
ATCAAGCGGT CGGTGGAGGC CGGCACCCCG TTGCTGGTCG GCGGCATCGA CGCGGTCGTC
GACATCGACG ACCGGCTGGA CGAGGGCCAG CAGCAGACCC CGCTGTCGGC GATCGACCGG
CGCGCCTTCA CCCAGCTCCG GCGGAGCTGA
 
Protein sequence
MGDMATPQVR SFPPVPSPLL GLPGAVAARG ADEGVAWHYG DPTAEQRAAE SGAALIDHTN 
RDVLAVTGED RLTWLHTLSS QHLTDLADGA STEALFLSPN GHVEHHAVLT HQDGVVYLDT
EPGAGAALLA FLDGMRFWSK VEVAPADLAV LALAGPTAAD VAGRARNAEP GRSGPDGGFT
RRSAEGLDLV LPRAAVGAVA QELRAAGAVP AGSWAADALR IPTRRPRWGV DTDEKTIPNE
VSWLSTAVHL HKGCYRGQET VARVHNLGRP PRRLVMLNLD GSVGTLPEPG EPVTSGAGRA
VGRLGTIAQH HELGPIALAL IKRSVEAGTP LLVGGIDAVV DIDDRLDEGQ QQTPLSAIDR
RAFTQLRRS
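
The sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that and then compute a GC fraction and translate the coding sequence. It is an illustration, not the database's own pipeline. It assumes the standard amino acid assignments (translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs only in which codons may initiate translation), and it does not handle alternative start codons. All names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq_text)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq_text)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGGCGACA TGGCGACACC GCAGGTCCGA"
print(translate(first_blocks))    # MGDMATPQVR
print(gc_fraction("ATGGGCGACA"))  # 0.6
```

The translated fragment matches the first block of the protein sequence listed above. Passing the full gene sequence to the same functions would allow the reported GC content and protein sequence to be checked in the same way.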