Gene Namu_5121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5121 
Symbol 
ID8450752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5706190 
End bp5707782 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content75% 
IMG OID645044156 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_003204380 
Protein GI258655224 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGCCC AGGGCTTCGA CGTCGTCATC GTCGGTGCCG GGTCGGCCGG ATGTGCGTTG 
GCCGCGCGTC TTTCGGCCGA CGAGTCGTGC ACGGTGCTGC TGCTGGAGGC CGGAAGCGGC
CGGTGGCGAC CCGAGTCGCG GGTGCCGGCG CTGTATTCGC GGCTGTTCCG GACCGCGGCG
GACTGGGCCT ACCGCACCGA ACCGCAGCCC GAGTTGAACG GCCGGCGGCT GTACTGGCCG
CGCGGCCGGA TGCTCGGGGG CAGTTCCACG ATGAACGACA TGGTCTACGT CCGCGGCAAC
GCCGCCGACT TCGACGGGTG GGCGGCCGCC GGCAACCCGG GCTGGGACCA CGCCGGCCTG
CTGCCCGCCT TCGAGGCGGC CGAGGCGCAG CTGTGGCCGG ACGGCGATCC CGACCGGGGT
GAGCATCGGT GGCGGGCGCC GCGCACCGCC GACTTCCTGG CTGCCGCGGA GCGCGCCGGG
CTGGTCCGGA ACCCGGACCT CAACGGGCCC GGGCAGGACG GTGTCGGCCG GCATCGGGTG
GCCCAGCGGC GGGGGGTCCG GTGCAGCGCG GCCGACGCCT ACCTGCGCCC GATCGCCGCC
CGGCCGAACC TGACCGTCGT CACCGGGGCC CAGGTCACCG GCTTGGTGTT CTGCGGCCCG
CGGGTCGTCG GGGTGCGGTG GCTTCGTCGG GGCCGGGCCG AGTACGCGCG GGCCGGCAGC
GAGGTCGTGC TGTGCGGCGG GGCGATCAAC ACCCCGGCCC TGTTGCTGGC CTCGGGCATC
GGCGACGGCG CCGACCTGCA CCGGCTGGGC ATCCCGGTGC GGGCCGACCT TCCGGGCGTG
GGCCGGAACC TGCAGGACCA TCTGATGATC CCGATGTGCT GGCGGGCCGC CGAGCCGACC
AGCCTGCTCG ACGGCCGGCG TCCGGTCAAC GTCGCCCACT ACCTGCTGCA CCGGCGCGGC
CCGCTGACCT CCAACATCGG GCAGGCCGGC GGATTCGTCC GGAGCCGGCC CGGGCTGGCC
GCCCCGGACG TGCAATTGGT CTTCGCGCCG GTGTTGCTGG ACGGCATCCG GGACGAACGG
GTCAGCGAAC CTCGCGAGCA CGGCTATTCG ATCGGCGCGG TGCTGCTCCA GCCCGGCAGC
CGCGGCCGGA TCACCCTGCG CCGCGCCGAC CCGCTGGCCC GGCCGGTCAT CGACCCCGGG
TACCTGTCCG ACCCGGCAGA CCTGGACACG CTGGTCCGCG GGGTTCGGCT GGCCCTGCGC
ATCGGCGCGA CGGGACCGCT GGCCGGCGCG GCGCGCGCCC CGCACCCGTT GACCGACGCC
GGCGACGACG CGGTGATCCG GGCCATCCGG GCCGGGGTGG ACACCATGTT CCACCCGGTC
GGCACCTGCC GGATGGGTCC GGCGGCCGAC CCCGGGGCGG TGGTCGACCC GACGCTGGCC
GTGCACGGCG TCGACGGGTT GCGGGTGGCC GACGCGTCGG TGATGCCGAC CATCACCCGC
GGCAACACCC ACGCCCCGAC GACGGCCATC GCCGAACGCG CCGCCATGCT TCTGCGGGGT
CAGCCGCAAC CGTCGGTGGC GGGTCGGTCA TGA
 
Protein sequence
MRAQGFDVVI VGAGSAGCAL AARLSADESC TVLLLEAGSG RWRPESRVPA LYSRLFRTAA 
DWAYRTEPQP ELNGRRLYWP RGRMLGGSST MNDMVYVRGN AADFDGWAAA GNPGWDHAGL
LPAFEAAEAQ LWPDGDPDRG EHRWRAPRTA DFLAAAERAG LVRNPDLNGP GQDGVGRHRV
AQRRGVRCSA ADAYLRPIAA RPNLTVVTGA QVTGLVFCGP RVVGVRWLRR GRAEYARAGS
EVVLCGGAIN TPALLLASGI GDGADLHRLG IPVRADLPGV GRNLQDHLMI PMCWRAAEPT
SLLDGRRPVN VAHYLLHRRG PLTSNIGQAG GFVRSRPGLA APDVQLVFAP VLLDGIRDER
VSEPREHGYS IGAVLLQPGS RGRITLRRAD PLARPVIDPG YLSDPADLDT LVRGVRLALR
IGATGPLAGA ARAPHPLTDA GDDAVIRAIR AGVDTMFHPV GTCRMGPAAD PGAVVDPTLA
VHGVDGLRVA DASVMPTITR GNTHAPTTAI AERAAMLLRG QPQPSVAGRS