Gene Namu_3683 details


Gene Information

Locus tag: Namu_3683
Symbol:
ID: 8449302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4040847
End bp: 4042517
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 70%
IMG OID: 645042747
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_003202983
Protein GI: 258653827
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
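
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. Neither convention is stated in the record, but both are consistent with the listed values.

```python
# Values copied from the Gene Information table.
start_bp = 4040847
end_bp = 4042517
gene_length_bp = 1671
protein_length_aa = 556

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein length.
codons = span_bp // 3
coding_codons = codons - 1

print("span (bp):", span_bp)
print("whole codons:", codons)
print("codons excluding stop:", coding_codons)
print("matches gene length:", span_bp == gene_length_bp)
print("matches protein length:", coding_codons == protein_length_aa)
```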


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.642564
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0444916
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTCT ACGGGAACAT CCGGAGCCGC AACGAATCGG CCTGGCTGCT GCCGACCGAC 
GGGTCCCGGA CCGATCACCG GCTACGCGCC GACATGCGCC GCTACGCCGA CCACGACGAG
GTCGACCTGG TGGTGGTCGG AGCCGGCGCC GGCGGCAGTG TCCTGACGCA GCGACTGGCC
CGCGCGGGAT GGAGCGTGAT CTGCCTGGAC GCCGGGCCGT TCTGGGACCC CGACGCCGAC
TGGGTCAGCG ACGAACGCGC CTCGCACACC CTGTACTGGA CCGACCCGCG GCAGATCGGC
GGCGCCGACC CCGTCCCGTT GGGCTCCAAC AACTCCGGCC GCGGCGTCGG CGGCTCCATG
ATCCACTACG CCGGATACAC ACCCCGCTTC CACCCATCCG ATTTTCGAAC CCGCACCACC
GAAGGAGTCG GAGCCGACTG GCCCATCGCC TACAGCGACC TCCGGCCGCA CTACGAACAA
CTTGAAGCCG AACTGCCCGT CGCCGGCCAG GACTGGCCCT GGGGTGACCC GCACGGCTAC
CCCCACCACC CGCACCGAGT GTCCGGCAAC GGGGAGATCT TCCTCCGCGG CGCCGCCGCT
GCCGGCATCA CCGCCCGGGT CGGACCCGTC GCGATCACCA ACGGCCGCTT CGGGAACCGC
CCGCACTGCA TCTACCGAGG TTTCTGCCTG CAGGGCTGCA AGGTCAACGC CAAGGCCAGC
CCGCTGATCA CCCACATCCC CGACGCCCTG GCGCACGGCG CGGAAATCCG ACCCGACAGC
CACGTCAGCC GGGTCCTGGT CGACGACCGC ACCGGCCGGG TCACCGGAGT CACCTATCTC
CGGGCCGGAG TGGAGCACCG GCAATGGGCC AGGGCCGTCG CCGTGGCCGG CTACAGCATC
GAGACCCCGC GGCTGCTGCT GCTCTCCGCC TCCCCCCGGT TCCCCGACGG GCTCGGCAAC
GACCACGACC AGGTCGGCCG CTACCTGATG GTCCAGGGCG CCCCGCAAAC CGCCGGCCGG
TTCGACGACG AGATCCGGAT GTACAAGGCA CCGCCGCCGG AAGTGAGCAG CGAACAATTC
TACGAAACCG ACCCCGGCAA GCCCTACCGA CGCGGATGGT CCATCCAAAC CGTCAGCCCA
CTGCCGATCA CCTGGGCCGA ACACGTTACT GCGCAGGGAC ACTGGGGTGA ACCGTTGCGG
GAATACATGC GCGACTACGT GCATTGGGCC ACCCTCGGCG CGCTCTGCGA ATTCCTCCCC
GATCCCGACA ACCGCGTCAC CCTGGCCGAG GAAAAGGATC GGCACGGGCT GCCCGTCGCG
CACTTCGCCT ACACCCAGAC CAGCAACGAC CGGCTGCTGA TGCGCGCCGC GCAGGACTCG
ATGGAGACGA TTCTGCATGC GGCCGGAGCA GGCGAGGTCA TCACCATCGA CCGGTACGCC
CACCTCGTCG GCGGCGCGCG GATGGCCGAC CGACCCCAGG ACGGGGTCGT CGACGCCGAC
CACCGGGTGT TCGGCGTCCC CAACCTGTTC ATCGTCGACG GCAGTGTTCT GCCCACCCAG
GGCGCTGCCA ACCCCGCCCT GACCATCATG GCGCTGGCCG CCCGCGCCGC ACACCGGCTG
ACCACCCGGC GGGTCCACGC GGCCGCTGCG GCGCCGGCAG GCGCGTCGTG A
 
Protein sequence
MTFYGNIRSR NESAWLLPTD GSRTDHRLRA DMRRYADHDE VDLVVVGAGA GGSVLTQRLA 
RAGWSVICLD AGPFWDPDAD WVSDERASHT LYWTDPRQIG GADPVPLGSN NSGRGVGGSM
IHYAGYTPRF HPSDFRTRTT EGVGADWPIA YSDLRPHYEQ LEAELPVAGQ DWPWGDPHGY
PHHPHRVSGN GEIFLRGAAA AGITARVGPV AITNGRFGNR PHCIYRGFCL QGCKVNAKAS
PLITHIPDAL AHGAEIRPDS HVSRVLVDDR TGRVTGVTYL RAGVEHRQWA RAVAVAGYSI
ETPRLLLLSA SPRFPDGLGN DHDQVGRYLM VQGAPQTAGR FDDEIRMYKA PPPEVSSEQF
YETDPGKPYR RGWSIQTVSP LPITWAEHVT AQGHWGEPLR EYMRDYVHWA TLGALCEFLP
DPDNRVTLAE EKDRHGLPVA HFAYTQTSND RLLMRAAQDS METILHAAGA GEVITIDRYA
HLVGGARMAD RPQDGVVDAD HRVFGVPNLF IVDGSVLPTQ GAANPALTIM ALAARAAHRL
TTRRVHAAAA APAGAS
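
The sequences above are printed in space-separated blocks of ten residues, so the whitespace has to be removed before they can be measured or compared with the table values (for example the 1671 bp gene length or the 70% GC content). The following is a minimal sketch. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGACCTTCT ACGGGAACAT CCGGAGCCGC AACGAATCGG CCTGGCTGCT GCCGACCGAC"

print("bases on first line:", len(clean_sequence(first_line)))
print("GC fraction of first line:", round(gc_fraction(first_line), 3))
```

Passing the full gene sequence text to the same functions would give the length and GC fraction for comparison with the Gene Information values.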