Gene Namu_1840 details


Gene Information

Locus tag: Namu_1840
Symbol:
ID: 8447445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2023138
End bp: 2024742
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 68%
IMG OID: 645040969
Product: GMC oxidoreductase
Protein accession: YP_003201219
Protein GI: 258652063
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
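
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2023138
end_bp = 2024742
gene_length_bp = 1605
protein_length_aa = 534

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is made of whole codons and the last one is a stop codon.
codons, remainder = divmod(span_bp, 3)
coded_residues = codons - 1

print(span_bp == gene_length_bp)            # span matches the reported gene length
print(remainder == 0)                       # length is a multiple of three
print(coded_residues == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks print `True`: the span is 1605 bp, which is 535 codons, or 534 residues plus one stop codon.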


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.000489986
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.328535
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGAAT CCACCGACCA CGGCTCCACC GACTACGACT CCACCGATTA CGACTCCACC 
GATTACGACG TTGTCATCAT CGGTTCCGGC GCCGGGGGCG GCACGCTCGC CCACCGGCTG
GCGCCCTCCG GCAAGCGGAT CCTGATCCTG GAGCGGGGTG ACTGGCTGCC CCGGGAGGTG
CAGAACTGGG ATGCCACCGC GGTCTTCGTC GACAACCGGT ACGTCTCGGC CGACACCTGG
TACGACGCCG ACGGTAAGTC CTTCCAGCCG CAGATCCACT ACAACGTCGG CGGCGCCACC
AAGCTTTACG GCGCGGCGCT GTACCGGTTG CGGGAGAAGG ACTTCGGCGA ACTCATCCAC
TTCGACGGGA TCTCCCCCGC GTGGCCGGTG AGCTACGCCG ACTTCGAGCC GTACTACGCG
CAGGCCGAGC AGCTCTACCA GGTGCACGGG CAGCGGGGCG AGGATCCCAC CGAACCACCG
AGTTCGGGAC CGTATCTTTT TCCGGCGGTC TCGCACGAGG CGCGGATCCA GCAGCTGTAC
GACGATCTGC GGGCCAGCGG GCTGCATCCG TTCCACGCCC CGGCCGGCAT CATGCTCAAC
GAGGCGGACA TGGCCTACAG CCGGTGCATC CGCTGCGCCA CCTGCGACGG TTTCCCGTGC
CTGGTGCACG CCAAGTCCGA CGCCGAGGTG GTCGCGGTCC GCCCGGCCCT GACGCACCCT
AACGTCACGC TGATCCGCGG CGCCGAGGTG ATCCGGCTGG ACACCGACCT GACCGGACGC
TCGGTCACCG ACGTGGTGGC CATGATCGGC GGCGAGCGGC ACCGCTTCCA CGGTTCGATC
GTGGTGGTCA GCGCCGGCGC GGCCAACTCG GCCAAGCTGT TGCTGCGCAG CGCCTCCGAC
CGGCATCCGA ACGGGCTGGC CAACGGTTCG GACCAGGTCG GGCGCAACTA CGTCTTCCAC
AACAGCCGGG CGTTCCTGGC CGTGTCGACC GAGCGCAACG ACACCCGCTT CCAGAAGACC
CTGGGGGTCA ACGACTTCTA CTTCGGCGAC GACGAGTTCG ACTACCCGAT GGGCAACATC
CAGATGGTCG GCAAGAGCTC GGCGCCGATG TACCGGGGCG AGAAGCCACT GGAGACCGCC
CTGGCCCCCT CCTTCGCCCT GTCCGACGTG GCCGTGCACG CGGTGGATTT CTGGCTGTCC
ACCGAGGATC TGCCTCGGCC GGAGAACCGG GTCACGCTGG CCGCCGACGG GAACATCACC
CTGTCCTACA CGCCGAACAA CACCAAGCCG CTGGACGAGC TCTACCACCG GATCAAGCGC
CGGCTGAGCC ATCTCGGGCT GAACCCGCAT CACCTGATCC CGCGTTCGGC CTACATGAAG
AACGACATCC CGATCGCCGG GGTGGCCCAC CAGGCCGGTA CCTGCCGTTT CGGCAGCGAT
CCGGCCGACT CGGTGCTGGA CACCGACTGC AAGGCCCACG AGCTGGACAA CCTGTACGTG
GTGGACACCA GCTTCTTTCC CTCGATCGGT GCGGTGAACC CGGCGCTGAC CGCGGCGGCC
AACGCGTTGC GGGTGGGCGA CCACCTGCTG GACCGGCTGG GCTGA
 
Protein sequence
MPESTDHGST DYDSTDYDST DYDVVIIGSG AGGGTLAHRL APSGKRILIL ERGDWLPREV 
QNWDATAVFV DNRYVSADTW YDADGKSFQP QIHYNVGGAT KLYGAALYRL REKDFGELIH
FDGISPAWPV SYADFEPYYA QAEQLYQVHG QRGEDPTEPP SSGPYLFPAV SHEARIQQLY
DDLRASGLHP FHAPAGIMLN EADMAYSRCI RCATCDGFPC LVHAKSDAEV VAVRPALTHP
NVTLIRGAEV IRLDTDLTGR SVTDVVAMIG GERHRFHGSI VVVSAGAANS AKLLLRSASD
RHPNGLANGS DQVGRNYVFH NSRAFLAVST ERNDTRFQKT LGVNDFYFGD DEFDYPMGNI
QMVGKSSAPM YRGEKPLETA LAPSFALSDV AVHAVDFWLS TEDLPRPENR VTLAADGNIT
LSYTPNNTKP LDELYHRIKR RLSHLGLNPH HLIPRSAYMK NDIPIAGVAH QAGTCRFGSD
PADSVLDTDC KAHELDNLYV VDTSFFPSIG AVNPALTAAA NALRVGDHLL DRLG
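
The gene sequence starts with GTG while the protein starts with M. This is consistent with translation table 11, where GTG is one of the alternative initiation codons and is read as methionine at the first position. The sketch below illustrates this on the first and last lines of the gene sequence. The function `translate` and its `has_start` parameter are hypothetical helpers written for this example. The codon-to-amino-acid assignments are those of the standard code, which table 11 shares, and the set of start codons is listed here as an assumption about table 11.

```python
from itertools import product

# Standard codon assignments in TCAG order (table 11 uses the same assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons recognised under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, has_start=True):
    """Translate a DNA string, ignoring whitespace, and stop at the first stop codon.

    If has_start is True and the first codon is an initiation codon,
    it is translated as M regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above (each full line holds 20 codons).
first_line = "GTGCCCGAAT CCACCGACCA CGGCTCCACC GACTACGACT CCACCGATTA CGACTCCACC"
last_line = "AACGCGTTGC GGGTGGGCGA CCACCTGCTG GACCGGCTGG GCTGA"

print(translate(first_line))                  # MPESTDHGSTDYDSTDYDST
print(translate(last_line, has_start=False))  # NALRVGDHLLDRLG
```

Both outputs match the corresponding ends of the protein sequence above, and the final TGA is read as the stop codon.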