Gene Namu_4139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4139 
Symbol 
ID8449765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4575205 
End bp4576920 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content63% 
IMG OID645043188 
Productmalate dehydrogenase 
Protein accessionYP_003203417 
Protein GI258654261 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000025754 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000138599 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGTCG CCCCGTATGA GCTGGTCCAT CAACCGCGGG AGGTGGTGGC GCGGGTGCGT 
GCTCGTGGTC GCCTGGTGTT GTCCTCGCCG ACGATTAATC GGGGGACGGC GTTCACGTTG
GCGCAGCGGG AGCAGTTGGA GTTGACGGGG TTGTTGCCGA CGGGGGTGTC GACGTTGGAG
GGTCAGGTTC GGCGGGTGTG GGCGCAGTAT TTGCAGCAGC CGTCGGATTT GGCGAAGTGG
GTGTATTTGG CGAATTTGCG GGATCGTAAC GAGGTGTTGT TCTACCGGTT GTTGTCGGAG
CATTTGCCGG AGATGTTGCC GGTGGTGTAT ACGCCGACGG TGGGGACGGC GATCGAGCGG
TTCAGTCATG AGTTCCGGCG TAGTCGGGGT GTGTTTTTGT CGGTGGATCA TCCGGATCAG
GTGGAGACCG CGTTGCGGAA CACTGGTTTG GGTCCGGATG ATGTTGATTT GTTGGTCGCG
ACGGATTCCG AGGGGATTTT GGGGATCGGT GATCAGGGGA TCGGTGGTAT TGAGATCTCG
ATCGGGAAGT TGTCGGTGTA TACGGCGGCG GCGGGGATTC ATCCGCGGCG GGTGTTGCCG
GTGGTGTTGG ACATGGGTAC CGATAATTTG CGGTTGTTGA ATGATTCGAT GTATTTGGGT
GAGCGGCATG CGCGGGTGCG GGATCATCGG TATGACGAGT TGATCGATGC GTATGTGACG
GCGTGTAACA AGTTGTTCCC GAACGCGATG TTGCATTGGG AGGATTTCGG GACGGAGAAC
GCCCGCCGGA TTCTGAACAA GTATTCGGGG GTGTGTTGCA CGTTCAATGA TGATATGCAG
GGCACGGCGG CGGTGGTGTT GGCGGCGGTG TTCTCGGCGG TGCGGGCGGC GGGGTCGCGG
TTGGCTGATC AGCGGATCGT GATCCATGGG GCGGGTACGG CCGGGTTGGG GATCGCGGAC
ATGTTGCGGG ATCAGATGAT CCGGGAGGGG TTGTCGCCGG CGGAGGCGAC GGGCCGGTTC
TATGCGTTGG CCAAGCAGGG GTTGTTGGTT GATGACGATC CGTCGTTGTT GGATTTCCAG
GTGCCGTATG CGCGCTCGCG CGCCGAGGTG GCGGGGTGGC CGGCGGGTGC GGGTGGGGTC
GGGTTGGCCA CGGTGGTGTC GCGGGCGCGG CCGACGATTT TGATCGGGAC GTCGACGCAG
GCGGGGGCGT TCACGGAGTC GATCGTGCGG GAGATGGCTT CGTTCAATGC GCGGCCGATC
ATTTTGCCGT TGTCGAATCC GACGAGTAAG GCCGAGGCGT TGCCGCAGGA TCTGATCCAT
TGGACGGACG GGAAGGTGTT GACCGCGACG GGTAGTCCGT TCGAGCCGGT GCATTACAAG
GGGGTGGCGT ATCAGATTGC GCAGTCGAAC AATGCGTTGG TGTTTCCCGG GTTGGGGTTG
GGGGTGGCGG TGACGAAGGC GTCGCGGATC AGTGAGGGGA TGATCGCGGC GGCGGCGGAT
GCGGTGGCGG CGATGTCGGA TGCGCGCACG CCGGGGGCGA GTTTGTTGCC GCCGATGACG
GTGTTGCGGA CGGCGTCGGC GGCGGTGGCG ATCGCGGTGG CGAAGGCGGC CGATGCCGAG
GGGTTGGCGC GGGTGGAGCT GAGTAATCCG GTGCAGCAGG TGTATGACGC GATGTGGCAG
CCGGAGTATC CGCGGATCGA GCCGATCGAG GCCTGA
 
Protein sequence
MAVAPYELVH QPREVVARVR ARGRLVLSSP TINRGTAFTL AQREQLELTG LLPTGVSTLE 
GQVRRVWAQY LQQPSDLAKW VYLANLRDRN EVLFYRLLSE HLPEMLPVVY TPTVGTAIER
FSHEFRRSRG VFLSVDHPDQ VETALRNTGL GPDDVDLLVA TDSEGILGIG DQGIGGIEIS
IGKLSVYTAA AGIHPRRVLP VVLDMGTDNL RLLNDSMYLG ERHARVRDHR YDELIDAYVT
ACNKLFPNAM LHWEDFGTEN ARRILNKYSG VCCTFNDDMQ GTAAVVLAAV FSAVRAAGSR
LADQRIVIHG AGTAGLGIAD MLRDQMIREG LSPAEATGRF YALAKQGLLV DDDPSLLDFQ
VPYARSRAEV AGWPAGAGGV GLATVVSRAR PTILIGTSTQ AGAFTESIVR EMASFNARPI
ILPLSNPTSK AEALPQDLIH WTDGKVLTAT GSPFEPVHYK GVAYQIAQSN NALVFPGLGL
GVAVTKASRI SEGMIAAAAD AVAAMSDART PGASLLPPMT VLRTASAAVA IAVAKAADAE
GLARVELSNP VQQVYDAMWQ PEYPRIEPIE A