Gene Namu_4544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4544 
Symbol 
ID8450172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5052857 
End bp5054338 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content70% 
IMG OID645043585 
ProductBetaine-aldehyde dehydrogenase 
Protein accessionYP_003203812 
Protein GI258654656 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAACGA CGCTCACCGC CGCCCCCGAG AGGGCGATGC TGATCGGGGG CGAGTGGGTC 
GCCGGCGTGG ACGGGATCTG GACCGACGTG GTCTCCCCCG GCCGCCGCGG CACCGTGCTC
GCCCAGGTCC CCGAGGGCAG CGCCGAGGAC GCGGACCGGG CGGTCCGCGC GGCGCGGGCC
GCCTTCCCGG CCTGGCGGGC CCTGCACTTC AAGGACCGGC AGAAGATCCT GCTGCAGATC
GCCGACGCGC TCGAAGAGCA CGCCGAGGAG CTGGCGCAGC TCACCGCGGC CGATACCGGC
AATGCCCTGC GGACGCAGGC CCGTCCGGAG TCGCAGACCC TGGTCACCCT GTTCCGCTAC
TTCGGCGGCG TGGCCGGCGA GTTCAAGGGC ACCGTGCTGC CGGCCGGGGA CGACCAGCTG
CAGTACACCC GGCGCGAGCC GCTGGGCGTG GTGGCCGGGA TCCTGCCGTG GAACTCGCCG
CTGATGATCG CCGGGATGAA GACTCCCGCC GCGCTGGCGG CCGGAAACAC CCTGGTACTC
AAGACCGCCG AGGACGCGCC ACTGACCATC CTGCGGATGG CCGAGATCTG CTCGCAGTTC
CTGCCGCCGG GCGTGCTCAA CGTGATCACC GGCCGCGGCC CGGTGGTCGG CGAGGCGCTG
CTGGTGCACC CCGACGTGGA CAAGGTCTCG TTCACCGGAT CCACCGGGGT CGGCCGGCAC
GTCGCCCAGA CGGCCGGGCA GCGCCTGGCC CACGTGTCGC TGGAGCTGGG CGGCAAGAGC
CCGAACATCG TCTTCCCGGA CGCCGCGAGC GCCGAGAACA TCGAGGCCAC CGCGGCCGGC
GTCCTGCTGG CCATGCGGTT CACCCGACAG GGCCAATCCT GCACCGCCGG CTCCCGGCTG
TTCGTGCACG CGGACGTCTA CGACACCTTC CTGGCCGCCC TGGTGGAGAA GGTCTCCGCG
CTCAAGGTCG GCGACCCGCT GGACGAGGCG TCCGATATGG GCTCGATCAT CAATCAGAAG
CAGTACGAAT CGGTGCTCGG CTACATCGAG AGCGGCAAGG CCCAGCCCGG CGTGGACGTC
GCCCTGGACG GCACCACGCA GGACTTCTCC GGACTGGACG GCTACTACAC CGGGCCGACC
ATTCTCGGCT CGGTGGCCAA CGACTGGCGG ATCGCCCAGG AGGAGATCTT CGGCCCGGTA
CTGGTGGCCA TCCCCTGGCG GGACCGGGAC GAGGTCATCG CGATGGCCAA CGACTCGCAC
TACGGCCTGG GCGCGTACAT CTGGTCGAAC AACCTGACCG ACGCGCTGGA CACCGCGCAC
CGGGTCGAAT CCGGTTGGGT GCAGGTCAAC CAGGGCGGCG GGCAGGTCAT CGGCCAGTCC
TACGGCGGCT ACAAGTCCAG CGGCATCGGG CGCGAGTTCT CCATCGAGGG CGCGCTGGAG
TCGTTCACCC AGATCAAGCA GATCAACGTC AAGCTCGGCT GA
 
Protein sequence
MATTLTAAPE RAMLIGGEWV AGVDGIWTDV VSPGRRGTVL AQVPEGSAED ADRAVRAARA 
AFPAWRALHF KDRQKILLQI ADALEEHAEE LAQLTAADTG NALRTQARPE SQTLVTLFRY
FGGVAGEFKG TVLPAGDDQL QYTRREPLGV VAGILPWNSP LMIAGMKTPA ALAAGNTLVL
KTAEDAPLTI LRMAEICSQF LPPGVLNVIT GRGPVVGEAL LVHPDVDKVS FTGSTGVGRH
VAQTAGQRLA HVSLELGGKS PNIVFPDAAS AENIEATAAG VLLAMRFTRQ GQSCTAGSRL
FVHADVYDTF LAALVEKVSA LKVGDPLDEA SDMGSIINQK QYESVLGYIE SGKAQPGVDV
ALDGTTQDFS GLDGYYTGPT ILGSVANDWR IAQEEIFGPV LVAIPWRDRD EVIAMANDSH
YGLGAYIWSN NLTDALDTAH RVESGWVQVN QGGGQVIGQS YGGYKSSGIG REFSIEGALE
SFTQIKQINV KLG