Gene Namu_4897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4897 
Symbol 
ID8450527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5462559 
End bp5464061 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content73% 
IMG OID645043935 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003204160 
Protein GI258655004 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACA CCACCACAGC TCCCGCGCAC CTGGCCGACC CGGGCGTCCG GCCGCACGCG 
CGGCCCACGC GGTCGATCGT GAACCCGGCC ACCGGCGAGG TCATCGCCAC CGTCCCGGAA
CAGGGCGCGG CCGACGTCGA CCGGGCCGTG GCGACCGCCA AGGAGGCTTT CGAGACCGGC
CCGTGGCCGA CGATGGTGCG TAGCAAACGA GCCCGGCTGC TGCTCACGCT GGCCGACGCC
ATCGAGGCCA ACTCGGCCCG GCTCTACGAG TTGGAGACCC GCAACAACGG GCGGCCGATC
ACCGAGACCC GGGCCCAGCT GTCCCGGGTG CCCGAATGGT TCCGCTACAA CGCCGGCCTG
CTGGCCGCGC AGCGCGACGC CGTGCTGCCC GGCGACGGCG AGTACCTGAC CTACCAGCGC
CGGGCCCCGC TCGGCGTCTG CGGGATCATC ACGCCGTTCA ACCACCCGAT GCTGATCCTG
GCCCGCAGCC TGTCCGCCGC GCTGGCCACC GGCAACACGG TCGTGGTCAA GCCCTCCGAA
CTCACCCCGC TCACCACCCT GGCCCTGGTC GAGATCCTGC ACGCCGCAGG TCTGCCGGCC
GGGGTGGTGA GCGTGGTGAC CGGCAGTCGG GAGGCGGGTG AACGCCTCAC CCGCCATCCC
GACGTCGCCA AGATCACCCT GACCGGGGGC ACCGAGGCCG GGCGGTCGGC CGCGCTGGCC
ACCGCGGCCC GGTTCGCCCG GGTCACCGCC GAGCTCGGCG GCAAGACCCC CATCGTCGTA
TTCGACGACA TCGACTCGGT GGTGGCGGCG CAGGGCGCGG CCTTCGCCGC GTTCGTCGCC
GCCGGCCAGT CCTGCGTCGC CGGTTCCCGC TTCCTGGTGC AGCGCGGCAG CTACGACGCC
TTCGTCGACG CCCTGGCCAC CCGGGCCGAC GCCATCCGGA TCGGCGACCC GGCCGCCCCC
GGCACCCAGC TCGGCCCGCT GATCAGCGCG GCCCAGCGGG ACAAGGCGCT GCGGTACGTG
CAGATCGGGC TGGACGAGGG GGCCCGGCTG GTCGCCGGCG GCACTGTCCC CGACCTGCCC
GGCCCGCTGA ACCAGGGCTT TTACCTGCGG CCGACCGTGC TGGCCGACGC CACCAACGAC
ATGCGTATCG CCCAGGAGGA GATCTTCGGG CCGGTGGCCA CCGTGGTCCC GTTCGACACC
GAGGCCGACG CGATCGCCAT GGCCAACGAC AACCGGTTCG CCCTGGGCGC CGGCGTCTGG
ACCCGCGATC TGGCCCGTGG CCACCGCGTC GCCGACCGGA TCACCGCCGG CATGGTCTGG
GTCAACGACC ACCACCGGCT CGAGCCCTCC CTGCCGTGGG GCGGGGTCAA GGAGTCCGGC
CTCGGCAAGG ACGCCGGCAC CGAATCGTTC GACGACTTCA CCTGGATCAA GACCGTCGTG
GTGCGCACCG CCGCCGACGA CGTCGACTGG TACGGCCAGG CCGACCCCGG CCGACTCAAC
TGA
 
Protein sequence
MTDTTTAPAH LADPGVRPHA RPTRSIVNPA TGEVIATVPE QGAADVDRAV ATAKEAFETG 
PWPTMVRSKR ARLLLTLADA IEANSARLYE LETRNNGRPI TETRAQLSRV PEWFRYNAGL
LAAQRDAVLP GDGEYLTYQR RAPLGVCGII TPFNHPMLIL ARSLSAALAT GNTVVVKPSE
LTPLTTLALV EILHAAGLPA GVVSVVTGSR EAGERLTRHP DVAKITLTGG TEAGRSAALA
TAARFARVTA ELGGKTPIVV FDDIDSVVAA QGAAFAAFVA AGQSCVAGSR FLVQRGSYDA
FVDALATRAD AIRIGDPAAP GTQLGPLISA AQRDKALRYV QIGLDEGARL VAGGTVPDLP
GPLNQGFYLR PTVLADATND MRIAQEEIFG PVATVVPFDT EADAIAMAND NRFALGAGVW
TRDLARGHRV ADRITAGMVW VNDHHRLEPS LPWGGVKESG LGKDAGTESF DDFTWIKTVV
VRTAADDVDW YGQADPGRLN