Gene Ndas_3248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3248 
Symbol 
ID9247105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3882244 
End bp3883674 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content72% 
IMG OID 
ProductBetaine-aldehyde dehydrogenase 
Protein accessionYP_003681160 
Protein GI297562186 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCTTCG AGTACGCGCC CGCACCGGAG TCCCGCTCCG TCGTCACCCT GCGCGAGAAC 
TACGAGCTGT TCGTCGACGG CGCCTTCTCC CCGGCCGAGG GCGGCGAGTA CCTCACCACC
CTCAACCCCG CCGACGAGAC CGAGCTCGCC CGGGTCGCCG TCGCCGGACC CGCCGACGTG
GACCGGGCCG TCGCCGCCGC CCGCCGCGCC TTCGACACCG TCTGGGGGCC GATGCCCGGG
GCGGAGCGCG GCAAGTACCT GTTCCGCATC GCCCGCATCA TCCAGGAGCG CTCGCGCGAG
CTGGCCGTGC TGGAGTCGAT GGACAACGGC AAGCCCATCA AGGAGACGCG CGACGTCGAC
CTGCCGCTGG TCGCCGCGCA CTTCTTCTAC CACGCGGGGT GGGCCGACAA GCTCGCCCAC
GCCGGGCTGG GCCCCGACCC GCGCCCGCTG GGCGTGGCCG CCCAGGTCAT CCCGTGGAAC
TTCCCGCTGC TCATGCTCGC GTGGAAGATC GCGCCGGCGC TGGCCACCGG CAACACCGTC
GTGCTCAAGC CCGCCGAGAC CACCCCGCTC ACCGCGCTGG CCTTCGCCGA GATCTGCCAG
GAGGCCGACC TGCCCCCGGG CGTGGTCAAC ATCCTCACCG GCGCGGGCGA GACCGGCCGG
GCCCTCGTGG AGCACCCGGG CGCCGACAAG GTGGCCTTCA CCGGCTCCAC CGGGGTGGGC
CGCCAGATCG CCCGCTCGGT CGCGGGCACC GGCAAGCGCC TCACCCTGGA GCTGGGCGGC
AAGGGCGCCA ACATCGTCTA CGACGACGCG GCCCTGGACC AGGCGGTCGA GGGCGTCGTC
TCGGGCATCT TCTTCAACCA GGGCCACGTG TGCTGCGCGG GTTCGCGGCT GCTGGTCCAG
GAGTCGATCG CCGAGGAACT GCTGCCCCGG CTCAAGGAGC GCATCGCCAA GCTGCGCCTG
GGCGACCCGC TGGACAAGAA CACCGACATC GGCGCGATCA ACTCCGCCGC GCAGCTGGAG
CGCATCCGCG AACTCACCGA CACCGGCGAG GAGGAGGGCG CCGAGCGCTG GTCGCCCGCC
TGCGAGCTGC CCGCGAGCGG CTACTGGTTC GCGCCCACCG TCCTGACCGG CGTCAGCCAG
TCCCACCGCG TGGCCCGCGA GGAGATCTTC GGCCCCGTGC TCTCGGTGCT GACCTTCCGC
ACCCCCGAGG AGGCCGTGGC CAAGGCCAAC AACACCCCCT ACGGCCTCTC CGCCGGGGTG
TGGACCGAGA AGGGCTCGCG CATGCTCTGG ACCGCCGAGC GCCTGCGCGC CGGGGTGATC
TGGTCCAACA CGTTCAACAA GTTCGACCCG ACCAGCCCCT TCGGCGGCTA CAAGGAGTCG
GGATACGGCC GCGAGGGCGG GCGGCACGGA TTGGAGGCCT ACCTTGGCTG A
 
Protein sequence
MIFEYAPAPE SRSVVTLREN YELFVDGAFS PAEGGEYLTT LNPADETELA RVAVAGPADV 
DRAVAAARRA FDTVWGPMPG AERGKYLFRI ARIIQERSRE LAVLESMDNG KPIKETRDVD
LPLVAAHFFY HAGWADKLAH AGLGPDPRPL GVAAQVIPWN FPLLMLAWKI APALATGNTV
VLKPAETTPL TALAFAEICQ EADLPPGVVN ILTGAGETGR ALVEHPGADK VAFTGSTGVG
RQIARSVAGT GKRLTLELGG KGANIVYDDA ALDQAVEGVV SGIFFNQGHV CCAGSRLLVQ
ESIAEELLPR LKERIAKLRL GDPLDKNTDI GAINSAAQLE RIRELTDTGE EEGAERWSPA
CELPASGYWF APTVLTGVSQ SHRVAREEIF GPVLSVLTFR TPEEAVAKAN NTPYGLSAGV
WTEKGSRMLW TAERLRAGVI WSNTFNKFDP TSPFGGYKES GYGREGGRHG LEAYLG