Gene Noca_4188 details

Gene Information

Locus tag: Noca_4188
Symbol:
ID: 4596702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4426848
End bp: 4428206
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 67%
IMG OID: 639778794
Product: aldehyde dehydrogenase
Protein accession: YP_925372
Protein GI: 119718407
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
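
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4426848
end_bp = 4428206

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1359
print(gene_length % 3)  # 0
print(protein_length)   # 452
```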


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTACGCCG TCACCAACCC GGCCACGGGC GAGCTGATCA GCGAGTTCGA CACGGCAACC 
GACGCGCAGG TTCGCGAGGC CGTCAGCCGC GCCGACCTCG CCTTCCAGTC CTGGAAGAGC
ACGCCGCTGG AGGAGCGCTC CCGGACCCTG GCTCGGGCCG CAGACCTGTT CCTCGAGCGC
AGCGACGAGT TGGCCCGCGC CATCACCCAG GAGATGGGCA AGCGCCTCGA GGAGAGCCGG
GGGGAGGTAC GCATCGCCTC GGACATCTTC CATTACTACT CCGACAACGC TCCGAAGCTG
CTCGCGGACG AAACCATCGC CATCCAGGGC GGCGAGGCCA AGATCCTCAA GCGGCCTGTC
GGGGTGCTGC TCGGGATCAT GCCGTGGAAC TACCCGTACT ACCAGGTCGC CCGCTTCGCG
GCGCCCAACC TGGTGCTGGG CAACACGATC ATCCTCAAGC ACGCGCCGTC CTGTCCGCAG
TCCTCGGCCC TGGTCGAGCA GCTCTTGCAC GACGCCGGGG TACCGGTTGA CGCCTACATC
AACGTCTACG CCACCAACGA GCAGGTCGCC TGGGCCCTCG CTGATCCGCG CATCCAGGGC
GTATCCGTCA CTGGCAGCGA GCGAGCCGGC GCGGCCGTCG CGGCCGAGGC CGGCAGGAAT
CTGAAGAAGG TGGTTCTCGA ACTGGGTGGC TCGGACCCCA TGGTCATCCT CGACACCGAC
GATCTCGATG CCCTGGTCGA GACGGCCATG GAGTCACGGA TGGGCAACAC CGGACAGGCG
TGCAACGCGC CCAAGCGGAT GATCGTGGTG GACGAGCTCT ACGACGACTT CGTGACGAAG
ATGGTCCAGG CTGCCCGCAG ACTCCAGCCA GGGGACCCGC TTGATCCGGA GACGACGCTT
GCCCCGCTGT CGTCGGAGCA GGCCGCCGTA CGCTTGATCG GACAGCTCGA CGAGGCGCGC
AACCAGGGCG CCACCATCCG CGTGGGCGGT CACCGGGTCG AGCGACCCGG CGCCTACGTC
GAGCCAACGG TGATCACTGA CGTCACGCCG GAGATGTCGG CCTATCGGGA CGAACTCTTC
GGCCCGGTGG CCATCATCTT CCGAGTCGAT GACGAGGACG ACGCGGTTCG ACTCGCGAAC
GACACGCCCT TCGGCCTCGG CGCCAGCGTC TTCTCAGGCG ATTCCGAGCG CGCCGAGCGC
GTGGCAGCCC GGATCGACGC CGGCATGGTC TACCTCAACC AGGCTGGCGG CTCGCAGCCC
GACCTCCCCT TTGGCGGCAT CAAGCGCTCC GGCATCGGCC GCGAACTCGG TGCCCTCGGC
ATCGAGGAGT TCATGAACAA GAAGGTCGTG CGGCTCTGA
 
Protein sequence
MYAVTNPATG ELISEFDTAT DAQVREAVSR ADLAFQSWKS TPLEERSRTL ARAADLFLER 
SDELARAITQ EMGKRLEESR GEVRIASDIF HYYSDNAPKL LADETIAIQG GEAKILKRPV
GVLLGIMPWN YPYYQVARFA APNLVLGNTI ILKHAPSCPQ SSALVEQLLH DAGVPVDAYI
NVYATNEQVA WALADPRIQG VSVTGSERAG AAVAAEAGRN LKKVVLELGG SDPMVILDTD
DLDALVETAM ESRMGNTGQA CNAPKRMIVV DELYDDFVTK MVQAARRLQP GDPLDPETTL
APLSSEQAAV RLIGQLDEAR NQGATIRVGG HRVERPGAYV EPTVITDVTP EMSAYRDELF
GPVAIIFRVD DEDDAVRLAN DTPFGLGASV FSGDSERAER VAARIDAGMV YLNQAGGSQP
DLPFGGIKRS GIGRELGALG IEEFMNKKVV RL
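
The gene and protein sequences above are shown in blocks of ten characters, and the gene begins with GTG although the protein begins with M. A minimal sketch of how the two relate is given below, using only the first 60 bases of the gene sequence as sample input. It assumes the codon assignments of the standard genetic code (which translation table 11 shares) and assumes the input is a complete CDS whose first codon is an initiator read as methionine. The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical and not part of any database tool.

```python
# Codon assignments in NCBI "TCAG" order. Translation table 11 uses these same
# assignments; it differs mainly in permitting alternative start codons (e.g. GTG).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(blocked):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Assumption: the sequence starts at a valid initiator codon, so the
    first residue is reported as M whatever the codon would otherwise encode.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First line of the gene sequence shown above (sample only, not the full gene).
first_line = "GTGTACGCCG TCACCAACCC GGCCACGGGC GAGCTGATCA GCGAGTTCGA CACGGCAACC"
seq = clean(first_line)

print(translate_cds(seq))          # MYAVTNPATGELISEFDTAT
print(round(gc_fraction(seq), 2))  # 0.67
```

The translated fragment matches the first twenty residues of the protein sequence above. The GC fraction printed here covers only the 60-base sample; the 67% in the record refers to the whole gene.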