Gene Noca_1188 details


Gene Information

Locus tag: Noca_1188
Symbol:
ID: 4599288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1261422
End bp: 1262807
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 71%
IMG OID: 639775782
Product: aldehyde dehydrogenase
Protein accession: YP_922389
Protein GI: 119715424
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
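
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon. The record does not state either convention, but both are consistent with the values it lists. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the Gene Information record above.
start_bp = 1261422
end_bp = 1262807


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1386 461
```

Both results agree with the Gene Length and Protein Length fields.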


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.417271
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCTG ACCCGATCAT CACCGTCGAC CCCGCGACCG GCCGCGAGCT GCGTCGCTAC 
GAGGTCATGA CGGACACCGA CATCGAGGGC GTGCTCGCCC AGGTGACCGC GGCCCAGAGC
CTCTGGGCCG GCTCGTCCGT CGAGATCCGC ACCGGGGTGC TCCTCCGGGC GGCCGGGATC
CTGCGGGATC GGGCCGCCGA GCTCGGCGCC CTCGCGACCG CCGAGATGGG CAAGCCGATA
ACGGAGGCGA TCGCGGAGGT CGAGAAGTGT GCGTGGGTGT GCGAGTACTA CGCCGAGACC
TCGCCCGGGC AGCTGGCGGA CGAAGAGGTC TCCGCCGCCG GGTCCCGCAG CTTCATCCGT
TACGAGCCAC TCGGCACGGT GCTGGCGATC ATGCCGTGGA ACTACCCGTT CTGGCAGGTG
TTCCGCTTCG TCGCGCCCAC GCTCGCCGCG GGCAACGCGG GAATCCTCAA GCACTCGCCC
AACGTGACCG GGGTCGCGCT CGCCATCGAG CAGGTGCTCA CCCAGGCGGG CCTCCCGGCT
GGCGTCTTCC GCACCCTTCT CGTGGCCGAG GCTGAGGTGC CCAGGGTCGT GGACGGACTC
ATCCAGGACG ATCGCGTTGC TGCGGTGACC CTGACCGGCA GCAACAGGGC GGGAGCGAGC
GTCGCCGCGA GTGCCGGTCG CGCCGCCAAG AAGACCGTCC TGGAGCTCGG GGGATCCGAC
CCGTTCGTCG TGCTCGCCGA CGCCGATCTG GATGTCGTCG TTCCCAAGGC GGTGGCGGGG
CGCTTCCTCA ATACCGGCCA GTCGTGCCTG TGTGCCAAGA GGTTCATCGT GCACGAGTCG
CTGGCAGGCG AGTTCGCTCG ACGTTTCACC GCCGCGGTGG AAGACCTGGC GATCGGACCG
CCGGGCGAGG AAGGCACCAG GATCGGGCCG CTCGCCCGGG CTGATCTGGC GGAGAACCTC
GGGCGCCAGG TCGACGAGTC GGTCGCGGCC GGCGCGGTCG TCCTCACGGG CGGCAAGCGG
CTGGACCGGG GACCGGCCTG GTACGCCCCC ACCGTCCTGG TGAACGTCAC GCCCGACATG
CCGGTCATGG CCGAAGAGAC CTTCGGCCCG GCAGCAGCGG TCGTCGAGTT CGCCACAGAT
GACGAGGCGG TGGCGCTCGC GAACGCGACC CCGTACGGCC TCGGGGCCAG CGTCTGGTCC
GCGGATGCCG GCCACGCGCT CGAGGTCGGT TCGCGGATCA GCTCCGGCGC GCTCTTCGTC
AACGCCACCA CCGCATCCGA CCCGCGGCTG CCCTTCGGGG GGGTGAAGCA GAGCGGCTAC
GGTCGCGAGC TGGGTCCACT CGGCGCACGC GAGTTCACCA ACATCCGCAC CGTCGTCATC
GGGTGA
 
Protein sequence
MTADPIITVD PATGRELRRY EVMTDTDIEG VLAQVTAAQS LWAGSSVEIR TGVLLRAAGI 
LRDRAAELGA LATAEMGKPI TEAIAEVEKC AWVCEYYAET SPGQLADEEV SAAGSRSFIR
YEPLGTVLAI MPWNYPFWQV FRFVAPTLAA GNAGILKHSP NVTGVALAIE QVLTQAGLPA
GVFRTLLVAE AEVPRVVDGL IQDDRVAAVT LTGSNRAGAS VAASAGRAAK KTVLELGGSD
PFVVLADADL DVVVPKAVAG RFLNTGQSCL CAKRFIVHES LAGEFARRFT AAVEDLAIGP
PGEEGTRIGP LARADLAENL GRQVDESVAA GAVVLTGGKR LDRGPAWYAP TVLVNVTPDM
PVMAEETFGP AAAVVEFATD DEAVALANAT PYGLGASVWS ADAGHALEVG SRISSGALFV
NATTASDPRL PFGGVKQSGY GRELGPLGAR EFTNIRTVVI G
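
The sequences above are printed in space-separated blocks of 10, so they need cleaning before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction helper and a translation step. It assumes the amino acid assignments of translation table 11 (which match the standard code) and uses only the first line of the gene sequence as sample input. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, chosen for illustration.

```python
# Build a codon table from the compact TCAG ordering.
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
sample = clean(
    "ATGACCGCTG ACCCGATCAT CACCGTCGAC CCCGCGACCG GCCGCGAGCT GCGTCGCTAC"
)
print(translate(sample))  # prints: MTADPIITVDPATGRELRRY
```

The translated sample matches the first 20 residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.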