Gene Noca_1421 details

Gene Information

Locus tag: Noca_1421
Symbol:
ID: 4597328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1505518
End bp: 1506990
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 71%
IMG OID: 639776019
Product: aldehyde dehydrogenase
Protein accession: YP_922622
Protein GI: 119715657
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
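
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1505518, 1506990)
print(length, protein_length_aa(length))  # 1473 490
```

Both values agree with the Gene Length and Protein Length fields of this record.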


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.545693
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAACTG TCATGCCGTC CGGTGGACGA GTGTCGGGAC TCCTCGTCGG CGCCGGGTGG 
GAGACGGGCC GGGGAGCGCA GCTCCTCGAG GTCCGCGCAC CGTACGGCGG CGAGCTGCTG
GCCCGGATCG CCGAGTCCAC GCCGGACGAG GTCGACGCGG CGGTGACCCT GGCCGCGCGG
ACGTTCGCCA GGAACGAGCT CACCCCCCAC GAGCGGGCCG ACATCCTGCG CCGGACCTCC
CAGCTCATCG CTCAGCGCGC CGAGGAGTTC GCCCACTCGG TCGTGGCCGA GGCGGGCAAG
CCCCTCCGGG ACGCGCGGGC GGAGGTCGCC CGCGCCGTGC TGACCTTCGA GCTGTGCGCG
CAGGAGGCCA CCCGCCTGGC CGGCGAGGTC GTCCCCGTCG AGGGCACGCC CGGATCCGAG
AACCGCCGGG CATTCACGAT CAGGCGGCCG GCCGGCGTCG TGTGCGCCAT CACGCCGTTC
AACGGCCCGG TCAATCAGCT CTCGCACAAG GTGCCCACGG CCATCGCCGC CGGCTGCACC
GTGGTGGTCA AGCCCGCCGA GGTGACCCCG CTGTCCGCGA TCAAGGTCGT CCAAGCGATG
CTCGACGCCG GCCTGCCCCC GGGCCACGTG ACCGTCGTGC AGGGCCGTGG TGAGACCGTC
GGCCAGCAGC TTCTCGAAGA CCCGCGCTTC GCGGTGTACA GCTTCACCGG GAGCACGGCG
GTGGGCGCCC ACATCCGCCG GACCGTGGGG CTGCGGAAGA CCCTGCTCGA GCTGGGGAAC
AACTCCGCCA ACATCGTGCA CGCCGATGCG GACCTGGGTC TCGCCGCGAA GGTGCTCGCG
AAGTCGTCGA CGGCGTACGC CGGCCAGGTG TGCATCAGCG CTCAGCGCAT CCTGGTTCAC
GAGGACGTCT TCGAGGAGTT CTCCGCCCTG CTGGCGGACC AGGTCAGCGC GTTGAGCGTG
GGCGATCCGT CGGACGAAGG CACCGATGTC GGTCCGATGA TCTCGCTCGA TGCCGCGCGA
CGGGCGGAGC AGTGGGTCGC CCAGGCGATC GATGACGGGG CCAAGCTCGT GTGTGGGGGA
AGTCGCGACG GCCAGTTCTT CGTCCCGACC GTCGTAGCCA GGCCGGCCGC CCACTCGGCG
CTGGCCTGCC AGGAGGCGTT CGCGCCGGTG GCGGTGCTCA TCGCGTACCG AACGCTGCGG
GAGGCCATCG ACATCGCCAA CAGCACCGAG TACGGCCTCC AGGCGGCCGT GTTCACCGAG
GGCCTCGACG TGGCGATGGC GGTGGCACGA CGACTCGACG TGGGCGGCGT GATCGTCAAC
GACGCCTCCT CGTATCGCGT CGACTCCATG CCCTACGGCG GGGTCAAGCA CAGCGGGACC
GGGCGCGAGG GCGTGCGATA CGCCGTCGAG GAGATGACCG AATCGCAGCT CGTCGTGCTC
AACCTGCGCG ATCCCGTCGG CGAAGGCCTG TGA
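
The GC content reported for this gene can be checked against the sequence above. This is a minimal sketch, assuming GC content is the percentage of G and C among the A, C, G and T bases. The record does not state how the database computes or rounds the figure. The helper name gc_percent is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in the input.

    Spaces and line breaks (as in the 10-base blocks of this record)
    are ignored, as is any other non-ACGT character.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence; paste the full sequence to check the whole gene.
first_line = "ATGCAAACTG TCATGCCGTC CGGTGGACGA GTGTCGGGAC TCCTCGTCGG CGCCGGGTGG"
print(round(gc_percent(first_line), 1))
```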
 
Protein sequence
MQTVMPSGGR VSGLLVGAGW ETGRGAQLLE VRAPYGGELL ARIAESTPDE VDAAVTLAAR 
TFARNELTPH ERADILRRTS QLIAQRAEEF AHSVVAEAGK PLRDARAEVA RAVLTFELCA
QEATRLAGEV VPVEGTPGSE NRRAFTIRRP AGVVCAITPF NGPVNQLSHK VPTAIAAGCT
VVVKPAEVTP LSAIKVVQAM LDAGLPPGHV TVVQGRGETV GQQLLEDPRF AVYSFTGSTA
VGAHIRRTVG LRKTLLELGN NSANIVHADA DLGLAAKVLA KSSTAYAGQV CISAQRILVH
EDVFEEFSAL LADQVSALSV GDPSDEGTDV GPMISLDAAR RAEQWVAQAI DDGAKLVCGG
SRDGQFFVPT VVARPAAHSA LACQEAFAPV AVLIAYRTLR EAIDIANSTE YGLQAAVFTE
GLDVAMAVAR RLDVGGVIVN DASSYRVDSM PYGGVKHSGT GREGVRYAVE EMTESQLVVL
NLRDPVGEGL
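
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal standard-library version, not the database's own method. It relies on NCBI translation table 11 assigning the same amino acids to codons as the standard code and differing only in which codons may start translation. The name translate_cds is hypothetical.

```python
from itertools import product

# Codon-to-residue mapping in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first. Alternative start codons are NOT
    converted to M here; this gene starts with ATG, so that does not matter.
    """
    dna = "".join(sequence.upper().split())
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("ATGCAAACTG TCATGCCGTC CGGTGGACGA"))  # MQTVMPSGGR
```

The printed residues match the first ten residues of the protein sequence above. The last three codons of the gene, GGC CTG TGA, translate to GL followed by a stop, which matches the end of the protein.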