Gene Noca_2095 details

Gene Information

Locus tag: Noca_2095
Symbol:
ID: 4595540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2238419
End bp: 2239969
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 73%
IMG OID: 639776698
Product: aldehyde dehydrogenase
Protein accession: YP_923291
Protein GI: 119716326
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
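
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2238419, 2239969)
print(length, protein_length_aa(length))  # prints: 1551 516
```

Under those assumptions the result matches the listed gene length of 1551 bp and protein length of 516 aa.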


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.110223
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACGAGG GCTTCGTACG CGCCGAGGTC GCGGGTGTCG TCGGGCGGGC CCGGAGCGGG 
CAGCGCTCGC TGGGCGCGCT GTCGGTGGCC GAGCGCCTGG TGCACCTGCG CGCCCTGCGC
GGCGCGATCG CGGGCCGGGT CGACGAGATC GTCGACCGGG TGCAGACCGA GACCGGCAAG
TCGCGCTCGG ACATCTTGAT GTCGGAGATC TTCGGCGCGA TGGACGCGAT CGCCTGGCTC
GAGGCCAACG CCGACGAGGC GCTCGCCGAC GAGAAGGTGC CCACACCGAT GACCCTGATG
GGCAAGAAGT CGCGGGTCTG GTTCCAGCCG CGCGGGGTCG TCCTCGTCAT CTCGCCGTGG
AACTACCCGT TCTTCCAGGC GGTCGTCCCG ATCGCGAGCG CCCTGGCCGC CGGCAACGCG
GTGGTCTACA AGCCGAGCGA GCACACCCCG TTGGAGGGAC TGGTCGAGTC GCTGGCCGAG
CAGGCGGCGA TCGCGCCGCA CTGGCTGCAG ATCGTGTACG GCGACGGGTC GGTCGGCGCG
GAGGTGATCG GGCAGCGACC CGACCAGGTG ATGTTCACCG GGTCGACCCG CACCGGCCGG
GCGATCCTGC GCCAGGCGGC CGAGCTGCTC ATCCCCGTGG AGCTCGAGCT CGGCGGCAAG
GACCCGATGA TCGTCTTCGA GGACGTCAAC ATCGCCCGGA CGGCGGCCGG GGCCGCGTTC
GGGGCGCTCA CCGCGGCCGG CCAGTCCTGC ACCTCGGTCG AGCGGCTCTA CGTCCACGAG
TCGGTCCACG ACGAGTTCGT CGACACCCTC GTCGAGGTGG TCTCGTCGCT GCGGCTCGTC
GAGTCGCCCG GGGACGACCG CGACGGCGAC GGCGACATCG GCTGCATGAC CACCGACTTC
CAGGTGCGTA CCGTCGCCGA GCACGTCCTC GACGCCCGCG CCCGCGGCGC CCGGGTGCGC
ACCGGCGCCG ACTGGGATGC GGCGGCGGTC CTGGACGGCC GGCCCGGCCT GTCCGGCCGG
CCGTTCCGGC TGGTGCCGCC GATGGTGGTC ACCGACCTGC CCGACGACGC GCTGCTGGCG
ACCGAGGAGA CGTTCGGACC AGTGGTACCG GTGCTGCGAT TCGCCGGCGA GCAGGAGGTG
ATCGAGCGCG CCAACGCCTC GGCGTACGGC CTGACCGCGA GCGTGTGGAG CGCCGACGCC
GAGCGGGCCG AACGGGTCGC GCGGCAGCTG CGCTGCGGCG GCGTGTCGAT CAACAACGTG
ATGGCCACCG AGGCGACTCC CGCGCTGCCG TTCGGCGGGG TCGGCGAGTC GGGCATGGGC
CGCTACAAGG GCGTGGCCGG GCTGCGCGCG TTCACCAACC CGCAGGCGGT CGTCGTCGAC
TCCGACGGCA CCAAGCTCGA GGCCAACTGG TACCCCTACA CCGCCCGGAA GCACGCCCTG
TTCACCTCGA TGATGCGGGC CTGGTTCAGC GACGGACCGA CCCGGTTGGC CCGGTTCGCG
GTCGCCGGCG CGCGGCTCGA GCGCCATGCC CAGAAGGCTC GCCGTGAGTA G
 
Protein sequence
MNEGFVRAEV AGVVGRARSG QRSLGALSVA ERLVHLRALR GAIAGRVDEI VDRVQTETGK 
SRSDILMSEI FGAMDAIAWL EANADEALAD EKVPTPMTLM GKKSRVWFQP RGVVLVISPW
NYPFFQAVVP IASALAAGNA VVYKPSEHTP LEGLVESLAE QAAIAPHWLQ IVYGDGSVGA
EVIGQRPDQV MFTGSTRTGR AILRQAAELL IPVELELGGK DPMIVFEDVN IARTAAGAAF
GALTAAGQSC TSVERLYVHE SVHDEFVDTL VEVVSSLRLV ESPGDDRDGD GDIGCMTTDF
QVRTVAEHVL DARARGARVR TGADWDAAAV LDGRPGLSGR PFRLVPPMVV TDLPDDALLA
TEETFGPVVP VLRFAGEQEV IERANASAYG LTASVWSADA ERAERVARQL RCGGVSINNV
MATEATPALP FGGVGESGMG RYKGVAGLRA FTNPQAVVVD SDGTKLEANW YPYTARKHAL
FTSMMRAWFS DGPTRLARFA VAGARLERHA QKARRE
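
The gene sequence begins with GTG, yet the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation together with a GC-content helper. It is not the method the database used to produce the record. The helper names are hypothetical, and the codon table is built from the standard NCBI amino-acid string, which table 11 shares with the standard code.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# of table 11 are assumed identical to the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for table 11; an initiating codon is translated as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAACGAGG GCTTCGTACG CGCCGAGGTC"))  # prints: MNEGFVRAEV
```

The ten residues produced match the start of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give a figure to compare against the reported 73%.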