Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2095 |
Symbol | |
ID | 4595540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 2238419 |
End bp | 2239969 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639776698 |
Product | aldehyde dehydrogenase |
Protein accession | YP_923291 |
Protein GI | 119716326 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.110223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACGAGG GCTTCGTACG CGCCGAGGTC GCGGGTGTCG TCGGGCGGGC CCGGAGCGGG CAGCGCTCGC TGGGCGCGCT GTCGGTGGCC GAGCGCCTGG TGCACCTGCG CGCCCTGCGC GGCGCGATCG CGGGCCGGGT CGACGAGATC GTCGACCGGG TGCAGACCGA GACCGGCAAG TCGCGCTCGG ACATCTTGAT GTCGGAGATC TTCGGCGCGA TGGACGCGAT CGCCTGGCTC GAGGCCAACG CCGACGAGGC GCTCGCCGAC GAGAAGGTGC CCACACCGAT GACCCTGATG GGCAAGAAGT CGCGGGTCTG GTTCCAGCCG CGCGGGGTCG TCCTCGTCAT CTCGCCGTGG AACTACCCGT TCTTCCAGGC GGTCGTCCCG ATCGCGAGCG CCCTGGCCGC CGGCAACGCG GTGGTCTACA AGCCGAGCGA GCACACCCCG TTGGAGGGAC TGGTCGAGTC GCTGGCCGAG CAGGCGGCGA TCGCGCCGCA CTGGCTGCAG ATCGTGTACG GCGACGGGTC GGTCGGCGCG GAGGTGATCG GGCAGCGACC CGACCAGGTG ATGTTCACCG GGTCGACCCG CACCGGCCGG GCGATCCTGC GCCAGGCGGC CGAGCTGCTC ATCCCCGTGG AGCTCGAGCT CGGCGGCAAG GACCCGATGA TCGTCTTCGA GGACGTCAAC ATCGCCCGGA CGGCGGCCGG GGCCGCGTTC GGGGCGCTCA CCGCGGCCGG CCAGTCCTGC ACCTCGGTCG AGCGGCTCTA CGTCCACGAG TCGGTCCACG ACGAGTTCGT CGACACCCTC GTCGAGGTGG TCTCGTCGCT GCGGCTCGTC GAGTCGCCCG GGGACGACCG CGACGGCGAC GGCGACATCG GCTGCATGAC CACCGACTTC CAGGTGCGTA CCGTCGCCGA GCACGTCCTC GACGCCCGCG CCCGCGGCGC CCGGGTGCGC ACCGGCGCCG ACTGGGATGC GGCGGCGGTC CTGGACGGCC GGCCCGGCCT GTCCGGCCGG CCGTTCCGGC TGGTGCCGCC GATGGTGGTC ACCGACCTGC CCGACGACGC GCTGCTGGCG ACCGAGGAGA CGTTCGGACC AGTGGTACCG GTGCTGCGAT TCGCCGGCGA GCAGGAGGTG ATCGAGCGCG CCAACGCCTC GGCGTACGGC CTGACCGCGA GCGTGTGGAG CGCCGACGCC GAGCGGGCCG AACGGGTCGC GCGGCAGCTG CGCTGCGGCG GCGTGTCGAT CAACAACGTG ATGGCCACCG AGGCGACTCC CGCGCTGCCG TTCGGCGGGG TCGGCGAGTC GGGCATGGGC CGCTACAAGG GCGTGGCCGG GCTGCGCGCG TTCACCAACC CGCAGGCGGT CGTCGTCGAC TCCGACGGCA CCAAGCTCGA GGCCAACTGG TACCCCTACA CCGCCCGGAA GCACGCCCTG TTCACCTCGA TGATGCGGGC CTGGTTCAGC GACGGACCGA CCCGGTTGGC CCGGTTCGCG GTCGCCGGCG CGCGGCTCGA GCGCCATGCC CAGAAGGCTC GCCGTGAGTA G
|
Protein sequence | MNEGFVRAEV AGVVGRARSG QRSLGALSVA ERLVHLRALR GAIAGRVDEI VDRVQTETGK SRSDILMSEI FGAMDAIAWL EANADEALAD EKVPTPMTLM GKKSRVWFQP RGVVLVISPW NYPFFQAVVP IASALAAGNA VVYKPSEHTP LEGLVESLAE QAAIAPHWLQ IVYGDGSVGA EVIGQRPDQV MFTGSTRTGR AILRQAAELL IPVELELGGK DPMIVFEDVN IARTAAGAAF GALTAAGQSC TSVERLYVHE SVHDEFVDTL VEVVSSLRLV ESPGDDRDGD GDIGCMTTDF QVRTVAEHVL DARARGARVR TGADWDAAAV LDGRPGLSGR PFRLVPPMVV TDLPDDALLA TEETFGPVVP VLRFAGEQEV IERANASAYG LTASVWSADA ERAERVARQL RCGGVSINNV MATEATPALP FGGVGESGMG RYKGVAGLRA FTNPQAVVVD SDGTKLEANW YPYTARKHAL FTSMMRAWFS DGPTRLARFA VAGARLERHA QKARRE
|
| |