Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3998 |
Symbol | |
ID | 9247870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4781498 |
End bp | 4782499 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | Alcohol dehydrogenase GroES domain protein |
Protein accession | YP_003681901 |
Protein GI | 297562927 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.582496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.235514 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTGCCG CCATCATCAC CGAACCGGGC TCCGTCACCG TGGGGGACCG CCCCGACCCG CACCCCGCCG CCGACGGCGT GGTGGTCCGG GTCGGCGCCT GCGGGATCTG CGGGACGGAC CTGCACATCG CCGACGGGGA GTTCCCGCCC AGCCCCTACC CGCTGGTGCC CGGGCACGAG TTCGCCGGAA CGGTCACGGC CGTCGGCGAG AACGCCCCGG GGGACCTGCG CCCCGGCGAC CGGGTGGCCG TGGACCCCTC CCTGTTCTGC GGGCACTGCG AGTACTGCCG CGCCGGACGG GGCAACCTGT GCGCCAACTG GGGCGCGATC GGCGACACCG TGGACGGCGC GTTCGCCGAG TACGTGGCCG TCCCCGCCGC CAACTGCTAC CGCCTGCCCG ACTCGGTGAG CATGCGCGAG GGCGCCCTGG TCGAGCCGCT CTCGTGCGCC GTCCACGGTG TCCGGCGCAT CGGGGTGGAG ACCGGCGAGC GCTTCCTGGT GGTGGGCGCC GGGACCATGG GCCTGCTCCT CCAGCAGCTG TTGCAGAACT CCGGCGCCCG CGTGACGGTG GTGGACCGCA ACACCCGCCG CCTGGCCATC GCGACCGACC TCGGCGCCGC CGCGACCGCG ACGGACGCGT CCGAGCTGGG CGACGAGCGC TTCGACGCCG CGGTGGACGT CACGGGCGCG CCCTCCGCCA TCGAGGCGGC CTTCGACTCG CTGCGGCGGG GAGGGCGCCT GCTGGTCTTC GGGGTCGCCG ACGAGGCGGC CCGCGTGGCG CTGTCGCCGT TTCGGATCTA CAACGACGAG ATCACCGTCG TGGGCTCCAT GGCCGTGCTC AACAGCTACG GGGCGGCCGT GGACCTCATC AGCAGCGGCG CGGTGCGCAC GGCGCCGCTG CTCACCGACG CCCTGCCCCT GGAGAAGTTC CCCGAGGCGC TGGCCATGAT GCGCGCGGGC ACCGGGGTGA AGGTCCAGGT CGTCCCCGAC GAGACCGCCT GA
|
Protein sequence | MRAAIITEPG SVTVGDRPDP HPAADGVVVR VGACGICGTD LHIADGEFPP SPYPLVPGHE FAGTVTAVGE NAPGDLRPGD RVAVDPSLFC GHCEYCRAGR GNLCANWGAI GDTVDGAFAE YVAVPAANCY RLPDSVSMRE GALVEPLSCA VHGVRRIGVE TGERFLVVGA GTMGLLLQQL LQNSGARVTV VDRNTRRLAI ATDLGAAATA TDASELGDER FDAAVDVTGA PSAIEAAFDS LRRGGRLLVF GVADEAARVA LSPFRIYNDE ITVVGSMAVL NSYGAAVDLI SSGAVRTAPL LTDALPLEKF PEALAMMRAG TGVKVQVVPD ETA
|
| |