Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4221 |
Symbol | |
ID | 9248095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5040381 |
End bp | 5041421 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Alcohol dehydrogenase GroES domain protein |
Protein accession | YP_003682119 |
Protein GI | 297563145 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.41128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGTCG CCCGCTTCTA CGCCCCCGGT GACATCCGAC TGGAGCAGGC CCCCGAGCCG ACCGCCGGAC CCGGACAGCT CAAGATCGCC GTCGTCAACT GCTCCACGTG CGGCACCGAC GTGAAGATCT CCCGGCACGG ACACCACCAC ATCCGTCCGC CCCGCGTGAT CGGCCACGAG ATCGCCGGGC GGATCGTCGA GGTGGGCGAG GGCGTCACCG GCTGGGCCGA GGGCGACCGC GTCCAGGTCA TCGCCGCCAT CCCCTGCGGC ACCTGCGTGG AGTGCTCCGA CGGCCGGTTC ACCGTGTGCT CGCGCCAGGA GTCCATGGGT TACCACTACG ACGGCGGCTT CGCCGAGTAC ATGATCATCC CCGAGTCGGT CCTGGCCGTG GACGGGGTCA ACCGCGTCCC CGACAACATC GACCTGGCCG AGGCCTCCGT CGCCGAGCCG CTGGCGTGCG TGCTCAACGG CCAGGAGATC GCCGGAGTCG GCGAGGGCGA CACGGTCGTG GTCATGGGCG CCGGGCCGAT CGGCTGCCTG CACGTCCGGC TGGCCCGTGC GCGCGGCGCC GCGAAGGTCT ACCTGGTGGA CCTCAACCGG GGCCGCCTGG ACATGTCCGC CGACATCGTC CAGCCCGACG CGTCGATCTG CGGCGCCGAG ACCGACGCCG TGGAGGAGGT GCTCCGCCTG ACCGACGGCC GGGGCGCCGA CGTCGTCATC ACCGCCGCCG CCTCCGGGCG CGCCCAGGAG GACGCGCTGC GCATGGTCTC GCGCAGCGGC CGGATCAGCT TCTTCGGCGG CCTGCCCAAG GACGCGCCGA TCATCCAGCT GGACTCCAAC GCCGTGCACT ACCGGGAGAT CTCGATCTTC GGCGCCAACG GCTCCAGCCC CGAGCACAAC CGCCGCGCCC TGGAGCTGAT CTCCTCCGGC GCCGTGCCGG TGGCGGACCT GATCACCGAG CGGATGTCCC TGTCCGACGT GCACAAGGCC ATCGAGACGG TGGCCTCGGG CACCGCGATC AAGGTGACCA TCCAGCCGTA G
|
Protein sequence | MLVARFYAPG DIRLEQAPEP TAGPGQLKIA VVNCSTCGTD VKISRHGHHH IRPPRVIGHE IAGRIVEVGE GVTGWAEGDR VQVIAAIPCG TCVECSDGRF TVCSRQESMG YHYDGGFAEY MIIPESVLAV DGVNRVPDNI DLAEASVAEP LACVLNGQEI AGVGEGDTVV VMGAGPIGCL HVRLARARGA AKVYLVDLNR GRLDMSADIV QPDASICGAE TDAVEEVLRL TDGRGADVVI TAAASGRAQE DALRMVSRSG RISFFGGLPK DAPIIQLDSN AVHYREISIF GANGSSPEHN RRALELISSG AVPVADLITE RMSLSDVHKA IETVASGTAI KVTIQP
|
| |