Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2247 |
Symbol | |
ID | 9246097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2689943 |
End bp | 2691037 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Alcohol dehydrogenase zinc-binding domain protein |
Protein accession | YP_003680175 |
Protein GI | 297561201 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.792853 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACGA CCGTCAGGGC CGCCGTCTTC TCCGAGCGCG GCGCCCCTCC CCGGATCCGC GACCTGGTCC TGCCCGACCC CGGCCCCGGC CAGGTGCGGG TGCGCCTGGC AGCGGCCGGG GTGTGCCACT CCGACCTGTC CCTGTCCAAC GGCACCCTGG CGCAGAAGTG GCCCGCGGTG CTGGGCCACG AGGGTGCCGG AACCGTCGAC GCCGTCGGCG AGGGCGTCAC GGAGGTGGTC CCCGGCCAGC AGGTGATCCT GAACTGGGCC CCCTCGTGCC GCGAGTGCTG GTTCTGCCGC CAGGGCGAAC CTCACCTGTG CGAGCACGCG CTGGACCGCA CCGTACTGCC CTACGCGGAG CTCGCCGACG GCACGCCCGT CTACCCCGGC CTGGGCTGCG GCGCGTTCGC CGAGGCCACC GTGGTGCCCG CCTCCGCCGT CGTGCCGCTG CCCGATGGGA TCGACCCGGC GGTGGCCGCC GTGCTGGGCT GCGCGGTGCT CACCGGCTGG GGCGCGGTCC ACAACTCCGC GGGCGTGCGC GAGGGCCAGT CGGCCGTGGT GCTGGGCCTG GGCGGGGTGG GCCTGTCGGT GCTCCAGGCC GCGCGTCTGG CCGGGGCCGA CCCGGTGGTC GCGGTGGACG TCTCCCCCGC CAAGGAGGAG CTGGCCCGTT CACTGGGCGC CACCGAGTTC CTGCTCGCCG ACGAGACCCT GGTCAGGGCC GTGCGCGCGC TGACGGGCAG GCGCGGCGCC GACCACGCCT TCGAGGTGGT GGGGTCGGCG AAGGCGGTGC GCTCGGCCTG GGACGTGACC CGGCGCGGCG GCACGGTCAC GGTGGTGGGC GTGGGCAGGG TGGACGACGA GGTGTCCTTC AACGCGCTGG AGCTGTTCCA CCAGGCGCGC ACGCTGCGCG GGTGCGTGTA CGGCTCCAGC GACCCGGAGC GCGACGTCCC GCTCATCGCC GAGCGGGTGC GTTCGGGGGA GCTGAAGCTG GCGGCGATGG TCACCGACGA GATCCCGCTC GAAGGCGTGC CCGAGGCCTT CGAGCGCATG GCCCGGGGCA GGGGCGGCCG GTCGCTGGTG CGCTTCGGGG CCTGA
|
Protein sequence | MSTTVRAAVF SERGAPPRIR DLVLPDPGPG QVRVRLAAAG VCHSDLSLSN GTLAQKWPAV LGHEGAGTVD AVGEGVTEVV PGQQVILNWA PSCRECWFCR QGEPHLCEHA LDRTVLPYAE LADGTPVYPG LGCGAFAEAT VVPASAVVPL PDGIDPAVAA VLGCAVLTGW GAVHNSAGVR EGQSAVVLGL GGVGLSVLQA ARLAGADPVV AVDVSPAKEE LARSLGATEF LLADETLVRA VRALTGRRGA DHAFEVVGSA KAVRSAWDVT RRGGTVTVVG VGRVDDEVSF NALELFHQAR TLRGCVYGSS DPERDVPLIA ERVRSGELKL AAMVTDEIPL EGVPEAFERM ARGRGGRSLV RFGA
|
| |