Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1404 |
Symbol | |
ID | 9245254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1720919 |
End bp | 1721965 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | Alcohol dehydrogenase zinc-binding domain protein |
Protein accession | YP_003679342 |
Protein GI | 297560368 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.306134 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCG TCGCCGCCAC CACCGCCACC CAGGTCCTGC TGCCCGGCCG AGTCGAACCC TCCGGCCTCC AGGTTCGGCC CCGAGCACTG TCCGCTCCCG GAGCCGGACA CGTCCTGCTG CGCATGGAGG CGACCGGGGT ATCGTTCGCC GAGCAGCAGA TGCGCCGGGG CAAGTACTTC GACCAGCCCC CGTTCCCGTT CGTCCCGGGC TACGACGTCG TCGGCACCGT CGAGGCCGCC GGTCCGGGCG TGAACGCCTC CCTCGTCGGG CGCCGCTTCG CCGCCGTCAC CAAGACCGGC GCCTGGGCCA GCCACCTGGT CCTGGACGCA CGCGATCTCG TGGCGGTGCC GGAGGGGACG GACCCGGCGC AGGTCGAGAC GCTGCTCGTC AACGGCATCA CCGCCTGGCA GATGCTGCAC CGCACGGCCC GCGTGCGACG GGGCGGCACC GTCGTCGTAC TGGGCGCCAA CGGCGGCGTG GGCACGGTCC TGGTCCAGCT CGCCCTGCAC GCCGGGATCA CCGTGATCGG CACGGCCGCG CCGCGCCACC ACGAGGCCGT GCGCGCGCTC GGCGCGACCC CCGTCGACTA CCGCGACCCG CACGTGTACG ACAGGATCCG GGACCTGGCG CCCGAGGGCG TGGACGCGGT GTTCGACCAC GTCGGCGGCG GGAACCTGGT CCGGTCGTGG CGCCTGCTGC GGCGCGGCGG CACACTCGTC TCCTACGGCA CGGCGTCGAC GAAGGACGTC GAGGGCGACT CCCGGCTGCC CGTGCTCGCG CTCTTCGGCC GGCTCCTGGT GTGGAACGCC CTGCCCAACG GCCGGAGCGC CCACTTCTAC AACTTCTGGG CCGGACGCGC CCGCCGCGCC GACGCCTTCC GCGCCCGGCT GCGCGAGGAC CTGACCGAGG TGCTGCGGCT GCTCGCCGAC GGGGTCCTGA CACCGCAGGT GGCCGCCCGC GTCCCGCTCT CGGAGGCCTC CCGCGCGCTG GCCCTCGCCG AGTCCCGCAC GGTCGTCGGG AAGGTCGTGC TCGTCCCCGA CGCGTGA
|
Protein sequence | MNAVAATTAT QVLLPGRVEP SGLQVRPRAL SAPGAGHVLL RMEATGVSFA EQQMRRGKYF DQPPFPFVPG YDVVGTVEAA GPGVNASLVG RRFAAVTKTG AWASHLVLDA RDLVAVPEGT DPAQVETLLV NGITAWQMLH RTARVRRGGT VVVLGANGGV GTVLVQLALH AGITVIGTAA PRHHEAVRAL GATPVDYRDP HVYDRIRDLA PEGVDAVFDH VGGGNLVRSW RLLRRGGTLV SYGTASTKDV EGDSRLPVLA LFGRLLVWNA LPNGRSAHFY NFWAGRARRA DAFRARLRED LTEVLRLLAD GVLTPQVAAR VPLSEASRAL ALAESRTVVG KVVLVPDA
|
| |