Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5023 |
Symbol | |
ID | 9248912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 164491 |
End bp | 165732 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | monooxygenase FAD-binding protein |
Protein accession | YP_003682910 |
Protein GI | 297563937 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCAG CCGATGGTGC CGACATCGTG ATCGTCGGCG CTGGGATCGG AGGGCTGGCC ACAGCCCTCG CCCTGCACTC CCACGGAATC AGCGCCACCG TTCTGGAAAC CGCCGAGGAG ATCCGTCCGC TCGGGGTCGG CATCAACGTC CAGCCCGCGG CGATCGCCGA GCTGACCGCC CTCGGCCTCG GCGACGCGCT CGCCGCGACG GGCATCCCCA CCCGGGAACA CCTCTACCTC GACCACAGGG GCACCACCCT CTGGAACGAA CCCCGCGGCG TGGCGGCGGG CAACGAGTAC CCGCAGTACT CGATCCACCG CGGTGAACTC CAACTGCTCC TGCTGGAGGC CGTCCGCGAG CGTCTGGGCC CCGGCGCGGT CCGCACCGGC CTGCGCCTTG ACTCCTTCGA GCAGACCGCG ACGGGCGTGC GCGCCCTGGC CCGCGACCAG TCCGGCGGAG CCGCGGAGGT CACGGGAGCG GCGTTGGTCG GCGCGGACGG GCTGCACTCG CGCGTCCGCG CGCAGCTCCA CCCCGACCGC ACCGCCCTGT CCGGCGGAGG CGTCCACATG TGGCGGGGCC TGACCGAGCT GGACGGCTTC CTGGACGGGC GCACCATGAT CGTCGCCAAC GACGAGCACT CCGCCAGGCT CGTCGCGTAC CCGATCTCGG CCCGCCACGC GGCGCGCGGC CGCGCACTGC TCAACTGGGT GTGCCTGGTG CCCTCCCCCG GTCTGTCCGT CGACGCGGAC TGGGACGACA ACGGGCGCAT CGAGGAACTC GCCCCGCACT ACGCGCACTG GGACTTCGGC TGGCTCGACG TCCCCGGCGT GCTCGCCCGC AGCGAGCGGA TCCTCCAGTA CCCCATGGTG GACCGGGACC CCCTGGAGCA CTGGGGCGAG GGCCGCACGA CCCTGCTCGG GGACGCGGCC CACCTGATGT ACCCCATCGG GGCCAACGGC GCGTCCCAGG CCGTCCTCGA CGCGGCGGCC CTCGCCGCCG AGTTGGGGAC CGGTGGCGAC GTGGAGGCGG CGCTGCGGCG CTACGAGGAC GTGCGGCTCC CCGCGACCAA CGAGATCGTC CGGGCCAACC GCAGGATGGA CCGCTCGGAG CGCTCGATGG CGGGGCGGAC GGACCGGGAG AAGAGCACCC TGCTGGAGGC GGTCACCGAC GACTACCGTG AGGCGGTCGA GCGGCGGCTG GACAACGGGG GCCTGGACAC CTCGGGCGCC CGCGCCCGCT GA
|
Protein sequence | MTSADGADIV IVGAGIGGLA TALALHSHGI SATVLETAEE IRPLGVGINV QPAAIAELTA LGLGDALAAT GIPTREHLYL DHRGTTLWNE PRGVAAGNEY PQYSIHRGEL QLLLLEAVRE RLGPGAVRTG LRLDSFEQTA TGVRALARDQ SGGAAEVTGA ALVGADGLHS RVRAQLHPDR TALSGGGVHM WRGLTELDGF LDGRTMIVAN DEHSARLVAY PISARHAARG RALLNWVCLV PSPGLSVDAD WDDNGRIEEL APHYAHWDFG WLDVPGVLAR SERILQYPMV DRDPLEHWGE GRTTLLGDAA HLMYPIGANG ASQAVLDAAA LAAELGTGGD VEAALRRYED VRLPATNEIV RANRRMDRSE RSMAGRTDRE KSTLLEAVTD DYREAVERRL DNGGLDTSGA RAR
|
| |