Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4665 |
Symbol | |
ID | 9248547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5541684 |
End bp | 5543273 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Berberine/berberine domain protein |
Protein accession | YP_003682557 |
Protein GI | 297563583 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.293743 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCACA GCCACGGCAT TCCTCCCCTG ACCACCGTCA AGCCCTCCGA CAAGCGCTAC CCAGCCCTCG ACCGGGGGTT CAACCAGCGT TGGATCGCCC GTCCCGACTA CGTGCGGTTG GTGGCCACAC CCGAGGGCGC CGCCCGCGCC CTGTCCGAGG CGGTCAACCG GACGACCGCG GACCTGGACC GCACACGGAT CACCGTGCGC TCGGGCGGCC ACTGCTACGA GGACTTCGTC TGCGGCGACG ACGTCCGCGT GATCCTGGAC GTCAGCCCCA TGTGCGGGAT CGGCTACGAC CCCGGACTGG AGGCGTACTG CGTGGAGGCG GGCGCCACCA ACTGGCACGC CTACAGCCAC CTGTATCCGC TCAGCGGGAA GGCCCTGCCG GGCGGGTCCT GCTACTCGGT CGGCATGGGC GGGCACATCA CCGGGGGCGG CTACGGTCTG CTCTCCCGCC AGTACGGGCT CACCGTGGAC TACCTGTACG CGGTGGAGGT GGCGGTGGTG CGCCCCGACC GCTCGGTCGA ACTCGTCCTG GCCACCCGGG ACGACCCCGA CCCCGACCGC CGGGAGCTGC TGTGGGCGCA CACCGGGGGC GGCGGCGGGA ACTTCGGGAT CGTCACCCGG TTCTGGTTCC GCGACCTGCC CGAACCCCCG TCGAACGTGC TGCTCAGCGG CCTCTCGTGG AAGTGGTCGG ACTTCACCAA GGACGACTTC GCCGCCCTGG TCACGGCGTA CGGCGAGTAC TTCCGCGACC ACCAGGACCC GGACGAGGCG TCCGGCAGGC TCTTCGCGCT GCTCAAGCTC AACCACGTCA GCAACGGGGA GATCGGCCTC GTCGCCCAGG TCGACGCCGA CGACCCCGAG GGCGGGCGGG CCATGGAGGA GTTCCTGGAC GCCGTCGACG GCCGGATCGC ACCGAAGTCC GGGCAGATGA CGACCCCCAT GGGGGAACAC CCCGCGGTCC AGGGCATGCA GACCCCCCGG AGGCTGCCCT GGCTGACGGC CACCCAGGTA CTCAACGGCT CCGGCGAGAA CCAGTGCGGC AAGTACAAGT CGGCCTACCT GCGCAAACCC TTCAGCCCGG CGCAGATCGA GGCCATGTGG GCCTATCTGG GCAAGGAGCA CTACACCGAC TACGTCAACA AGGAGGCCCT GATCCAGGTC GACTCCTACG GCGGGGCCGT CAACCAGCCG CCGGCGGACA CCGCCGTGCC CCAGCGCGAC TCCGTGCTCA AGGTGCAGTA CCAGGTCTAC TGGAAGCGCA CCGAGCCGGA GTCAGCCGTG GCCGGGCACC TGGCGTGGAT CCGCGACGTC TACCGCAAGA CGTTCGCGTC CACCGGCGGG GTCCCCGTGA TCGGAGACGC CACCGACGGG TGCTACGTCA ACTACCCGGA CGTCGACCTG AGCGACCCGG CCTGGAACAC CTCGAAGGAC CGCTGGTCGA AGCTCTACTA CAAGGACTCC TACCCCCGGC TCCAGCGGGT GAAGGAGCGG TGGGACCCGC TCAACGTGTT CCGGCACCGG CAGTCGGTCG AACTCCCCGC GTCGGCGGGC GATGCCGGTC AGGGCTCCGG ATCCTCCTGA
|
Protein sequence | MSHSHGIPPL TTVKPSDKRY PALDRGFNQR WIARPDYVRL VATPEGAARA LSEAVNRTTA DLDRTRITVR SGGHCYEDFV CGDDVRVILD VSPMCGIGYD PGLEAYCVEA GATNWHAYSH LYPLSGKALP GGSCYSVGMG GHITGGGYGL LSRQYGLTVD YLYAVEVAVV RPDRSVELVL ATRDDPDPDR RELLWAHTGG GGGNFGIVTR FWFRDLPEPP SNVLLSGLSW KWSDFTKDDF AALVTAYGEY FRDHQDPDEA SGRLFALLKL NHVSNGEIGL VAQVDADDPE GGRAMEEFLD AVDGRIAPKS GQMTTPMGEH PAVQGMQTPR RLPWLTATQV LNGSGENQCG KYKSAYLRKP FSPAQIEAMW AYLGKEHYTD YVNKEALIQV DSYGGAVNQP PADTAVPQRD SVLKVQYQVY WKRTEPESAV AGHLAWIRDV YRKTFASTGG VPVIGDATDG CYVNYPDVDL SDPAWNTSKD RWSKLYYKDS YPRLQRVKER WDPLNVFRHR QSVELPASAG DAGQGSGSS
|
| |