Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4054 |
Symbol | |
ID | 9247926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4849236 |
End bp | 4850384 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | oxidoreductase domain protein |
Protein accession | YP_003681956 |
Protein GI | 297562982 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTGA GTAAGGTCAC AAGCCTGGGT GAGGCGGCCG ACCCCGGGTC CACCGCCCCC AACGCCTACC GCGCCGCGGT GATCGGTACC GGCTTCATGG GCCGGGTCCA CTCCCACGCC GTCCGCGCCG CGGGAGGCGA GGTCGTCGGG GTCGCGGGCT CCTCCCATGG CAAGGCCGAA CAGTTCCGCA CCGCGCACGG CGTCGCCCGG GCCCACGCCG ACGCCCTCGA ACTCGTCCGC AGCGACGACG TGGACGTGGT CCACGTGTGC ACCCCCAACC ACCTGCACGC GCCGCTGAGC CTGGCCGCCC TGGCCGCCGG CAAGCACGTC GTGTGCGAGA AACCCCTGGC CACCGACGCC GACACCGCCC GCGAGCTGGT CCGGGCCGCG GAGGAGGCCG ACCGGGTCGC CGTCGTCCCC TTCGCCTACC GCTTCCACCC CATGGCCCGC GAGGCGCGCG CCCGCGTGGC CTCGGGCTCC ATCGGCCGCG TCAGCCTCGC CCACGGCGGC TACCTCCAGG ACTGGCTCCT GTACCCGGAC GAGGACAACT GGCGGGTGGA CCCCGAACTG GGCGGCCCCA CCCGCGCGTT CGGGGACATC GGCTCGCACT GGTGCGACAT GCTGGAGTTC GTCACCGGCG ACCGCATCAC CTCGGTCAGC GCGCAGACCT CGCGGGTCAA CGACACCCGC GCCGGACGCT CGGTGGCCAC CGAGGACCTG GTGGCCTTCC AGTTCTCCAC CGCGGGGGGC GTGGTCGGCG GCGCCGTCAT CAGCCAGGTC TCCCCCGGCC GCAAGAACCG GCTCGTGCTG GAGGTCTCCG GCACCGAGGG CACCCTGCTG TTCGACCAGG AGCGGCCCGA GACCCTCTGG GCGGGCGGCC GGGGCCGCAG CTGCACCATC AGCCGCGACG ACCCCGAGCT GAGCGCCGAC GCCGCGCGCC TGTCCACGAC CCCCGTCGGC CACCCGCAGG GCTACCAGGA CTGCTTCAAC GCGCTCGTGG CCGACACCGG AGCCGCCATC GCCGGGCAGA CCCCCGAGGG CCTGCCGGTG TTCGCCGACG GTTTGCGCGC CGCCGTGCTG GCCGAGGCCG TCCTGACCTC GGCGCGGGAA CGCCGCTGGG TCGACGTGCC GGAGGTGGAC GGGGCATGA
|
Protein sequence | MSVSKVTSLG EAADPGSTAP NAYRAAVIGT GFMGRVHSHA VRAAGGEVVG VAGSSHGKAE QFRTAHGVAR AHADALELVR SDDVDVVHVC TPNHLHAPLS LAALAAGKHV VCEKPLATDA DTARELVRAA EEADRVAVVP FAYRFHPMAR EARARVASGS IGRVSLAHGG YLQDWLLYPD EDNWRVDPEL GGPTRAFGDI GSHWCDMLEF VTGDRITSVS AQTSRVNDTR AGRSVATEDL VAFQFSTAGG VVGGAVISQV SPGRKNRLVL EVSGTEGTLL FDQERPETLW AGGRGRSCTI SRDDPELSAD AARLSTTPVG HPQGYQDCFN ALVADTGAAI AGQTPEGLPV FADGLRAAVL AEAVLTSARE RRWVDVPEVD GA
|
| |