Gene Information |
Locus tag | Ndas_0376 |
Symbol | |
ID | 9244211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 458162 |
End bp | 459886 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003678330 |
Protein GI | 297559356 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
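The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 458162
END_BP = 459886
GENE_LENGTH_BP = 1725
PROTEIN_LENGTH_AA = 574


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed Gene Length and Protein Length agree with the coordinates.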
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTGA CGCAGACCAC GACGGAGGAC CGGCTCAGAG GCCTCCGCGC ATGGCAGCGG GAGGCGTTCG AGGAGTACTT CCGCCGGGAA CCGCGGGACT TCCTCGCCGT GGCCACCCCC GGTGCGGGCA AGACCACCTT CGCCCTCACC CTGGCCAGCG AACTCCTCCA ACGCCACACG GTGCGCGCCA TCACCATCGT GTGCCCCACC GACCACCTCA AGAAGCAGTG GGCCGAGGCG GCCGCCCGGT TCGGCATCGC CATCGACCCC GAGTTCCGCA ACGGCCAGGG CGCCCTGGGC CGCCAGTACG TCGGCGTGGC CGTCACCTAC GCCCAGGTCG CCGCCCACCC GATGCTGCAC CGCAACCGGA CCGAGGCGCG CAAGACCCTC GTCATCTTCG ACGAGGTCCA CCACGCCGGG GACGCCCTGT CCTGGGGCGA CGCGGCCCGC GAGGCCTTCG ACCCGGCGGC GCGCCGCCTC TCCCTGACCG GGACCCCCTT CCGGTCCGAC ATCAACCCCA TCCCCTTCGT CGACTACGTC CAGGACAGCG CCGGGGTGCG CCGCTGCTCC TGGGACTACA GCTACGGGTA CGGGCCCGCC CTGGCCGACG GGGTCGTGCG CCCCGTCATC TTCATGGCCT ACTCCGGCGA GATGCGCTGG CGCACCCGCG CGGGCGACGA GCTCGCCGCC AGGCTGGGCG AACCCCTCAC CCAGGACGCG CTCTCCCAGG CGTGGCGGGC CGCGCTGGAC CCCAAGGGCG ACTGGATCAA GCGCGTACTC CAGGCCGCCG ACCGCCGCCT GACCGAGGTC CGCAAGACCC ACCCCGACGC GGGGGCCCTG GTCATCGCCA GCGACCACGA GAACGCCCGC GCCTACTCGC GCATCCTGCG CCAGATCACC GGCAAGGGCG CCACGGTCAT CCTGTCCGAC GACCCCGGGG CCTCCAAGAA GATCTCCCGG TTCGCCGCGG GCGACGACCG CTGGATGGTC GCGGTGCGCA TGGTCTCCGA GGGGGTGGAC GTGCCCCGGC TGATGGTGGG CGTGTACGCC ACCTCCACCA GCACCGCGCT GTTCTTCGCC CAGGCCATCG GCCGCTTCGT GCGCGTGCGC CAGCGCGGCG AGGTCGCCTC GGTCTTCCTG CCCTCCGTGC CCACCCTGCT GGAGTACGCG GGCGAGATGG AGCGCGAGCG CGACCACGTG CTCGACCGGA CCCCTGGGGA GGGCGACGAG TACCCGGAGG AGGACCTGCT CCGGGAGGCC AACAAGAAGC GGGACACCCC CGACGCCGGG GAGGAACTGC CCTTCGAGAC CATGGAGTCG GCGGCGGAGT TCGACCGCGC CCTCTACGAC GGGGCCGAGT ACGGCGGCGT GCCGGGCTCC ACGGAGGAGG AGGACTTCCT GGGCCTGCCC GGTCTGCTCG ACCCGCAGCA GGTCTCCCAG CTCCTGCGCA AGCGCAAGGC CGACCTCAAG GCGAGCGAGG TCAAGGCCCG CAAGGTGGAG GAGCCCGCCG AGGAGGACGG GCCCACCCAC CAGGTCCTGG CCGACCTGCG CCGCGAGCTG AGCGGTCTCG TGGGCGCCTG GCACCACCGC ACCGGCAAGC CGCACGGAGT GATCCACAAC GAGCTGCGCC GCGCCTGCGG CGGGCCGCCC GTCGCACAGG CCACCCCGAC GCAGATCCGC GAACGGATCG CCAAGATCCG CGTCTGGGCC GTCGGCGGGC GATAG
|
Protein sequence | MTVTQTTTED RLRGLRAWQR EAFEEYFRRE PRDFLAVATP GAGKTTFALT LASELLQRHT VRAITIVCPT DHLKKQWAEA AARFGIAIDP EFRNGQGALG RQYVGVAVTY AQVAAHPMLH RNRTEARKTL VIFDEVHHAG DALSWGDAAR EAFDPAARRL SLTGTPFRSD INPIPFVDYV QDSAGVRRCS WDYSYGYGPA LADGVVRPVI FMAYSGEMRW RTRAGDELAA RLGEPLTQDA LSQAWRAALD PKGDWIKRVL QAADRRLTEV RKTHPDAGAL VIASDHENAR AYSRILRQIT GKGATVILSD DPGASKKISR FAAGDDRWMV AVRMVSEGVD VPRLMVGVYA TSTSTALFFA QAIGRFVRVR QRGEVASVFL PSVPTLLEYA GEMERERDHV LDRTPGEGDE YPEEDLLREA NKKRDTPDAG EELPFETMES AAEFDRALYD GAEYGGVPGS TEEEDFLGLP GLLDPQQVSQ LLRKRKADLK ASEVKARKVE EPAEEDGPTH QVLADLRREL SGLVGAWHHR TGKPHGVIHN ELRRACGGPP VAQATPTQIR ERIAKIRVWA VGGR
|
| |
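The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they might be processed follows: it strips the spacing, computes a GC fraction, and translates codons using the amino acid assignments of NCBI translation table 11. The helper names are hypothetical, and the sketch does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here). Only the first three blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in TCAG
# codon order (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence and first block of the protein
# sequence, copied from the record above.
GENE_PREFIX = "ATGACGGTGA CGCAGACCAC GACGGAGGAC"
PROTEIN_PREFIX = "MTVTQTTTED"

if __name__ == "__main__":
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
    print(f"GC fraction of prefix: {gc_fraction(GENE_PREFIX):.2f}")
```

Passing the full Gene sequence field to `gc_fraction` and `translate` would allow the listed GC content and Protein sequence to be checked in the same way.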