Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3770 |
Symbol | |
ID | 9247639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4530116 |
End bp | 4531378 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | glycoside hydrolase family 18 |
Protein accession | YP_003681674 |
Protein GI | 297562700 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAC ACCGCCTCAG GAAGGCCCCC AGGGCCGCGA AGGCCGCGCT GACCGGCGCC CTCGCCCTCC TGACCGCCGG ACTCCTCGCC GGTTTCAGCC AGCTCTCCCC CGCTCCCGCC GAGGTCGAGA CCGTGTCCGA CAGCGACTCA GCCCAGGTCC AGCAGACCGG CCAGTGGCTC ACCGGGTACT GGCACAACTT CAACAACGGC TCCACCGTGA TGCCGCTGTC GGAGATCCCC TCCGAGTACA ACCTGGTGGC CGTCGCCTTC GCCGACAACC ACCCCACCCT CGACGGCGGA ATCACGTTCA ACCTGGCCAG CGGCGAGCTG GGCGGCTACA CCGACCAGCA GTTCCGCGAC GACATCGCGG CCATCCAGGC CGAGGGCCGC AAGGTCATCA TCTCCGTGGG CGGCGAGCGC GGCCACGTGG ACGTCACCAA CGCCACCCAG GCGGGCAACT TCGCCGACAC CGTGTACCAG CTGATGCAGG ACTACGGCTT CGACGGGGTC GACATCGACC TGGAGCACGG CATCAACGCG CAGTACATGT CCCAGGCCCT GCACGACCTG AGCGGCCGGG CCGGGTCGGA CCTGATCATC ACGATGGCGC CGCAGACCAT CGACTTCCAG AGCCCCGACC GGGAGTACTA CAAGCTGGCG TCGGACATCT CCGACATCCT CACCATCGTC AACATGCAGT ACTACAACTC CGGCTCGATG CTGGGGTGCG ACGACCAGGT GTACCACCAG GGCACCCCGG ATTTCGTCGC CGCGCTGGCC TGCATCCAGC TGGAGATGGG GCTGAGCCCG GACCAGGTGG GCCTCGGGCT GCCCGCCGTG CAGTCCGCCG CCGGGGGCGG GTACATGGCG CCCGGCCAGG TCGTGCGCGC CCTGGACTGC CTGGAGGCCG GGACCGACTG CGGCTCCTTC TCCCCCGCCG CCCCGTACGG CCCCATCGGC GGTGTGATGA CCTGGTCCAT CAACTGGGAC GCCACCAGCG GCTACGCCTT CGCCGAGACC ATCTCGGCGG GCCTGGCCAC CGGCCCCGGC AACGGCGGCG GGCCCACCGA GGAGCCGACC GAGGAGCCCA CCGAGCAGCC CGGCGACTGC ACCGCCGCGG CCTGGACGGC CGACAGCGTC TACACCGGCG GCGACGTGGT CTCCCACGAG GGCTCGGAGT ACCGCGCCCA GTGGTGGACC CGCGGCGAGG AGCCGGGCAC CACCGGCGAG TGGGGCGTCT GGAGGCTCGT CGGCACCTGC TGA
|
Protein sequence | MPEHRLRKAP RAAKAALTGA LALLTAGLLA GFSQLSPAPA EVETVSDSDS AQVQQTGQWL TGYWHNFNNG STVMPLSEIP SEYNLVAVAF ADNHPTLDGG ITFNLASGEL GGYTDQQFRD DIAAIQAEGR KVIISVGGER GHVDVTNATQ AGNFADTVYQ LMQDYGFDGV DIDLEHGINA QYMSQALHDL SGRAGSDLII TMAPQTIDFQ SPDREYYKLA SDISDILTIV NMQYYNSGSM LGCDDQVYHQ GTPDFVAALA CIQLEMGLSP DQVGLGLPAV QSAAGGGYMA PGQVVRALDC LEAGTDCGSF SPAAPYGPIG GVMTWSINWD ATSGYAFAET ISAGLATGPG NGGGPTEEPT EEPTEQPGDC TAAAWTADSV YTGGDVVSHE GSEYRAQWWT RGEEPGTTGE WGVWRLVGTC
|
| |