Gene Information |
Locus tag | Ndas_0360 |
Symbol | |
ID | 9244195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 437530 |
End bp | 439488 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | glycoside hydrolase family 18 |
Protein accession | YP_003678314 |
Protein GI | 297559340 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGAGCAC GACTTCGCCA ACGAATCGCG GCACTGGCCG CCGCGGTCGT CCTACCCCTC GCACTGGCAC CCGTCCCGGC CGCGTCGGCC GACACCGCGG GCGTCACCGT CACCTACGTG GAGACCAGTC GCTGGGAGAC CGGCTACGGC GGGCAGCTGA CCATCGCCAA CGGCTCGGGG TCGGCGCTGA CCGACTGGAG CATCGGCTTC CGGCTCCCGT CCGGCACCGC CATCACCAGC CTGTGGAACG CCACCCTCAG CCGCTCCGGC GACGCCTACA CCGTCACCCC GCCCTCGTGG GGCGCCTCCG TCCCGGCGGG CGGCAGCTAC TCCATCGGGT TCAACGGCAC CCACGGCGGC GGCGACACCG CTCCCGTGGA CTGCACGGTC AACGGCGGCG GCTGCTCGGG CGAGCCCGGC GAGGAGGACA CCGAGCCTCC CACCGCGCCG ACCGGCCTGA CCGTCACCGG CACCACCTCG ACCACCGTGG CCCTGCAGTG GGGCCCCGCG GACGACAACG CCGGGGTCGC GGGCTACGAG GTCCTCTCCG GCGGCGAGGT CGTCCGCGCG GTCACCGGCA CCACCGCCAC CGTCACCGGG CTGGCGCCCC AGACCGAGCA CACGTTCACC GTGCGCGCCT ACGACACCTC CAACAACAGG GGTCCCGAGA GCGGCGCCGT CACCGCCACC ACCGACGCGG ACGGCGGCGG CCCCACCGAC CCGCCCCAGG AGCGCCGGGT CGCCTACTTC ACCCAGTGGG GCATCTACGG CCGCGACTAC CTGGTGAACG ACCTGGTCAC CTCGGGCACC GCCGAGAAGC TCACCCACAT CAACTACGCC TTCGGCAACA TCAACGCGAA CGGCGAGTGC TTCATGGCCA ACCAGCTCGG CCAGGGCGAC GCCTGGGCCG ACTACGGCCG CTCCTTCGGG GCCGCCGACA GCGTCGACGG GGTCGGCGAC ACGTGGGACC AGGACCTGCG CGGCAACTTC AACCAGCTGC GCGAGCTCAA GGAGATGTAC CCCGACCTCA AGGTCAACAT CTCCCTGGGC GGCTGGACCT GGTCCGAGCA CTTCTCCGAC GCGGCGCTGA CCGCCGAGTC GCGTGAGCGC ATGGTCTCCT CCTGCATCGA CCAGTTCCTG CGCGGCAACC TGCCCGTGTT CGACGGCGCG GGCGGCCCCG GCTCCGCCTA CGGCGTCTTC GACGGCATCG ACCTGGACTG GGAGTGGCCG GGATCGGCGG GCCACGAGCA CAACACCGTC CGCCCCGAGG ACAAGGAGAA CTTCACCGCC CTGGTGCAGG AGTTCCGCGA CCAGCTGGAC GCCCTGGAGG CCGAGACGAG CCGCCAGTAC GAGCTGACCG CGTTCCTGCC CGCAGACCCG GAGAAGGTCG AGCTCGGCTT CGAGATGCCG CAGCTCATGA CCGACTTCGA CTTCATCACG GTGCAGGGCT ACGACTACCA CGGCGGTTGG GAGACCACCG CCAACCACCA GTCTAACCTG CTCCTGGACC CGGCCGACCC CGGCCCGGAC CTGTACTCCA CCGAGACCAC GGTCCAGGCC TACCTCGACC GCGGCGTCGA CCCCGCCGAC ATGGTGCTCG GCGTGCCGTT CTACGGCCGC GGCTGGACCG GTGTGGAGCC CGGTCCGAAC GGCGACGGTC TCTTCCAGAG CGCTACCGGT CCCGCCCCCG GTAGCTACGA GGCGGGGATC GACGACTGGA AGGTCCTGAA GGACCTGGTG GGCACCGGCG GCTACGAGCT GTACCGCGAC GACGCGCTGG GCACCGCCTG GCTGTACAAC 
GGCAGCACCT TCTGGACCTA CGACGACGAG ATCTCCATGG CCCAGAAGAC CGACTGGGCC CAGGCCCAGG GCCTGGGCGG CGTCATGATC TGGTCCGTTG ACGGCGACGA CGCCAACGGC AGCCTCATGA ACGCCATCGA CACGGCGCTG GCCGGGTAG
|
Protein sequence | MRARLRQRIA ALAAAVVLPL ALAPVPAASA DTAGVTVTYV ETSRWETGYG GQLTIANGSG SALTDWSIGF RLPSGTAITS LWNATLSRSG DAYTVTPPSW GASVPAGGSY SIGFNGTHGG GDTAPVDCTV NGGGCSGEPG EEDTEPPTAP TGLTVTGTTS TTVALQWGPA DDNAGVAGYE VLSGGEVVRA VTGTTATVTG LAPQTEHTFT VRAYDTSNNR GPESGAVTAT TDADGGGPTD PPQERRVAYF TQWGIYGRDY LVNDLVTSGT AEKLTHINYA FGNINANGEC FMANQLGQGD AWADYGRSFG AADSVDGVGD TWDQDLRGNF NQLRELKEMY PDLKVNISLG GWTWSEHFSD AALTAESRER MVSSCIDQFL RGNLPVFDGA GGPGSAYGVF DGIDLDWEWP GSAGHEHNTV RPEDKENFTA LVQEFRDQLD ALEAETSRQY ELTAFLPADP EKVELGFEMP QLMTDFDFIT VQGYDYHGGW ETTANHQSNL LLDPADPGPD LYSTETTVQA YLDRGVDPAD MVLGVPFYGR GWTGVEPGPN GDGLFQSATG PAPGSYEAGI DDWKVLKDLV GTGGYELYRD DALGTAWLYN GSTFWTYDDE ISMAQKTDWA QAQGLGGVMI WSVDGDDANG SLMNAIDTAL AG
|
| |
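The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAG, and the protein sequence has no stop symbol). The function name `check_cds_lengths` and its result keys are hypothetical, chosen only for illustration.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported gene and protein lengths.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons, remainder = divmod(span, 3)   # a complete CDS is a whole number of codons
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # one codon is the stop codon and contributes no amino acid
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information fields of this record.
result = check_cds_lengths(437530, 439488, 1959, 652)
print(result)
```

With the values from this record, the span is 439488 - 437530 + 1 = 1959 bp, which is 653 codons: 652 amino acids plus one stop codon. The gene sequence begins with GTG while the protein begins with M, consistent with GTG serving as an alternative initiation codon under translation table 11.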