Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1406 |
Symbol | |
ID | 9245256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1722970 |
End bp | 1724775 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | glycoside hydrolase 15-related protein |
Protein accession | YP_003679344 |
Protein GI | 297560370 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.334429 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGTGA GCCGCGTGCC AGGCTGGATC GAGGACTACG CAATGATCGG CGACATGCAG ACCGCCGCGC TGGTCGGGCG CGACGGGTCC ATCGACTGGG CGTGCCTTCC CGACTTCGAC TCCTCGGCCT GTTTCGCCGC GCTGCTGGGC GACGAGCAGA ACGGCTGCTG GACCCTGCGC CCCGCCGAGG GCGAACCGCG CGCCACCCGC CGCCGCTACC GGGGCGACAC GCTCATCCTG GAGTCGGAGT GGGACACCCC CTCCGGGTCG GTCCGGGTCA TCGACTTCAT GCCGCCGCGC GGCGGCGCCC CGCACATCGT GCGCATCGTC GAGGGGCTGA GCGGCTCCGT GCGCATGGAG ACGACCATGC GCATCCGCTT CGACTACGGC CACGTCGTGC CGTGGGTGCA CCGGACCGGG GCCGAACTGG TGGCCATCGC CGGACCCGAC GCCATCTGGC TCAGCACCCC CATCTCCCTC CAGGGCCACA ACTTCACCCA CGACGCCACC TTCACCGTCA CCGCGGGCCA GCGGGTGCCC TTCGTGATGA CCTGGCACCC CTCCCAGGTG GAGGAGTCCG ACCACCTGGA CGCGGAGAAG GCGCTCTCGC GCACCGAGCG CTTCTGGGAG AAGTGGGTCA ACCAGTGCAC CTACGAGGGC CCCTACCGCG AGGCGGTGAT CCGCTCCCTC ATCGTGCTCA AGGCCCTGAC CTACCGCCCC ACCGGCGGGA TCGTCGCCGC CCCCACCACC TCCCTGCCCG AGGAGATCGG CGGGGTGCGC AACTGGGACT ACCGCTACTG CTGGCTGCGC GACGCCACCA TCACGCTGGA GGCGATGATC CGCTCCGGCT ACAAGGACGA GGCGCTGGCC TGGCGCGAGT GGCTGGTGCG GGCGATCGCG GGCGAACCCC AGCTCATGCA GATCATGTAC GGCATCCGGG GCGAGCGCAG ACTCACCGAG TGGGAGGCCG AGTGGCTGCC GGGCTACGAG GCCTCCCGTC CGGTCCGGAT CGGCAACGCC GCCGTGGGCC AGTACCAGCT CGACGTCTAC GGCGAGGTCA TGGACGTGCT GCACCTGGCC CGCCGCCACA ACATCCGCGG CGGCGACTAC CTGTGGGGCC TCCAGCGCTC GCTGGTCAAC TACCTGGAGT GGTGCTGGGA CGAGCCGGAC GAGGGCCTGT GGGAGGTGCG CGGGCCCCGC CAGCACTTCG TGCACTCCAA GGTGATGGCC TGGGTGGCGG CCGACCGCGC GGTGCGCAGC ATCGAGGAGT TCGGCAAGGA GGGGCCCATC GAACGCTGGA GGGCCCTGCG CGACACCATC CACGCCGAGG TGTGCGAGTA CGGCTACGAC CCCCAGCGCA ACACGTTCAC CCAGTACTAC GGCAGCAAGG AGCTGGACGC GGCGCTCCTG CTGATCCCCG AGGTGGGTTT CCTGCCCTAC GACGACCCGC GCGTGGTCGG CACCATCGAG GCGGTGCGCA AGGACCTGAT GGTGGACGGG TTCGTGCTGC GCTACCGCAC CGACCTGGAC GACTCCGCCG ACCAGCTGCC CGGCAACGAG GGCGCGTTCC TGGCGTGCAG CTTCTGGATG GCCAACGCGC TGCTGTCGAT CGGCCGCCAG GACGAGGCCC GCGAGCTGTT CGAGCGGCTG CTGTCCCTGC GCAACGACGT GGGCCTGCTG GCCGAGGAGT GGGACCCGCG CGAGAACCGC CAGGTCGGCA ACTTCCCCCA GGCGTTCAGC CACGTGCCGC TGGTGACCAC CGCGCTCAAC CTGTCCACCC GCCAGGGGGG ATGGCGCGCC GAGTAG
|
Protein sequence | MGVSRVPGWI EDYAMIGDMQ TAALVGRDGS IDWACLPDFD SSACFAALLG DEQNGCWTLR PAEGEPRATR RRYRGDTLIL ESEWDTPSGS VRVIDFMPPR GGAPHIVRIV EGLSGSVRME TTMRIRFDYG HVVPWVHRTG AELVAIAGPD AIWLSTPISL QGHNFTHDAT FTVTAGQRVP FVMTWHPSQV EESDHLDAEK ALSRTERFWE KWVNQCTYEG PYREAVIRSL IVLKALTYRP TGGIVAAPTT SLPEEIGGVR NWDYRYCWLR DATITLEAMI RSGYKDEALA WREWLVRAIA GEPQLMQIMY GIRGERRLTE WEAEWLPGYE ASRPVRIGNA AVGQYQLDVY GEVMDVLHLA RRHNIRGGDY LWGLQRSLVN YLEWCWDEPD EGLWEVRGPR QHFVHSKVMA WVAADRAVRS IEEFGKEGPI ERWRALRDTI HAEVCEYGYD PQRNTFTQYY GSKELDAALL LIPEVGFLPY DDPRVVGTIE AVRKDLMVDG FVLRYRTDLD DSADQLPGNE GAFLACSFWM ANALLSIGRQ DEARELFERL LSLRNDVGLL AEEWDPRENR QVGNFPQAFS HVPLVTTALN LSTRQGGWRA E
|
| |