Gene Information |
Locus tag | Ndas_0361 |
Symbol | |
ID | 9244196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 439912 |
End bp | 441591 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | glycoside hydrolase family 18 |
Protein accession | YP_003678315 |
Protein GI | 297559341 |
COG category | |
COG ID | |
TIGRFAM ID | |
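The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 439912, End bp 441591
length_bp = gene_length(439912, 441591)
print(length_bp)                  # 1680, matching "Gene Length"
print(protein_length(length_bp))  # 559, matching "Protein Length"
```

Under these assumptions the reported 1680 bp and 559 aa agree with the coordinates. The strand field does not affect the length calculation.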
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGAGCAC GACGTTTCAG TACCCGCCTC GCGGCCGCCG CGGCCGCCGT GCTGCTCGCC CCGCTCGCGG TGACCGCGGT CCCCGCCGCG GCCGCCGAGG AGACCGGCCC CGCCGACCTC ACGGTGACCT ACACGGAGTC GGCCCGGTGG AACACCGGCT ACACCGGCCA GATCACCGTC CACAACGCCT CGGACACCCC CGTCGAGGAC TGGGCCGTCG AGTTCTCGCT CCCCGACGGC AGCCGCATCC ACGAGCTGTG GAACGCCACC CTCGCGGGCA CCCCCGGCGG CTACACGATC ACCCCGCCGC ACTGGGGCGC CAGCGTTCCG GCGGGCGGCA GCTACTCCTT CGGCTTCAAC GGGCTGCACT CCGACGGCGC CACCGACCTC GTGGACTGCC TGGTCAACGG CTCCCCCTGC GCCGGGGACG ACGGGAGCGG CCCCTCCCAC GACAGCTACA GCGTCGCCTA CTACACCCAG TGGAGCGCCG GGGAGCGCGA CTACCACGTC CGCGACCTGG TCGACAGCGA CAGCGCGGAG AAGCTCACGC ACATCAACTA CGCGTTCGGC AACGTCACCG CCGAGGGCGT GTGCGCCACG GGTGACGGGA CCAGCCAGGA GGGCGCCGTC GCCGACTACG CGCACCCCGT GCCCGCCGCG GAGAGCGTCG ACGGGACCGC CGACGACCCC GACCAGGAGC TGCGCGGCAA CTTCAACCAG CTGCGCGAGC TCAAGGAGAT GTACCCCGAC CTCAAGGTCA ACATCTCCCT CGGCGGCTGG GAGTGGTCCC GGTACTTCTC CGACGCGGCC CTGACCGAGG AGTCCCGCGA GAGGCTGGTC AGCTCCTGCG TCGACCTGTA CCTGCGGGGC AACCTGCCCG AGGTCGACGG CGCGGGCGGC GAGGGCGCCG CCTACGGGGT CTTCGACGGC ATCGACCTGG ACTGGGAGTG GCCCGGCTCC GACGGCCACC CGCACAACAC CGTCCGCCCC GAGGACAAGG AGAACTTCAC CGCCCTGGTG CAGGAGTTCC GCGACCAGCT GGACGCGTTC GGCGAGGAGA CCGACCGCCA CTACGAGCTG ACCGCGTTCA TGCCCGCCGG AGGCTGGCGC CTGGACGCCG GGTACGAGCT GGACGAGCTC ATGACCGACT TCGACTTCGT CACGGTGCAG GGCTACGACT ACCACGGCAC CTGGGAGAGC ACGACCAACC ACCAGTCCAA CCTGGTCGTG GACGCCCGCG ACCCCCAGCC GGTGATCTCC ACCGAGCTCA TCGTCCAGGC CTACCTGGAC CGGGGCGTGG ACCCCGGCAA GCTCGTCCTG GGCGTGCCCT TCTACGGACA GGGGTGGACG GGCGTGGAAC CCGGCCCCGA CGGTGACGGC CTCTTCCAGA CCGCGACGGG CCCCGCGCAG GGGGCGTACG CGGCGGGCAC CGAGGACTGG AAGGTCCTGG AGGAGAAGGT CGAGTCCGGT GAGTTCGAGG TCTTCCGCAA CGACGGGGCC GGTACCGCCT GGATCTACGA CGGCCAGACG CTCTGGAACT ACGACGACGA GACCGCCATG ACGCAGAAGA CCGACTGGGC CCGGGACAAG GGCCTCGGCG GGGTCATGAT CTGGTCCATC GACGGCGACG ACGCCGAGGG CAGCCTCATG GCCGCCATCG ACACGGCGCT GAGCGACTAG
|
Protein sequence | MRARRFSTRL AAAAAAVLLA PLAVTAVPAA AAEETGPADL TVTYTESARW NTGYTGQITV HNASDTPVED WAVEFSLPDG SRIHELWNAT LAGTPGGYTI TPPHWGASVP AGGSYSFGFN GLHSDGATDL VDCLVNGSPC AGDDGSGPSH DSYSVAYYTQ WSAGERDYHV RDLVDSDSAE KLTHINYAFG NVTAEGVCAT GDGTSQEGAV ADYAHPVPAA ESVDGTADDP DQELRGNFNQ LRELKEMYPD LKVNISLGGW EWSRYFSDAA LTEESRERLV SSCVDLYLRG NLPEVDGAGG EGAAYGVFDG IDLDWEWPGS DGHPHNTVRP EDKENFTALV QEFRDQLDAF GEETDRHYEL TAFMPAGGWR LDAGYELDEL MTDFDFVTVQ GYDYHGTWES TTNHQSNLVV DARDPQPVIS TELIVQAYLD RGVDPGKLVL GVPFYGQGWT GVEPGPDGDG LFQTATGPAQ GAYAAGTEDW KVLEEKVESG EFEVFRNDGA GTAWIYDGQT LWNYDDETAM TQKTDWARDK GLGGVMIWSI DGDDAEGSLM AAIDTALSD
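The gene sequence is shown in coding orientation (it begins with GTG and ends with TAG), so it can be translated directly without reverse-complementing. The sketch below is a minimal illustration, not the database's own method. It assumes the NCBI translation table 11 amino-acid assignments and start codons as written in the constants, strips the spaces between 10-base blocks, and reads a recognised start codon at position 0 as M. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (table 11 shares these with the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence up to the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record
print(translate("GTGAGAGCAC GACGTTTCAG TACCCGCCTC"))  # MRARRFSTRL
```

The ten residues produced from the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence to gc_fraction gives a value that can be compared with the reported GC content.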