Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0923 |
Symbol | |
ID | 9244768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1129337 |
End bp | 1131577 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | cellulose-binding family II |
Protein accession | YP_003678873 |
Protein GI | 297559899 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0984309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCCA ACCCCCCTGC CGCCCGTCAT CGGACACGGC GCGGCGCCTG CCTCGCGACC GCACTCGCCA CGGGCGCCGC GCTCGCCGCC GTGGCGACCG GCCCCGCCTC CGCCGCCGCC GGCTGCGAGG TCGCCTACAC CGTCCAGAAC CAGTGGAACG ACGGGTTCAC CGCCGCCGTG TCCGTCACCA ACCTCGGTGA TCCCGTCGAC GGCTGGACCC TGGAGTGGGA CTTCACCGCG GGCGAGCGCG TCACAAGCGG CTGGAACGGC GAGTTCTCCG GCTCGGGGAA CCGTGTCAGC GTGGCCGACA CCGGGTGGAA CGCCGCCATC GCCACCGGAG CCTCGGTCAG CTTCGGCTTC AACGGCTCCC ACGGCGGCAC CCCGGAAGTC CCCGACTCCT TCACGCTCAA CGGCACCGTC TGCACCGGCG GCACCGAGGA ACCGCCCGGT GAGGAGCCGC CCGACCCCGA GGACCCGCCC ACCACCGGAG AGGGGCGCCA GGTCGAGGAG CTGGACCGCG GTCTGGTGAG CGTGCAGGGC TCGCAGGGCA ACCTGGTGAG CTGGCGGCTG CTGGCCACCG ACCCCGAGGA CGTCGCCTTC AACGTCTACC GGGGCTCCAC ACGCCTGAAC TCACAGCCCC TCACCGGAGC CACCTCCTAC CTGGACGGCG GAGCCTCCGC CGACGCCCGC TACACGGTCC GCCCCGTGAT CGACGGCCAG GAGGCGGAGG CCTCGGGGGC CTCACTGGCG CTGCGCGCCG GCTACCTGGA CGTGCCCCTG GACGTCCCGT CCGGCGGCAG CGGCTACACC TACGACGCCA ACGACGCCAG CGTGGGCGAC CTCGACGGCG ACGGCGAGTA CGAGATCGTC CTGAAGTGGG AGCCCACCAA CGCCAGGGAC AACTCCCAGT CGGGCCGCAC CGGCCCCGTA CTGCTCGACG CCTACGAACT CAGCGGTGAG CGGCTCTGGC GCATCGACCT GGGACGCAAC ATCCGCGCCG GTGCGCACTA CACCCAGTTC CAGGTCTACG ACTTCGACGG AGACGGGCGG GCCGAGGTGG CCGTGAAGAC GGCGGACGGC ACCGTGGACG GCACCGGCGG GGTCATCGGC GACGCCGGGG CCGACCACCG CAACGGCGAC GGCTACGTCC TGAGCGGGCC GGAGTTCCTC ACCATGTTCG ACGGCCGCAG CGGGGCCGAA CTGGACTCGG TGGACTACGT TCCCGGGCGC GGTGACGTCT GCGACTGGGG CGACTGCTAC GGCAACCGGG TGGACCGGTT CCTGGCGGGC GTCGCCTACC TGGACGGCCA GCGGCCCAGC CTGGTCATGG CGCGCGGCTA CTACACGCGC TCGGTGATCG CGGCCTGGGA CTGGCGCGAC GGGCGCTTCA CCCGGCGCTG GACCTTCGAC AGCGACGAGG CCGGGTCGCG GTGGGCCGGT CAGGGCAACC ACCAGTTGAG CATCGCCGAC GCCGACGGTG ACGGCCGCGA CGAGGTCATG TACGGGTCGA TGGCCGTGGA CGACGACGGA AGCGGGATGT GGGCCACCGG CCACGGCCAC GGGGACGCCA TGCACGTGGG CGACCTGCTG CCCGACCGCT CCGGCCTGGA GGTCTTCACC ATCACCGAGC GCACGGGAGG CGCGGCGGGC GCCTACGTGG CCGACGCCGA CACCGGCGGG GTGGTCTGGC GCAGGCCGAC CGCCAGCGGT GAGGAGGGGC CGGGCCGGGG CGTGGCCGCG GACGTGTGGC CGGGCAACCC CGGCGCGGAG TTCTGGGTCA CGGGCGGCGG TATCAGCGGC ATGTACGACG CCGGGGGGAA CCGCCTGGGC GTTCGCGCCC CCTCGTCGGC GAACTTCGTG GCCTGGTGGG ACGGCGACCC GCTGCGGGAA CTGCTGGACG GCACCCGTAT CGACGAGTAC GGGACCGGTG GCGACACCCG GCTGCTGACC GGCTCCGGGG TCGCCTCCAA CAACGGCACC AAGTCCACAC CGGCGCTGAG CGGGGACATC CTCGGTGACT GGCGTGAGGA GGTCGTCTGG CGCACCGCGG ACAACCGGGC GCTGCGGATC TACTCCACGC CGCTGCCCAC CGATCTGCGG ATGCCGACGC TGATGCACGA CACCCAGTAC CGGGTGGCCG TCGCCTGGCA GAACACCGCC TACAACCAGC CGCCGCACCC GAGCTTCTTC ATCGGGGACG GCATGGAGCC CGCTCCCCGG CCGGAGGTGT ACACGCCCTG A
|
Protein sequence | MPPNPPAARH RTRRGACLAT ALATGAALAA VATGPASAAA GCEVAYTVQN QWNDGFTAAV SVTNLGDPVD GWTLEWDFTA GERVTSGWNG EFSGSGNRVS VADTGWNAAI ATGASVSFGF NGSHGGTPEV PDSFTLNGTV CTGGTEEPPG EEPPDPEDPP TTGEGRQVEE LDRGLVSVQG SQGNLVSWRL LATDPEDVAF NVYRGSTRLN SQPLTGATSY LDGGASADAR YTVRPVIDGQ EAEASGASLA LRAGYLDVPL DVPSGGSGYT YDANDASVGD LDGDGEYEIV LKWEPTNARD NSQSGRTGPV LLDAYELSGE RLWRIDLGRN IRAGAHYTQF QVYDFDGDGR AEVAVKTADG TVDGTGGVIG DAGADHRNGD GYVLSGPEFL TMFDGRSGAE LDSVDYVPGR GDVCDWGDCY GNRVDRFLAG VAYLDGQRPS LVMARGYYTR SVIAAWDWRD GRFTRRWTFD SDEAGSRWAG QGNHQLSIAD ADGDGRDEVM YGSMAVDDDG SGMWATGHGH GDAMHVGDLL PDRSGLEVFT ITERTGGAAG AYVADADTGG VVWRRPTASG EEGPGRGVAA DVWPGNPGAE FWVTGGGISG MYDAGGNRLG VRAPSSANFV AWWDGDPLRE LLDGTRIDEY GTGGDTRLLT GSGVASNNGT KSTPALSGDI LGDWREEVVW RTADNRALRI YSTPLPTDLR MPTLMHDTQY RVAVAWQNTA YNQPPHPSFF IGDGMEPAPR PEVYTP
|
| |