Gene Information |
Locus tag | Ndas_1109 |
Symbol | |
ID | 9244956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1358972 |
End bp | 1361338 |
Gene Length | 2367 bp |
Protein Length | 788 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | glycoside hydrolase family 65 central catalytic |
Protein accession | YP_003679056 |
Protein GI | 297560082 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
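The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1358972, 1361338)
print(length, protein_length(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (2367 bp) and Protein Length (788 aa).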
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.824966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.328257 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCT GGACGCTGGT CTACGAGGGG CGGGACCCGG ATGGCCAGGG CGTACGCGAG ACCCTGTGCA CCTTGGGGAA CGGTTACTTC GCCACGCGCG GCGCGCCGCC CGAGGCCCGT GACGACGGAG TCCACTACCC GGGCACCTAC GTCGCCGGCT GCTACGACCG CGCCGTCTCC GAGGTCGACG GGCACCGGGT GGAGAACGAG GACCTGGTCA ACGCCCCCAA CTGGCTCCCC CTGACCTTCC GCGTCGGTGA CGGCGACTGG TTCGAACGAC CGGACCCGGC ACTCCCGCAG CGCACCGAGC TGGACATGCG CCGAGGAGTG CTGACGCGGA CCTTCCACGT GGTGGACGAC GGCAGGAGGA CACGGGTGGC CCAACGGCGC CTGGTGTCGA TGGACGCCCC GCATCTGGCC GCCCTGGAGA CCACCTTGGT GCCCGAGGGT TGGAGCGGGA CCGCGGTCGT CCGGTCCGCC CTGGACGGGA GGGTGGCCAA CCGCGGCGTG GCGCGCTACC GCGACCTGAA CGGACGTCAC CTCCACCCGC TGGACACCGG CTCCGACGGC CCCGGCCTCG ACTGGCTGCG CTGCCGGACC CTGTCCTCGG GCGTCGAGGT CGCGCTCGCC TCCCGGACCC TGGTCTCCCA GGGACCCCGG CCCGCGGCCC GTGAAAGCCC CGCGGGCGAC GGCTGGGCGG CCACCGACCT GATCCTGGAC CTGAGGAGCG GTGAACAGAC CACCGTGGAG AAGACGGTGG CCCTGTACAC CTCGCGCGAC CGCGCCGTCG GCGACATCCT CGACGCGGCC CGCGACGCCC TGGAACGGGC GGGCGGATTC GACGAACTGC TGCGCCGACA CACCACCGCC TGGCACCACC TGTGGCGGTC CTGCGCACTG GAGGCCGGGG ACGAGGAGGA ACAGCGGGTC CTCAACCTGC ACCTCTTCCA CCTCCTGCAG ACGCTGTCGC CCCACACGGC CGACCTCGAC GCGGGTGTGC CCGCGAGGGG CCTGCACGGT GAGGCCTACC GCGGCCACGT CTTCTGGGAC GAGCTGTTCG TCCTTCCCTT CCTCAACCTC CACTTCCCCG AGACCGCGCG AGCCCTGCTG CGCTACCGGT GGCGCAGACT GCCCCAGGCG CGGGCCATCG CCCGCGCCGC GGGGCTGAGG GGCGCTCTCT TCCCCTGGCA GAGCGGGAGC GACGGCAGCG AGGAGTCCCA GAGCACACAC CTCAACCCCC GCTCGGGGAG GTGGATCCCC GACCACTCGC ACCTGCAGCG CCATGTCGGG CTCGCGGTCG CCTACAACGT CTGGCAGCAC CACCAGGCCA CCGGCGACAC CGCCTTCCTG ACCGGGTTCG GCGCGGAACT GCTGTTGGAG GTCGCCCGCG CCTTCGCGGA CATGGCCGTC TACGACAAGG CTTTGGACCG CTACGTGATC CGCGGTGTGA TGGGCCCCGA CGAGTACCAC GACGGCTACC CCGGCCGCGA GGACCCCGGT CTGGACGACA ACGCCTACAC CAACCTCATG GCGGTGTGGG TCATGCTGCG GGCGCTGGAC ACCCTGCGGG CGCTGCCGGG ACCGAGCCGC AGGGACCTGG AGGAGTCCCT CGGGCTGGAC GCGGACGAGG TCGAGCGGTT CGAGACCCTC ACCCGCAAGA TGCGCGTCCC CTTCCACGAG GGAGTCATCA GCCAGTTCGC CGGTTACGGG GACCTGGAGG AGCTCGACTG GCGGGACTGC CGCGGCGTCC GGCGCCTGGA CCGCTTCCTG GAGGCCGGGG GCGACAGCTG CAACCGCTAC 
AAGGCGTCCA AGCAGGCCGA CGTGCTGATG CTGTTCTTCC TGCTACCCGC CGAGGAGATC GCCGACATGC TGCGCCGCCT CGGCTACACC TACGATCCCG GGCTCATCCC CCGCACGGTC GACTACTACC TTGCACGCAC CTCGCACGGG TCGACCCTCA GCTCCGTGGT GCACTCCTGG GTGCTGGCCC GGACCAACCG GGAGGAGTCC TGGGACTTCT TCCGCAGGGC GCTGAGCACC GACGTCGACG ACGTCCAGGG CGGGACCACG GCCGAGGGCA TCCACCTGGG GGCCATGGCG GGCACCGTCG ACCTCCTCAC CCGGTGCTAC ACCGGCCTGA CCACACGCGG TGGAGCCCTG CACCTGAGCC CCCTGCTGCC CGCCGAACTG GACCACCTCT CCTACGGACT GCGCTACCAC GACCACTGGG AGGTGGGCGT GGACGTGCGC CGCGACCACG TGCGGGTCAC CCTGCCACCC TCGGCCGGGC CGCCCGTCCG GGTCCGCGTC AAGGAACGCC ACGCCCTGGT CGCCCCCGGC TCGTCCTGTG TCCTACCCCT GTGGTGA
|
Protein sequence | MSRWTLVYEG RDPDGQGVRE TLCTLGNGYF ATRGAPPEAR DDGVHYPGTY VAGCYDRAVS EVDGHRVENE DLVNAPNWLP LTFRVGDGDW FERPDPALPQ RTELDMRRGV LTRTFHVVDD GRRTRVAQRR LVSMDAPHLA ALETTLVPEG WSGTAVVRSA LDGRVANRGV ARYRDLNGRH LHPLDTGSDG PGLDWLRCRT LSSGVEVALA SRTLVSQGPR PAARESPAGD GWAATDLILD LRSGEQTTVE KTVALYTSRD RAVGDILDAA RDALERAGGF DELLRRHTTA WHHLWRSCAL EAGDEEEQRV LNLHLFHLLQ TLSPHTADLD AGVPARGLHG EAYRGHVFWD ELFVLPFLNL HFPETARALL RYRWRRLPQA RAIARAAGLR GALFPWQSGS DGSEESQSTH LNPRSGRWIP DHSHLQRHVG LAVAYNVWQH HQATGDTAFL TGFGAELLLE VARAFADMAV YDKALDRYVI RGVMGPDEYH DGYPGREDPG LDDNAYTNLM AVWVMLRALD TLRALPGPSR RDLEESLGLD ADEVERFETL TRKMRVPFHE GVISQFAGYG DLEELDWRDC RGVRRLDRFL EAGGDSCNRY KASKQADVLM LFFLLPAEEI ADMLRRLGYT YDPGLIPRTV DYYLARTSHG STLSSVVHSW VLARTNREES WDFFRRALST DVDDVQGGTT AEGIHLGAMA GTVDLLTRCY TGLTTRGGAL HLSPLLPAEL DHLSYGLRYH DHWEVGVDVR RDHVRVTLPP SAGPPVRVRV KERHALVAPG SSCVLPLW
|
| |
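The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, checked for GC content, and translated is given below. It assumes the standard codon assignments (translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs mainly in the permitted start codons, which this sketch does not handle specially). The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over BASES (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the ten-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence field in this record.
fragment = clean_sequence("ATGAGCCGCT GGACGCTGGT CTACGAGGGG")
print(translate(fragment))              # MSRWTLVYEG
print(round(gc_fraction(fragment), 3))  # 0.667
```

The translated fragment matches the first block of the protein sequence field (MSRWTLVYEG). Applying the same functions to the full gene sequence would allow the listed GC content and protein sequence to be checked in the same way.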