Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0709 |
Symbol | |
ID | 9244551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 870885 |
End bp | 872282 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | CBS domain containing protein |
Protein accession | YP_003678660 |
Protein GI | 297559686 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCCGT CGGCCTGGCC TGACATCGCC CCGTTGACCA CCGGTGAGCC CGTGGAGTGG ATCAGCGCGT TCGCGGTGGC CCTGGTGCTG ACCGCCGTCG CGGGCTTCCT GGTGGCCGCG GAGGTCGCCG TCACCCGTGT GATGTCGGTG GGCCTGCCCG AGGCGAGTTC CGGTGAGCGC GTGGCCAAGT CCGGGCGCCG CTGGGAGCGG GTGTCGGCGG ACCCCACCCG CCACCTCAAC GTGCTGCTGT TCCTGCGCGT GGTGTGCGAG GTGTGCGCGG TGCTGGCCGC CGCCGTGGGC ATGGTGTTCC TGCTGGGGCT CGGCTGGCCC GCACTCGGGT TGACCGCCGC GGCGATGGTC GTGGTGGAGT CGGTGCTGAT CGCGATCGCG CCGCGCATCC TCGGCCGCCA GTTCTCCGGG GCCTTCGCGC GCGCCAGCGC GGCGGTGGTC TACCCGGTCC AGGTCGTGGT GGGCCCCATC GCCCAGCTGT TCGTGGGGGC GGGTCGCGCG CTCACCCCGC GCGCCAAGGG CGACCGCGAG GGCCCCTTCA GCACCGAGGT GGAGCTGCGC CAGCTCGTGG ACCTGGCCGA GCGGGGGGAG GTCATCGACC CCGAGGAGAG CCAGATGATC CACTCGGTGT TCAAGCTCGA CGACACGCCC GTGCGCGAGG TGATGGTGCC CCGCCCCGAC ATCGTGTTCA CCAGCCGCGA GGCCGACGCC GACGCGGTGC TGACCCTGGC GCTGGCCAGC GGTTACTCGC GCATCCCGGT GACGGGTGAG GACGAGGACG ACGTGGTCGG CATCGTCTAC CTCAAGGACC TGGTGTACCA GCTGCGCGAC CGCTGGGCGG CGGCCGCGGG CACCGACGAC GAGGTGCCGC TCACCGCGGG CGACGTGATG CGCCAGGCGA ACTACGTCCC CGACACCAAG CCCATCGACG AGCTGCTGCG CGAGATGCAG CAGCAGCGCA ACCACGTCGC GGTGGCCATC GACGAGTACG GCGGCACGGC GGGCCTGGTG ACCCTGGAGG ACATCGTGGA GGAGATCGTC GGTGAGATCA CCGACGAGTA CGACCACGAG ATCCCGCCGG TGGCCTGGCT CGACGCGGAC CGGGTGCGGG TGACCGCCCG CCTGCCGCTG GGCGAGCTGG ACGAGCTGTT CCCGGACCGG GACCTGGACG TGGCGGACGT GGAGACCGTC GGCGGACTCG TGGCGTTCGT CCTGGGCCGG GTGCCCGTGG GAGGGGACCG GGCCGAATAC GCTGGTCTCC GACTGACCCT CGAGGGCAAG AGCAGCCGCC GGAGCCGGAT GACCACGGTC CTGGTGGAGC GCCTGCCCGC CGACGCCGAA GGGGCGCCGG AGGAGGACAA CGACGACGTG AGGAGCACGG AGCAGTGA
|
Protein sequence | MTPSAWPDIA PLTTGEPVEW ISAFAVALVL TAVAGFLVAA EVAVTRVMSV GLPEASSGER VAKSGRRWER VSADPTRHLN VLLFLRVVCE VCAVLAAAVG MVFLLGLGWP ALGLTAAAMV VVESVLIAIA PRILGRQFSG AFARASAAVV YPVQVVVGPI AQLFVGAGRA LTPRAKGDRE GPFSTEVELR QLVDLAERGE VIDPEESQMI HSVFKLDDTP VREVMVPRPD IVFTSREADA DAVLTLALAS GYSRIPVTGE DEDDVVGIVY LKDLVYQLRD RWAAAAGTDD EVPLTAGDVM RQANYVPDTK PIDELLREMQ QQRNHVAVAI DEYGGTAGLV TLEDIVEEIV GEITDEYDHE IPPVAWLDAD RVRVTARLPL GELDELFPDR DLDVADVETV GGLVAFVLGR VPVGGDRAEY AGLRLTLEGK SSRRSRMTTV LVERLPADAE GAPEEDNDDV RSTEQ
|
| |