Gene Information |
Locus tag | Ndas_0775 |
Symbol | |
ID | 9244620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 954263 |
End bp | 956203 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | lipolytic protein G-D-S-L family |
Protein accession | YP_003678725 |
Protein GI | 297559751 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
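The coordinate and length fields above are related by simple arithmetic, sketched below. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API. The sketch assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon, which is consistent with the values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(954263, 956203)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the Start bp and End bp listed here, this reproduces the reported Gene Length of 1941 bp and Protein Length of 646 aa.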
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.812985 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGGGG AGAGGCGAGC GTTGGGAGCG GACGGCACAC GGCTGCGCGG AACCCACCGG CACGGGGTGG GCCTCCATCC CCGCCTGCTG TACGCCCTGG CCGCGGCGGC CGTGGCGGTG CTCCTCGTGG TCGTGGTCAA CACCGCCGTC TCCCGGGACG ACGGCACCGG GCCCTCCGAG TCCAGGGAGC AGACCGTCCC CGCCGTGAAC GCGGTCCCCT TCAGCGGTCT CGTCGACCGG ATCGGCGTCT CCCGGGACAC CTCCGTCGCC ACCGTCGACC TCGACGGCTC CGGGAACAGC CTGTCCTCCC AGGCGCTGGC CGCCGCCGGG TGGACGCCCG GCCGCGAGGT GGTCCTGCTC GGCACGCCCA TGGAACTGCC CGACTACGCT CCCGGCCGCC CCGACCACCT GGTCTCGGAC GGCCAGTTCC TGCGGCTCGC GGACGAGCAC TACCGGTCGC TCACCTTCCT GGCCACCGCG ACCCGCACCG ACGGCATCGA GGCCGAGGTC ACCGGAACCG GCCGCGTGGT CTACGCCGAC GGCCGGGAGC AGGAGTTCAC CCTGTCCGTG CCGCACTGGA CCGCCGGACC CGCGAGCGAG GCCGCGCTCA CCCTGCCCTA CGCCAACTCC GCGCGGCACG CCGGTCCCAG CCCCACCCTC GGCGCGGCGC GCCTGTACGC GCGCGGCGTC ACGGTCGACC CCGCCCGTGA GATCTCCCAC GTGGTGCTGC CCGAGACCGC CGACGAGGCG GGCCGCATCC ACGTGTTCTC CGTGGGCGGG CGCGCCGCGG AGACGGAGTG GACGGGCACC TGGGCGCGCG CCACCTCCGG CTACATGGAG GTCGGACCCT GGCGGGACCA GACCCTGCGG CTGTCGGTGC GCACGACCAC CGGCGGCCAC ACGGTCCGCA TCCGCCTGGA CAACACCTTC GCGGCCGCAC CCGTCACCGT GGGCGCGGCC TCCGTCGCGC TGCGCGGAAG CGGCGCGGCC TCCCGCGGCG CCGCGGTCCC ACTCACCTTC GAGGGGCGCT CCGACACCGT CATCCCGGCT GGGGGACAGG TGTTCAGCGA CCCCGTCGAG ATGCTCCTGC CGCCGCAGAC CGACGTGCTG GTCAGCATCC ACCTGCCCGA ACAGGTGACC ACCGCGCCGG TGCACTACGC CTCGGTGGAC ACCAACTACA CCAGCGCGCC CGGAAGCGGC GACCTGACCC TGGACACCAC GGGGGAGCCG TTCACCGGCC GGGTCGCGCA GTGGCCCTTC CTCACCGCGG TGGAGGTCTC CGGGGGGCCG GGCGCGATCG CCGCCTTCGG CGACTCCATC ACCGACGGGA TCCGCTCCAC CCCCGACGCC CACGCCCGCT GGCCCGACGT GCTCAGCGCC CGCCTGTCCG AGCGGCCGGG GCTGCCCAAC CCGGGCGTGC TCAACCTCGG CGTGGCCGGG AACCACGTGA TCCGGGACGG CTACCCGGGT GAGGGCGTCT CCACCAACCC CACCGGGGTG GCGCTGACCC ACCGGGTGCA CCGGGACGTG CTGGCCCAGA GCGGGGTGAA CACCCTGGTC GTCTTCGCCG GGATCAACGA CCTGCGCTGG AGCACCCCGC CCGAGTCGGT GATCGCCGGG ATCGAGGAGA TCGCGGGGCT GGCCCGCGAG AACGGCATGC GGGTGTTCGT GGCCACGCTC GGCCCCTGCG GGGGCGAGGC GCGCTGCACC GAGGAGGTCG ACCGGGCCCG CCAGCAGGTC AACGACCACC TGCGGGCGCG GGCGAACGAT CCCCTGTCGC CCTTCGACGG CGTGTGGGAC 
TTCGACGCGG TTCTGCGCGA CCCGCAGGAA CCCAGCCGCA TGCTGCCCGC CTACGACTCG GGCGACCACC TCCACCCGGG GGACGCCGGA CTGCGCGCGC TGGCCGAGTC CGTGGACCTC TACCAGCTCG TCGGCGGCTA G
|
Protein sequence | MRGERRALGA DGTRLRGTHR HGVGLHPRLL YALAAAAVAV LLVVVVNTAV SRDDGTGPSE SREQTVPAVN AVPFSGLVDR IGVSRDTSVA TVDLDGSGNS LSSQALAAAG WTPGREVVLL GTPMELPDYA PGRPDHLVSD GQFLRLADEH YRSLTFLATA TRTDGIEAEV TGTGRVVYAD GREQEFTLSV PHWTAGPASE AALTLPYANS ARHAGPSPTL GAARLYARGV TVDPAREISH VVLPETADEA GRIHVFSVGG RAAETEWTGT WARATSGYME VGPWRDQTLR LSVRTTTGGH TVRIRLDNTF AAAPVTVGAA SVALRGSGAA SRGAAVPLTF EGRSDTVIPA GGQVFSDPVE MLLPPQTDVL VSIHLPEQVT TAPVHYASVD TNYTSAPGSG DLTLDTTGEP FTGRVAQWPF LTAVEVSGGP GAIAAFGDSI TDGIRSTPDA HARWPDVLSA RLSERPGLPN PGVLNLGVAG NHVIRDGYPG EGVSTNPTGV ALTHRVHRDV LAQSGVNTLV VFAGINDLRW STPPESVIAG IEEIAGLARE NGMRVFVATL GPCGGEARCT EEVDRARQQV NDHLRARAND PLSPFDGVWD FDAVLRDPQE PSRMLPAYDS GDHLHPGDAG LRALAESVDL YQLVGG
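As an illustration of how a GC content figure like the one in this record could be checked, here is a minimal sketch that works on a sequence written in space-separated 10-base blocks, as in the Gene sequence field. The names `gc_content` and `ends_with_stop` are hypothetical. It is an assumption that the reported 75% refers to the gene sequence itself rather than to a larger region. The stop codons TAA, TAG and TGA are those of translation table 11.

```python
def clean(sequence: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case the bases."""
    return "".join(sequence.split()).upper()


def gc_content(sequence: str) -> float:
    """Fraction of bases that are G or C, ignoring whitespace."""
    bases = clean(sequence)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


def ends_with_stop(sequence: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop codon."""
    bases = clean(sequence)
    return len(bases) % 3 == 0 and bases[-3:] in stops


# Toy input in the same block layout as the record; not the real gene.
example = "ATGGGCGCCG GCTAG"
print(f"{gc_content(example):.0%}", ends_with_stop(example))
```

Pasting the full Gene sequence field into `gc_content` would give a value to compare against the reported percentage.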