Gene Information |
Locus tag | Ndas_3711 |
Symbol | |
ID | 9247580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4454337 |
End bp | 4456175 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | lipolytic protein G-D-S-L family |
Protein accession | YP_003681615 |
Protein GI | 297562641 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
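The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record for Ndas_3711.
length_bp = gene_length(4454337, 4456175)
print(length_bp)                  # 1839
print(protein_length(length_bp))  # 612
```

The strand field does not affect this arithmetic: for a gene on the minus strand the start and end positions are still reported on the forward coordinate system of the replicon.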
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.106997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGGAT CCGACGAACG CTCCGTGTGG TGGTCGCGCG TGGCCGACCG CTTCGGCAGG CTTCGGTTCC GCTGGCCCGG ACCGCGCCGC TCCCGGCCCG GCGAACCGGG GCGGGACCGC CCGCGAGCGC CGCGTCTGGC GTTGCTCCTG GACCGGTGGC CCCGCCGGCT CCGCCTCACG TTGCTCCGGC CCCGCTGGGA CAGCCACTCC CGTTTCGAAC CCGGGCCCCT GGGGCTGCTC GGCGTCATGC TCTCGGCGAT GGCGGTCACC CTGCTGCTGA TCCAGAGCGG CTCCGGGGGA GGCGCCGCGG AGGGCGGGCG CGCCGAGGAG CCCGTCACCG TGCCGGGCCC GCCGGACGGC GAACTGCGCG TCATGGTCGT CGGCGACTCG CTCTCCCAGG GCAGCAGCGG CGACTACACC TGGCGCTATC GCCTGTGGTC CCACCTGCGC GAGGCCGGGG TGGAGGTGGA CTTCGTCGGC CCCTACGACG GCGTGTACGG GTTGGCGGAC CGCGAGTTCG GCAAACAGGA CTACGCCGAC CCCGACTTCG ACACCGACCA CGCCGCCCGC TGGGGCACCA CCGCGCAGGA CCTGTCCACC GGGGCCGCGC GGGTGGCCGC CGAGTACCGG CCGCACTACC TGCTCCTCCT GGCCGGTACC GAGGACATCC TGGCGGGGGA GGACGCCGAC AGCGCGCTGG AGGGGGTGGG CGAGACGGTG TCCACGGTGC GCGTCGTGCG GGGGCAGACC CGGTTCGTCC TCGGGGAGCT CCCGCCGGTG GAGGACCCGC GGGCCAACGC CGAGATCGAC CGGTTCAACA TGGGTCTGGT CGACCTCGCC GCTCAGCTCA CCTCCAGCGA ATCGCCCGTG GTGGTCGCGC GGGTCGCCGA CGGGTACGCC CCGGCCAGCG ACAACTGGGA CGAGGCGCAC CCCAACGCGC GCGGCGAGCT GAGGATCGCG GCGGCCTTCG CCGACGCGCT CGCCGAACCG CTGGGGGTCG GACCGGCCTA CCGGCGGCCG CTGCCGGAAC CGGCGGTGGG GCCCACGAGC GCGCCCGAGC CGGTGGCGGA GGAGTCAGAG GACGGCCTCG TGCTCAGTTG GGAGGCCGTG CCCGGTGCGA CCGGCTACCG GGTGGTGCAG CGGCGCGTGG GGCCGGACCC GGACGAGGCC GTCGTGCTTC CCGTGGACGT GCGCGCCGAC GGGGACAGCC GTTCCGTGCT GGTGGGGGCG CTCTTCAGCG GAGCCCGCTA CGAGTTCGTG GTGCGCTCGT TCAAGGGCCG GGACGAGGGG GTGGAGTCCG AGCCCCTCCC GTGGGTGTGG GACGACGATC CGCCGCCGGG GCCGTCCTGG CTGCGCGTCG TCGACGGCGG GGCGACCGTG GAGTGGGAGG AGGTCGAGGA GGCCTCCCAC TACGAGGTGT GGGTGCGCGC GTTGGACTGC GGGGTCGCCG ACGACCGGCG GGCGCCCGTG GACGGCGGCC CGTACGCGTC CCCGGAGGGT GACCACGCCT CGGAGGAACC CGTCGAATCG CCCGTCCCCA GCGGCGGGCC GTCCGACCCC GTGCCCGACG CGGACCCCGG TCCGCGGCCC ACCCCCACGC CCGCTCCGGC CCCCACGCCG TCCGTGCCGG AACCCGAGCC CGTCACGCCC GGCTCGGACT GCGAGCGCCG TGACGGGCTC GGCCCCGGGG ACGGGCGGGG GTGGCGCACG CTGGGCCCGG CGGGGGAGGA GCCGCGCTGG TCGGTGACGG TGTCGGGCCC CTACGAACTC GTGGTCCGCT CCTACCGCGA CTACGTGGAG 
GGCGGCTACT CCGACAGCGT TCTGCTCGCG CGGGGCTGA
|
Protein sequence | MRGSDERSVW WSRVADRFGR LRFRWPGPRR SRPGEPGRDR PRAPRLALLL DRWPRRLRLT LLRPRWDSHS RFEPGPLGLL GVMLSAMAVT LLLIQSGSGG GAAEGGRAEE PVTVPGPPDG ELRVMVVGDS LSQGSSGDYT WRYRLWSHLR EAGVEVDFVG PYDGVYGLAD REFGKQDYAD PDFDTDHAAR WGTTAQDLST GAARVAAEYR PHYLLLLAGT EDILAGEDAD SALEGVGETV STVRVVRGQT RFVLGELPPV EDPRANAEID RFNMGLVDLA AQLTSSESPV VVARVADGYA PASDNWDEAH PNARGELRIA AAFADALAEP LGVGPAYRRP LPEPAVGPTS APEPVAEESE DGLVLSWEAV PGATGYRVVQ RRVGPDPDEA VVLPVDVRAD GDSRSVLVGA LFSGARYEFV VRSFKGRDEG VESEPLPWVW DDDPPPGPSW LRVVDGGATV EWEEVEEASH YEVWVRALDC GVADDRRAPV DGGPYASPEG DHASEEPVES PVPSGGPSDP VPDADPGPRP TPTPAPAPTP SVPEPEPVTP GSDCERRDGL GPGDGRGWRT LGPAGEEPRW SVTVSGPYEL VVRSYRDYVE GGYSDSVLLA RG
|
| |
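The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG is an accepted start codon and is read as methionine when it initiates translation. The following is a minimal sketch of that translation, assuming the listed gene sequence is already the coding strand in the 5' to 3' direction (it starts with a start codon and ends with TGA). The `translate` helper and its `first_codon_is_start` parameter are hypothetical names used for illustration; the codon assignments and start codons are those of NCBI table 11.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G, with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons of translation table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, first_codon_is_start: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGCGCGGAT CCGACGAACG CTCCGTGTGG"))  # MRGSDERSVW

# Last 39 bases of the gene sequence (in frame, ending in the TGA stop codon).
print(translate("GGCGGCTACT CCGACAGCGT TCTGCTCGCG CGGGGCTGA",
                first_codon_is_start=False))  # GGYSDSVLLARG
```

Both outputs agree with the first ten and last twelve residues of the protein sequence listed above.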