Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0532 |
Symbol | |
ID | 9244373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 651149 |
End bp | 652798 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 81% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003678485 |
Protein GI | 297559511 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00736806 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCCC GCGCGCCCCT GCCTCCCGGT CTCGGCATCG AGATCGACCG CGCGGCGCGC CTGGCCGACG AGGGGCACGT CCTCCTCGGC GGTAGCCCGC CGCGGGCGGT GCGCCTCGGC CCGGAGCGGG TCAGCGCGCT GATCCGCTGG CTGAGCCGGG CGGTGCCGCG GGACGCGGAC GAGGGCCTGT TCGCACGCGA CCTCATCAGG GCGGGCCTCG CCCACCCCCG CCCCCTGCCC CTCACCCCGG GGGACACGGC CGAGGTGGCC GTCGTCGGAC GGGCCGACTC CGCCGCTCTG CACGCCACCC TGGACCACCT CGACGAGCAC GGCCACGGGA CGCGGACCGT CGTCGTGGGC GCCACCGGGC CCGAAGCCCG GGCGGCCCGC CGCCGGGGCG TCCGCGTCGT CCTCGGGCCG ACCGGGGGAG CCGGGGCCGG GGCGGCGGCG CTGCGCGCGT GCTCCGCCGA GTTCGTGGCC CTGGTCGAGG CGGGCACCCG TCCCGCTCCG GGGTGGCTGG AGACCGCGCT CGGACACTTC GCCGACCCGG ACGTGGCGGC CGTGGTGCCC CGGGTCCTGA CCGACCGCTC CGCCTGCCTG GGCCACACGC GGATGACCGT GGCCTCCGTC GCCGCCCGCC GAACGGGCGC GGACCGGGGC GCCGACCCCG CCCCCGTCCT GCCCTGGGGG CACGCTCTGC CCTGGCAGGA GCGCCCGGGA CCGGCCAACG AGCACACCGA CCCTCTGCGG CCGGTCCCCG TCCTGGTGCT GCGCCGCGGT GCCGCCGACC TCGACCCCGG CCTCGGCGCC GCCGCCGGGC TCGACCTGTT GTGGCGGCTC GCCGAACAGG GCTGGTCGGT GCGCTACGAG CCCCGTTCCA GGGTGTGGGC ACCGCCGACC ACCGACCTGG GCGCGTACCT GCGCGCCTGC TTCACCTCCG GGGCGGTCGC CGGTCCCCTG GCCCGCCGCC GCGGCGCGCA CGCCGCCGGG CCCGCGCTGT CCTGGCCGGG CGCGGTCGGA CTCGCGCTGT TGTTCGCGGG ACGGCCCGGC GCCGCCCTGG CGGCCGGTGC GCTGGGCGGC GCGGCGGTGA CGGGCTCCCT CGTGGTGGGA GCGGGCACCC CGCTGCCCGA GGCGGCGCGA CTGGCCGGGC TGGACCTCGC GCACACCGTG CGCACGGGCA CGCGCGCGGT CCGCACGGCC TGGTGGCCGC TGGCCGCGGC GGCGGTGGGC GCGGCGGTGC TCGGACGGCG CGGCCGCGGG GCACCGGGCG TGCCCGCTCC GGCCGCCTCG GACCCGCTCG CGGGTCGAGG CCGGTCAGCG CGGCGAGGCG GTGCCGGCGG ACGCGCGGGC CGCGTCGCCG CCCTGGCCGC GGGAGCGGCC CTGGTCGTGC CGCACGTGGC GGCCTGGCAC CGGGGCAGGG GCGCCGCACT GGCCGGTCCG GTGACCTGGA CGGCCCTGGG GATGGCGGGC GACGCGGCGC GCTCCCTGGG CACCTGGTGG GGGATCGCGC GGTCGGGTTC GCCCGCGCCC CTGGTGCCCC GGCTCGTGCC CCCGGCCGCG ACCGGCACGG CACCGCGGGA GCGCTCCGGC GGGACGCGTC GTGCACGGGG ACCCAAGGAC GGGTCAACGG CGGCCGTGAG CCGGGCTTAA
|
Protein sequence | MPSRAPLPPG LGIEIDRAAR LADEGHVLLG GSPPRAVRLG PERVSALIRW LSRAVPRDAD EGLFARDLIR AGLAHPRPLP LTPGDTAEVA VVGRADSAAL HATLDHLDEH GHGTRTVVVG ATGPEARAAR RRGVRVVLGP TGGAGAGAAA LRACSAEFVA LVEAGTRPAP GWLETALGHF ADPDVAAVVP RVLTDRSACL GHTRMTVASV AARRTGADRG ADPAPVLPWG HALPWQERPG PANEHTDPLR PVPVLVLRRG AADLDPGLGA AAGLDLLWRL AEQGWSVRYE PRSRVWAPPT TDLGAYLRAC FTSGAVAGPL ARRRGAHAAG PALSWPGAVG LALLFAGRPG AALAAGALGG AAVTGSLVVG AGTPLPEAAR LAGLDLAHTV RTGTRAVRTA WWPLAAAAVG AAVLGRRGRG APGVPAPAAS DPLAGRGRSA RRGGAGGRAG RVAALAAGAA LVVPHVAAWH RGRGAALAGP VTWTALGMAG DAARSLGTWW GIARSGSPAP LVPRLVPPAA TGTAPRERSG GTRRARGPKD GSTAAVSRA
|
| |