Gene Ndas_0532 details


Gene Information

Locus tag: Ndas_0532
Symbol:
ID: 9244373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 651149
End bp: 652798
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 81%
IMG OID:
Product: glycosyl transferase family 2
Protein accession: YP_003678485
Protein GI: 297559511
COG category:
COG ID:
TIGRFAM ID:
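
The reported lengths can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(651149, 652798)
print(length)                     # 1650
print(protein_length_aa(length))  # 549
```

Both values agree with the Gene Length and Protein Length fields of this record.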


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00736806
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTCCC GCGCGCCCCT GCCTCCCGGT CTCGGCATCG AGATCGACCG CGCGGCGCGC 
CTGGCCGACG AGGGGCACGT CCTCCTCGGC GGTAGCCCGC CGCGGGCGGT GCGCCTCGGC
CCGGAGCGGG TCAGCGCGCT GATCCGCTGG CTGAGCCGGG CGGTGCCGCG GGACGCGGAC
GAGGGCCTGT TCGCACGCGA CCTCATCAGG GCGGGCCTCG CCCACCCCCG CCCCCTGCCC
CTCACCCCGG GGGACACGGC CGAGGTGGCC GTCGTCGGAC GGGCCGACTC CGCCGCTCTG
CACGCCACCC TGGACCACCT CGACGAGCAC GGCCACGGGA CGCGGACCGT CGTCGTGGGC
GCCACCGGGC CCGAAGCCCG GGCGGCCCGC CGCCGGGGCG TCCGCGTCGT CCTCGGGCCG
ACCGGGGGAG CCGGGGCCGG GGCGGCGGCG CTGCGCGCGT GCTCCGCCGA GTTCGTGGCC
CTGGTCGAGG CGGGCACCCG TCCCGCTCCG GGGTGGCTGG AGACCGCGCT CGGACACTTC
GCCGACCCGG ACGTGGCGGC CGTGGTGCCC CGGGTCCTGA CCGACCGCTC CGCCTGCCTG
GGCCACACGC GGATGACCGT GGCCTCCGTC GCCGCCCGCC GAACGGGCGC GGACCGGGGC
GCCGACCCCG CCCCCGTCCT GCCCTGGGGG CACGCTCTGC CCTGGCAGGA GCGCCCGGGA
CCGGCCAACG AGCACACCGA CCCTCTGCGG CCGGTCCCCG TCCTGGTGCT GCGCCGCGGT
GCCGCCGACC TCGACCCCGG CCTCGGCGCC GCCGCCGGGC TCGACCTGTT GTGGCGGCTC
GCCGAACAGG GCTGGTCGGT GCGCTACGAG CCCCGTTCCA GGGTGTGGGC ACCGCCGACC
ACCGACCTGG GCGCGTACCT GCGCGCCTGC TTCACCTCCG GGGCGGTCGC CGGTCCCCTG
GCCCGCCGCC GCGGCGCGCA CGCCGCCGGG CCCGCGCTGT CCTGGCCGGG CGCGGTCGGA
CTCGCGCTGT TGTTCGCGGG ACGGCCCGGC GCCGCCCTGG CGGCCGGTGC GCTGGGCGGC
GCGGCGGTGA CGGGCTCCCT CGTGGTGGGA GCGGGCACCC CGCTGCCCGA GGCGGCGCGA
CTGGCCGGGC TGGACCTCGC GCACACCGTG CGCACGGGCA CGCGCGCGGT CCGCACGGCC
TGGTGGCCGC TGGCCGCGGC GGCGGTGGGC GCGGCGGTGC TCGGACGGCG CGGCCGCGGG
GCACCGGGCG TGCCCGCTCC GGCCGCCTCG GACCCGCTCG CGGGTCGAGG CCGGTCAGCG
CGGCGAGGCG GTGCCGGCGG ACGCGCGGGC CGCGTCGCCG CCCTGGCCGC GGGAGCGGCC
CTGGTCGTGC CGCACGTGGC GGCCTGGCAC CGGGGCAGGG GCGCCGCACT GGCCGGTCCG
GTGACCTGGA CGGCCCTGGG GATGGCGGGC GACGCGGCGC GCTCCCTGGG CACCTGGTGG
GGGATCGCGC GGTCGGGTTC GCCCGCGCCC CTGGTGCCCC GGCTCGTGCC CCCGGCCGCG
ACCGGCACGG CACCGCGGGA GCGCTCCGGC GGGACGCGTC GTGCACGGGG ACCCAAGGAC
GGGTCAACGG CGGCCGTGAG CCGGGCTTAA
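
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any analysis. The sketch below strips the whitespace and computes the G+C fraction, which can then be compared with the GC content field. The helper names are hypothetical, and only the first sequence line is used here for illustration.

```python
def normalize_sequence(blocked_text: str) -> str:
    """Join a blocked sequence by removing all whitespace; upper-case it."""
    return "".join(blocked_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence above; paste the full block to check the record.
first_line = "ATGCCCTCCC GCGCGCCCCT GCCTCCCGGT CTCGGCATCG AGATCGACCG CGCGGCGCGC"
seq = normalize_sequence(first_line)
print(len(seq))                 # 60
print(round(gc_fraction(seq), 3))
```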
 
Protein sequence
MPSRAPLPPG LGIEIDRAAR LADEGHVLLG GSPPRAVRLG PERVSALIRW LSRAVPRDAD 
EGLFARDLIR AGLAHPRPLP LTPGDTAEVA VVGRADSAAL HATLDHLDEH GHGTRTVVVG
ATGPEARAAR RRGVRVVLGP TGGAGAGAAA LRACSAEFVA LVEAGTRPAP GWLETALGHF
ADPDVAAVVP RVLTDRSACL GHTRMTVASV AARRTGADRG ADPAPVLPWG HALPWQERPG
PANEHTDPLR PVPVLVLRRG AADLDPGLGA AAGLDLLWRL AEQGWSVRYE PRSRVWAPPT
TDLGAYLRAC FTSGAVAGPL ARRRGAHAAG PALSWPGAVG LALLFAGRPG AALAAGALGG
AAVTGSLVVG AGTPLPEAAR LAGLDLAHTV RTGTRAVRTA WWPLAAAAVG AAVLGRRGRG
APGVPAPAAS DPLAGRGRSA RRGGAGGRAG RVAALAAGAA LVVPHVAAWH RGRGAALAGP
VTWTALGMAG DAARSLGTWW GIARSGSPAP LVPRLVPPAA TGTAPRERSG GTRRARGPKD
GSTAAVSRA
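
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below builds the codon table from the standard NCBI amino acid string, which tables 1 and 11 share for elongation codons. It does not handle the alternative start codons of table 11, which is not needed here because this gene starts with ATG. The function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI layout shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a whitespace-free DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence above.
print(translate("ATGCCCTCCCGCGCGCCC"))  # MPSRAP
```

The result matches the first six residues of the protein sequence listed above.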