Gene Ndas_0388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0388 
Symbol 
ID9244226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp475627 
End bp478980 
Gene Length3354 bp 
Protein Length1117 aa 
Translation table11 
GC content71% 
IMG OID 
Productglycosyl transferase family 51 
Protein accessionYP_003678342 
Protein GI297559368 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.62292 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGACG TGATTTGCCA TGGGGGCGGG GAACTGAGCG AGAGCACACA CGGCGAAAGC 
GGGGCGGACG GACAGCGTCC CGGGAAGGAC ACGCCGGACA CCAGGAACAC CTCTGCCGAA
GAGCTGTCCA ACAAGCACAG TCCCGAGAGT GGAGTACCCA ACATCGAGGG GGACCAATCC
ACTCATGACA CTCCTGGAGG AGAAGAAAGC GTAGGAGAAC AGCCCGAACC GGAGTTCATC
AACCCGGTCA CCTTCAAGAC CAGCGGTACC TTCCTCCGTG ACAAGGTCGC CCAGAACCTC
GCGGAACAGG GCTTCGGTTC CGACGACACC GGCGACGACG CCGCGGAGGA CTCCGGCGTT
CCCGCCGACG CGCACGGCAC CGATGCGGTC CACACCTCTC CCGCGGCCGG CTCCCCCGCC
TCAGCCGGCA GCACGGGCAC GGAGGAGTCC GACACGCGCG AGGACGGGGG TCGGGGGGAC
GGACGGTCCT CCCAGGCCGC CCCGCCCTTC GCGGCCGACG ACACGTCCGT CGACCAGGAC
GGTACGGCCG GCGCCGACGA GCCCCCATCG CCGGTGTCCG ACCGGACCGA GCCCATCTCG
CTCGACTCCG TCCGCGCCGC CGCCTCCGCG GAGGCGGCGG AGACGTCCGA CGACGCCTCC
GGCAACGCCG GTGCCGAAGA GGGAGGGACA GACCTCTGGA GCCCGCGCGG GCAGGAGCGC
ACGGGAGTTC CCAAGAGCCC CGTGGCCGGA ACCGTGACCG ACGTCCAGGA CTTCGAGGAC
GGGTCCGCCG AGGCCGCCAC GTCCGCTGTG CCCGCCGCAC CGGACGCGGT CGCTCCGGAC
CGTGACGGAG AGCAGAGGAT CGACGAGTTC TCCGCGGCCG AGACGGCGTC CCTGTGGCCC
GAGCCGCGTC GCCTCTCCTC CTACGTGGCC GATTCGCCCT CCCGGGAGCA GGACCGGGGC
GCCCGCCCCG GTTTCGCCGC GGCCGCGGCC GGAGCGGCGG GTGCCGCGGG AGCGGCCGCC
GCCCACTCCG GGGCACACGG CGACCCCAGC GGTCCCGGTG ACACCACCAG TTCCAGTAAC
TCCGGTGGAC CCAGCGGTCC CACCGGCCCT GGCGGCCCCA AGGGTCCGGC TGGGCCTGGC
GGTACCGGGC CCGGCGGGCC CAAGGGCCCG GGTGACCCCA AGGCACGGGG TAAGAAGAAG
GGCGCCGGTC AGAAGGCCAA GAAGCCCATG TGGTGGAGGA TCCTGCGGGT CTTCCTCATC
GTCACCGGCG TGTTCTTCAT CATCGGGTGC GGTGTCTTCG CCTTCTTCTA CAGCACCGTG
GAGGTTCCGG ACGCGGCCAA GGCCGACGTC CTGGAGGAGG GTTCCACCTT CTACTTCGCC
GACGGCGAGA CCGAGTTCGC CTACCGGGGC ACCCACCGGG AGATCCTGTC CTACGACGAG
ATGACCGCGG GCGGCGACCA CGTCGTCGAG GCGGTCATCT CCGCGGAGGA CCGCGGTTTC
TGGACCGAGC CCGGGGTGTC CGTGAGCGGC ACCGCCCGCG CGGTCTGGTC CACCGTCACC
GGCCAGCAGG TCCAGGGCGG CTCCACCATC ACCCAGCAGA TGGTCCGCAA CTACTACGAG
GGCATCTCCC GGGACGTGTC CATCACCCGC AAGGTGCGGG AGATCATCAT CGCCCTCAAG
GTCGACCGGA GCGAGTCGAA GGAGTGGGTG ATGGAGCAGT ACCTCAACAC CATCTACTTC
GGCCGCAACG CCTACGGAGT CCAGGCCGCC TCGCAGGCGT ACTACCACAA GGACGTCCAG
GACCTGGAGC CCGCCGAGGC CGCGTTCCTG GCCGCCGCCA TCCAGCAGCC CACCCCCTTC
GGTGAGGCCG ACGTGGAGAC CACCCCCTCC ATGGAGCGGC GCTGGGAGTA CGTCGTCAAC
GGCATGGTCA CCACGGAGGC CATCACCCAG GCCGAGGCCG ACGCGATGGA GTTCCCGGCG
CCCGAGCCGG AGCGGCCCAG CGAGGGCACC GACCTGAGCG GCTACAAGGG CTACATGCTC
CAGGAGGCCA TGAACGAGCT GGAGCGGCTC GGCTACACCG AGGACCAGAT CAACCGCCAG
GGCTACAGGA TCGTCACCAC GTTCGACCAG GACATGATGG ACGCCGCCTA CGCCGCGGTG
GAGGAGATGG TCCCGCTCGA GAACCGGCCC GAGGGCGTCA ACGTCGGTCT GACCACCGTC
GACCCGGCCA CGGGCGAGGT CCTGGCCTTC TACGGCGGCC ACGACTACTG GGAGAACCAG
TACGACAGCT CCTTCCTCGG CGCCGCGCAG ACCGGTTCGG CGTTCAAGCC CTACGTGCTC
GCCACCGCCC TGGAGCAGGG CTACAGCCTC AACTCCACGG TGGACGGGCG CGGGCCGCGC
ACCATCGCGG GCTCGCGCAT CCAGAACGCG GGCAACTCGC CGGGCGGCAT CATGACGCTC
ACGCAGGCCA CCCAGGTCTC CAACAACCTC GGTTTCATCG AGCTGGCCCA GGAGGTCGGT
TTCGAGAACG TCCGCGACAC CGTCTACGCC GCCGGGTGGC CCGAGGGGAG TGTGCCCGAC
AGCCAGCTGG TTCCGGTCAT GCCGCTGGGC GCCTCCAGCG CCCGCACGGT CGACCAGGCC
AGCGGGTACG CCACCTTCGC CAACGGCGGC GTCCACGTCG AGACGCACGT GGTGCGGGAG
ATCATCGACT CCGAGGGCGA GAACGTGCGC CCGGAGGTGG AGAGCAACAG GGCTCTGGAG
GAGGAGACCG CGGCCGACGT CACCCACGCC CTCCAGCAGG TGGTCAACGG CGGTACCGGT
ACCGGGGCGC GGCTGGCCAA CCACCCCACC GCGGGCAAGA CCGGTACCAC CGACGGCAGC
GTCGCCGCCT GGTTCGTCGG TTACACCCCG CAGCTGTCCA CGGCGGTGGG GATCTACAGC
GGCAACAACG AGGGCTTCAG TATCCCCGGA TACGGCTCTC TGTCGGGCGG TACGCTCCCG
GCGTCGCTGT GGAACCGGTA CATGAGCACC GCGATGGAGG GTTACGAGCC CGGTTCGTTC
CCGAGCCCGG CCTTCAGCGG CACCACCGAG AACTGGGCTC CGGACGTGTC CACCGAGCAG
CCCCAGCAGC CGCAGCAGCC GGAGGCGCCC GCGGAGCCGG AGACGCCGGT GGAGCCGGAG
GTGCCCACCA CGCCGGAGAC GCCGGTGGAG CCGGAGGTGC CCACCACGCC CGAGTGGCCG
GAGATCCCCG GTCCCGGGAC CGATCCGGGT CCCGGTGAGG GCGGCGGCGG AGAGGAAGGC
GGTGGCGGTG GCGGCGGTGA GAGCCCGCCG GCCAGACGCG ACGACCTCTG GTGA
 
Protein sequence
MPDVICHGGG ELSESTHGES GADGQRPGKD TPDTRNTSAE ELSNKHSPES GVPNIEGDQS 
THDTPGGEES VGEQPEPEFI NPVTFKTSGT FLRDKVAQNL AEQGFGSDDT GDDAAEDSGV
PADAHGTDAV HTSPAAGSPA SAGSTGTEES DTREDGGRGD GRSSQAAPPF AADDTSVDQD
GTAGADEPPS PVSDRTEPIS LDSVRAAASA EAAETSDDAS GNAGAEEGGT DLWSPRGQER
TGVPKSPVAG TVTDVQDFED GSAEAATSAV PAAPDAVAPD RDGEQRIDEF SAAETASLWP
EPRRLSSYVA DSPSREQDRG ARPGFAAAAA GAAGAAGAAA AHSGAHGDPS GPGDTTSSSN
SGGPSGPTGP GGPKGPAGPG GTGPGGPKGP GDPKARGKKK GAGQKAKKPM WWRILRVFLI
VTGVFFIIGC GVFAFFYSTV EVPDAAKADV LEEGSTFYFA DGETEFAYRG THREILSYDE
MTAGGDHVVE AVISAEDRGF WTEPGVSVSG TARAVWSTVT GQQVQGGSTI TQQMVRNYYE
GISRDVSITR KVREIIIALK VDRSESKEWV MEQYLNTIYF GRNAYGVQAA SQAYYHKDVQ
DLEPAEAAFL AAAIQQPTPF GEADVETTPS MERRWEYVVN GMVTTEAITQ AEADAMEFPA
PEPERPSEGT DLSGYKGYML QEAMNELERL GYTEDQINRQ GYRIVTTFDQ DMMDAAYAAV
EEMVPLENRP EGVNVGLTTV DPATGEVLAF YGGHDYWENQ YDSSFLGAAQ TGSAFKPYVL
ATALEQGYSL NSTVDGRGPR TIAGSRIQNA GNSPGGIMTL TQATQVSNNL GFIELAQEVG
FENVRDTVYA AGWPEGSVPD SQLVPVMPLG ASSARTVDQA SGYATFANGG VHVETHVVRE
IIDSEGENVR PEVESNRALE EETAADVTHA LQQVVNGGTG TGARLANHPT AGKTGTTDGS
VAAWFVGYTP QLSTAVGIYS GNNEGFSIPG YGSLSGGTLP ASLWNRYMST AMEGYEPGSF
PSPAFSGTTE NWAPDVSTEQ PQQPQQPEAP AEPETPVEPE VPTTPETPVE PEVPTTPEWP
EIPGPGTDPG PGEGGGGEEG GGGGGGESPP ARRDDLW