Gene Ndas_0250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0250 
Symbol 
ID9244084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp310786 
End bp313464 
Gene Length2679 bp 
Protein Length892 aa 
Translation table11 
GC content73% 
IMG OID 
Productglycosyl transferase family 51 
Protein accessionYP_003678205 
Protein GI297559231 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0340145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGAGT CCAACCAGCG CCCGCCGCGT GGGCGGCGGC ACGAGGCCCC TCGCCGGAAC 
TGGCGCGGGG CCCTCTCCCG GGCGGTACCG GCCGCCGCGG GTCCGCGTGT GCGGCGGTGG
GCCGAGGCCC TGCGCCGGAA GGTGTCGGAG CCGACCCCGG CGGGGGACCG CCGGGAGACG
GTCCAGCGCC TGGCCGGAAC CGGCGCGGTC GCCGGTCTGC TCACGGCGGC GCTGGTCATG
CCCTGGGTCG GCGGCCTCGG CCTGGCGGCC AGGGACTCCG CGGCGGCCTT CATGGCCCTG
CCCAGCGACC TGGCCGTGCC GCACCCCGCC GAGCGCGTGC TGCTGACCGA CGTCGACGGG
GAACCGATCG CCGAGGTCGC CGAGCGCGAG CGCGACGTGG TGCCGCTGGA CGAGATCAGC
CCCTGGGTGC CCGCCGCCCT CATGGCGATC GAGGACGACC GCTTCTACGA GCACGCCGGA
CTGGACCTGC GCGGCACGCT GCGCGCCGCC GTCCGCACCG TCCTGGGCAA CACCCAGGGC
GGGTCCACCA TCACCCAGCA GTACGTGAAG AACCTCCTCA TGGAACAGGC CGACACCGAG
GAGGAGCTGG CGAGCGCCAA CGCGCGCACC CTGACCCGCA AGGTGCTGGA GCTGCGCTAC
GCCATCGAGC TGGAGGAGAA GCTCACCAAG GACGAGATCA TGGAGGGCTA CCTCAACCTC
GCCTACTTCG GCCAGAACGC GTACGGCATC GAGGTCGCCG CCGAGCGCTA CTTCTCCGTC
CCGGCCTCCG AGCTCGACCC CGCGCAGGCC GCCACGATCG TGGCGCTGGT GCGCGCGCCC
TCGTACTACG ACCCGCTCAC CAACCCCGAG GCCTCCGTCG AGCGCCGCAA CCTGGTGCTG
GACCGGATGG TCGCCACCGG ACACCTGGAG AGCGCGCAGG CGCAGGAGTA CAAGAGCCGG
GGCCTGGAGG TGGACGAGAC CCCGCGCGCG GGCAGCTGCT TCAGCAGCGA GCAGCCCTTC
TTCTGCGACT ACGTCATGCG GTGGCTGGGC GGCTCCGACG CGCTCGCCGG GACCCAGGAG
GAGCGCGACC GGATACTGGA GCGGGGCGGC ATCACCGTGC GCACCACCCT GGACCTGGAC
ATGCAGGAGG CCGCCGAGCA GGCGATCGAG CGCTACGTCC CCGCGGGCGA CTCCCACAAG
TTCGCCGCCG AGGTCCTCGT GGAGCCCGGT ACCGGCCGGG TGCGGGTGAT GGCCCAGAAC
ATGCGCTACG GCTTCGACGA CGAGCCGGGC ACCACCTCGA TCAACCTGTC CGTGGACCAC
GAGGACGGCG GGTCGCTGGG CTACCAGGCG GGTTCGACGT TCAAGCCGTT CACCCTGGCC
GCCGCCCTGG ACGCCGGGCT CAAGTACGAC ACCAGCTTCT CCTCGCCCGA GTCCACGACG
GTGAGCGGCC TGGAGAACTG CGAGGGCGGC AGGATGGCGC CCTGGGACGT GCGCAACGCC
GGGGAGAGCG ACGGCGGCAG GCACAACATG ATCAGCGGGA CGAAGGGCTC GGTGAACACC
TACTTCGCCC AGCTCCAGGA GCGCGTCGGC CTGTGCGAGA CGGCGGAGAT GGCCCAGAGC
CTGGGCATCC ACCGCGCGGA CGGCGAGGAC CTGCAGGTGT GGAGCTCCTT CACCCTGGGC
GACCAGGAGG TCTCCCCGCT CACCATGGCC AGCGCCTACG CCGTCTTCGC CTCCCGGGGC
ACCTACTGCG AGCCGGTCCC CGTGGCCTCG GTCCTCTTCG AGGGCGAGGA CGGCGAGGAG
GTCGAGATGG GCACCGAGTG CGAGGAGGGC GCGCTGGACA CCGAGGTCGC CGACGGCGTC
AACCACCTGC TCCAGCAGAC CTTCGAGGGC GGTACCGCCA ACGGCCTGGA GATCGGACGC
CCTGTGGCGG GCAAGACCGG CACCACCGAC AGCGCGGCCT ACGCGTGGTT CGCGGGCTAC
ACCCCGAACC TGGCCGGGAC CGTGGTGGTC GGCGACATCC GGGGCGGGGA GCAGCACACG
CTCCAGGGCG TGACCATCGG CGACCGCTAC TACGGCATCG TCTACGGGGC CACGCTGCCC
GGTCCGATCT GGCAGGCCAC CATGCGCGAG GCCGTGGCGG ACCTGCCGGA GGAGGAGTTC
GCCCCCTCGC CGAAGGTCTA CGGCAAGGCC TCGGACAAGC CATCGGGCGG CGGTGGCGAC
AACGGTGACA GTGACGACAG TGACGCCGGT GGAGGCGATG GAGGCACGGC TGGCGGTGAC
GGCGGCGTCG CGGCCGGTGG TGGTGGCGGA GGCACTGGCG GTGGCGGCGG TGCCGGGGGT
GACGATGGCT CGACCGGCGG CGGTGGCACC GGTGACGGTG GTGGGGGAGC CACCGGCGGT
GGTGGCGGGG GTTCGACCGG GGGTGGTGGC GGTACCGGAG GCGGCGGAGG ATCGACCGAC
GGCGGTGGAG GCACCGGTGA CGGCGACGGC GGGGGTTCAA CCGGTGGAGG CGGCGGTACC
GGTGGCGGCG GCGGTGGTGG TGGAGACGGC AGCGGTACCG GCGGAGGCGG CGGCGGTACC
GGTGGCGGCG AGTCCCCGGG CGGGGACGGC GGCGCGCCCT GGGGCGGCGC GACCCCGGGA
CCGCAGGCGC CCGAGGGCGG CGCCAGTCCC GGGGGCTGA
 
Protein sequence
MTESNQRPPR GRRHEAPRRN WRGALSRAVP AAAGPRVRRW AEALRRKVSE PTPAGDRRET 
VQRLAGTGAV AGLLTAALVM PWVGGLGLAA RDSAAAFMAL PSDLAVPHPA ERVLLTDVDG
EPIAEVAERE RDVVPLDEIS PWVPAALMAI EDDRFYEHAG LDLRGTLRAA VRTVLGNTQG
GSTITQQYVK NLLMEQADTE EELASANART LTRKVLELRY AIELEEKLTK DEIMEGYLNL
AYFGQNAYGI EVAAERYFSV PASELDPAQA ATIVALVRAP SYYDPLTNPE ASVERRNLVL
DRMVATGHLE SAQAQEYKSR GLEVDETPRA GSCFSSEQPF FCDYVMRWLG GSDALAGTQE
ERDRILERGG ITVRTTLDLD MQEAAEQAIE RYVPAGDSHK FAAEVLVEPG TGRVRVMAQN
MRYGFDDEPG TTSINLSVDH EDGGSLGYQA GSTFKPFTLA AALDAGLKYD TSFSSPESTT
VSGLENCEGG RMAPWDVRNA GESDGGRHNM ISGTKGSVNT YFAQLQERVG LCETAEMAQS
LGIHRADGED LQVWSSFTLG DQEVSPLTMA SAYAVFASRG TYCEPVPVAS VLFEGEDGEE
VEMGTECEEG ALDTEVADGV NHLLQQTFEG GTANGLEIGR PVAGKTGTTD SAAYAWFAGY
TPNLAGTVVV GDIRGGEQHT LQGVTIGDRY YGIVYGATLP GPIWQATMRE AVADLPEEEF
APSPKVYGKA SDKPSGGGGD NGDSDDSDAG GGDGGTAGGD GGVAAGGGGG GTGGGGGAGG
DDGSTGGGGT GDGGGGATGG GGGGSTGGGG GTGGGGGSTD GGGGTGDGDG GGSTGGGGGT
GGGGGGGGDG SGTGGGGGGT GGGESPGGDG GAPWGGATPG PQAPEGGASP GG