Gene Ndas_0113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0113 
Symbol 
ID9243944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp142172 
End bp143836 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003678069 
Protein GI297559095 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTACC CCGATCCCCC ACAGCAGCCC CAGTGGCAGC CTCAGCCGCC CGGGCAGGGT 
TACCCTCCGC CCCAGGCTCC CGGCCAGCAC CCCGCGGGCT TCGGCGCGGG CGCGCCCGGC
GGTCCGCCGC CCGCGGGTCC CTTCCCCGGG GGCCAGTACT CCGGCGGTCA GGTGCCGCCG
CCCTCCGGCG GCGGCCGCAA CAGGCGCTGG CTGATCCCGG CCGCCGCGGG CACCGCGCTC
CTCCTCATGG GCGGCACGGT GTGGGCCACC GTCTCGCTGG TCAGCTTCGG CGGCCCCCAG
CCCGAGGACG TGCTCCCGGG CAGCGCGCTC GCGTTCGGCA AGATCGACCT GTCCATCGAC
GGCTCCCAGG CCATGGAGCT GTTGCGGTTC GTGGACCAGC TCCCCGACGA GGTCACGGCC
GAGATGGACG AGCCCGACGG CGACATGACG GGGCTGGTCG CCGAGGGTTT CGTGGCCGCC
TTCCCCGAGG CGCGGCAGTC CGAGGTCGAG GAGTGGATCG GCCAGCGCGT GGGCTTCGCG
ATGTGGCCCG CCAGCGGTGA GGCCGAGCTG GGCGAGGGCG CCGGCGTGGC CGGGGCGTTC
GCGCTCGCGG TGGAGGACGA GCGGCTCGCC CAGGAGAACC TGGAGCGGCT CAGCGGCGAG
TACGACGACC TCTACTTCGA GGTGATCAAC GACTTCGCGC TCCTCACCAC CTCCGACGCG
GCGCTGGCCG ACCTGCACGC GCAGGTGGAG GAGCACGGTC CCCTGGCCGA CGCCGACACC
TTCTCCGGCG ACATGGCCGA GGTGCCCGGC GGCAGCCTGG CCGCGGCGTG GTCGGACCTC
GCGGGTCTGA TGGAGGTCGA GGAGTTCGCC CGCGAGTTCG AGGCCGACCT GGCCGCCGAG
ACCGGTGAGC TGAGCGGCCG GATGACCGCG TCCCTGCGGG TGGACGGCGA GTACCTGGAG
GCGCGCACGG ACATCTTCGA CCTCACGGTG GACGGCACCG ACCTGGCGTG GCTGGCCGAG
ACGCCGGGGG CGAGCGTGGC GGCGATGGAG TCGCTGCCCG AGAGCACGGT CATGGCCGTG
GGCGCCAGCG GCCTGGACAC GGCGCTGGCC GACGCCTACG AGAGCGACTC GATCCCGTTC
ATGACGAGCA GCGGCCAGAT GCAGGAGATG GAGCGCCAGT TCAACTCGAT GGGCGCCCCG
CTGCCGGAGG GGTTCACCCA GCTGCTGGGC TCCTCGACCG CGTTCGGTGT CACCGACATG
GACCTGGAGG GCTTCTTCGG ATCCTCCTAC AGCGGCGGCG GGGAGGCGTC CTTCCAGTAC
CGCGCGGTGG GCGGTGACGA GCAGATCCTG GGCGACTTCG TCGAGAGCGC GGTCGTCGAC
TCCTACAGCA CCCCGCCGGG GGTGAGCACC GACGGCGACG CCGTGGTGGT GTCCCAGGGC
AGCAGCGCCA CCGGGCGCCT GGGCGACGAC CCGGTCTTCC AGCAGACCAT GCAGGAGATG
GACTCGGCGG TGATGGCCGG GTACATGGAC CTGCGCCAGG TGCTGACCGA GAGCGAGGTG
GAGGCCCCCG GCCAGTGGGG CGCCGTGGGC CTGGCGCTGA GCGTCACCGA GGAGGGGCAG
CGCAGCAGCG TGGAGCTGCG CTGGTCGCCC AGCGGCGGCG AGTAG
 
Protein sequence
MSYPDPPQQP QWQPQPPGQG YPPPQAPGQH PAGFGAGAPG GPPPAGPFPG GQYSGGQVPP 
PSGGGRNRRW LIPAAAGTAL LLMGGTVWAT VSLVSFGGPQ PEDVLPGSAL AFGKIDLSID
GSQAMELLRF VDQLPDEVTA EMDEPDGDMT GLVAEGFVAA FPEARQSEVE EWIGQRVGFA
MWPASGEAEL GEGAGVAGAF ALAVEDERLA QENLERLSGE YDDLYFEVIN DFALLTTSDA
ALADLHAQVE EHGPLADADT FSGDMAEVPG GSLAAAWSDL AGLMEVEEFA REFEADLAAE
TGELSGRMTA SLRVDGEYLE ARTDIFDLTV DGTDLAWLAE TPGASVAAME SLPESTVMAV
GASGLDTALA DAYESDSIPF MTSSGQMQEM ERQFNSMGAP LPEGFTQLLG SSTAFGVTDM
DLEGFFGSSY SGGGEASFQY RAVGGDEQIL GDFVESAVVD SYSTPPGVST DGDAVVVSQG
SSATGRLGDD PVFQQTMQEM DSAVMAGYMD LRQVLTESEV EAPGQWGAVG LALSVTEEGQ
RSSVELRWSP SGGE