Gene Ndas_0063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0063 
Symbol 
ID9243893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp80292 
End bp82079 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content76% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003678021 
Protein GI297559047 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00586745 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGTACGT CAGGATCTCC CCCCGACCGC GGCCGTCCCT CCCAGCGCAC GCGGTCCAGC 
GGTGCGGCGG AGCCCGACGC GGCCGCCGCG CCCCGACCCG TCTACGCGAC CGGCGGGATC
GCCGCGGCCA CCGCCGCGGC GCTGGGGTTG GCCGTGATCG TCACCCTCAC CATCATCGGG
TGGGTGGCCG CGCCGCACGA CACCTTCGGC GAGGACATCA TCGACATCCT CCAGGGTGCG
GTGCTGGCCT GGCTGGTGGG CCACCACGTG TCGTTCTCGG TGCCCGACGG TCAGATCGCC
CTGCTGCCGT TGGGGCTGGT GCTGTTGCCG GGGCTGCTGC TGTACCGGTC GGGACGGTGG
CTGGCGCGTT CCTGCGACAT CCCCCGCCTG CGGTACGTGT ACCGGGCGGC GTTGGCGATC
GCCGGGCCGT ACGCGGCGAT CGCGGGCACC CTGGCCCTGC TGGCGCGGAC CGAGGCGGTC
GAGCCGAGTA TGCCGCGCGC GCTGGTCATG GGGTTCGTGA CCGCCTTCCT GGCCGGAGGG
CTGGGGGTGC TGCGCCAGCT GATGCGCGAC AAGGAGGTGC CCGTCCGGGA CCTGCTGCGG
CTGATGCCCG CCCGATCGCG CTCCCTGCTG GTGGGGATGC TCGCCTCCAC GGGGACGCTC
CTGCTGGGCG GGCTGGTGCT GTTCCTGGTG GCGCTGGCGA TGAGCCTGCC CGAGGTCGTC
GAGACGACCC GGGTGCTCGG CCCGGGCCTG GTGGGGGGCG CGCTGCTGAT CGTGGTCCAG
TTGGCGTACC TGCCCAACGC GGTGGTGTTC GCGCTCTGCT ACGCGCTGGG CCCGGGGTTC
GCGGTGGGGG CGGGGACCGT CGTCGCGCCG ACGGGGGTGT CGGTGGGAGC CCTGCCGATG
CTGCCGATGC TGGCCGCGCT GCCGTCGAAC GGGGCCGCGC CGGTGGCCTC GCTGCTGGCG
TTGGCGGTGC CGTTCGTGGC GGGTGCGGTG GGCGGTGCCC TGACCCAGCG CAGCGCCCCG
GACGTGGTCA GCGAGGCCGC TCCGCTGTGG GGGTTCGTGT GCGGTGTGAC CACGGGGCTG
CTGTGCGCGG CGCTGGCCGC GCTGGCCGGT GGGCCGCTCG GCGACGAGAG GCTGAGCGAG
GTCGGGCCGT CGGCGTGGCA GGTGGGCCTG GTGGCCGCCC TGGAGGTGGG TGTGGCCGCC
GCCGTCGTGG CGTGGGTGGC GAACTGGTGG TACACGCGGG GTGAGCGCCC GGCGCGCAGG
GCCCCGCGTT GGACGGCCTG GCGTGGGCGG CGCCGCGCGA AGGAGGACAC GGCGTCTTCG
GCGGGGGAGG CGCCCGCCGC GAGGGCCACG CCGCGCTCCG CCAAGGGCGT GACGCGCCCG
GCCAAGGAGG CGGCGCGTTC GGCCAGGGAC GCGGCGCGCT CCGCTGAGGA GGAACCGCGG
GAACGTCCGG AGGTGGAGCT GCCCCCGGAC GTGCCGGTCG GTGAGGGTCT GGCCACGGTC
ACCCCGCTGC GTCGCCGTGA GCAGGAGGTG CGACCGGACC CCGAAGAGGA CGCCTCGCCG
GAGGCCCCCG AGGACGAGGA GCCGGACGCG GACCGGGCCG CCGAGCGCGC GGAGCGCAAA
CGCGCGCGCC GGGAGCTGCG CGACCAGCGG CGCGCGGAGC GCAGGGCCAG GCGGGCCGAG
GGCGGGCGCT GGTGGCGCCG CCGGCCCGCC GAGGACGAGG AGTCCGAGGA GATGTACGGG
ATCACCTACG AGGCCGAACC CGACGGTGTC GACACCCCGC AGCGCTGA
 
Protein sequence
MSTSGSPPDR GRPSQRTRSS GAAEPDAAAA PRPVYATGGI AAATAAALGL AVIVTLTIIG 
WVAAPHDTFG EDIIDILQGA VLAWLVGHHV SFSVPDGQIA LLPLGLVLLP GLLLYRSGRW
LARSCDIPRL RYVYRAALAI AGPYAAIAGT LALLARTEAV EPSMPRALVM GFVTAFLAGG
LGVLRQLMRD KEVPVRDLLR LMPARSRSLL VGMLASTGTL LLGGLVLFLV ALAMSLPEVV
ETTRVLGPGL VGGALLIVVQ LAYLPNAVVF ALCYALGPGF AVGAGTVVAP TGVSVGALPM
LPMLAALPSN GAAPVASLLA LAVPFVAGAV GGALTQRSAP DVVSEAAPLW GFVCGVTTGL
LCAALAALAG GPLGDERLSE VGPSAWQVGL VAALEVGVAA AVVAWVANWW YTRGERPARR
APRWTAWRGR RRAKEDTASS AGEAPAARAT PRSAKGVTRP AKEAARSARD AARSAEEEPR
ERPEVELPPD VPVGEGLATV TPLRRREQEV RPDPEEDASP EAPEDEEPDA DRAAERAERK
RARRELRDQR RAERRARRAE GGRWWRRRPA EDEESEEMYG ITYEAEPDGV DTPQR