Gene Ndas_4518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4518 
Symbol 
ID9248398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5360393 
End bp5362012 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682412 
Protein GI297563438 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.714804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAGA CGACGGCCAC GGCGGGGGGC CAGGCCCCGA TCGGGGGGCG TCCGCCATCG 
GAGGGCGGAC GGCTCCTCCC GGAGGGCGCC GGACGGCCTT TCCCGGCCGA CGGCGGTGTG
GAGGACCTGG TCGACGCCCT CTGCGCGCTC CGGTGCGTCC TCCTCCCGGA CGCGCGCGAG
CAGTTGGCCT ACCAGCTGCC GACGCGGGTC CGCCGCAACC TGCGCCTGGA GGGTTCACCG
GTCGAGTTCG CCCGCCACCT GGTACGCCTG TGCCGGGAGC AGGTGCGCCT GGGCTGCCTC
GTGCGCTGGA TCCTCTACCT GGAGGAGCGC GGCCGCAGCT CGCTGGCGGC GGCCCGGGCC
GCGGAGCCGC TGGTCCACGC CGAGGAGTGG GAGCAGCTGT ACCGCCTCCT GCCCGAGGGA
ACCACCGACG CCGACGTCGC CCTCGTCCGC CGGGAGCTGG CGGGGCACTC CCCGGACCTG
GTCGCCGAGG CCTTCGAGAC CGCCCGGACC GACCAGCTCG CCCCGGCCGA GGCGCCGCCG
CACACGGCCT GGGACCGGCT CGTGGACCTG GCCGAGATGA CCGTGCCCGA GGGCGTCGAA
CTCCCGCTGC GGGTCTTCTG CGCCCTGCTG CCCTGCGCCG TCCCGCTGCG CGACTGCGCC
GACCTGCTCC TGCTCTGGGG CGGCGGGCGG CGCGAGGGCG CGGCCGCGCC GCCCGGCACG
CGCGCGCCGG CCCGGCTCGT CGTGCACGTC ACCCAGGGCC GGAGCCGGGA CCGCTACGAC
GTCGAGTACT GGACGGTGCT CAGCGAGAGA CGGGGCAGGG CCCCGGACTT CTGCGGCCAC
GGCCACACCC CGCACATGGA GGCCGAGCGC ATCGGCGCCC ACGTCGGCGG CCTGCTCACC
CTGCTGGAGG TGGACCACCG CACGGGGTAC CACGAGGGCG TGCGGGTCGA GCTGGTCGCA
CGGCTGGACC TGCTCCGCCG CCTGGAGGCC GAGCGCTGGC AGGAGGCCGG TGAGGGCGAC
CGCAGGCTCG GCGCGCGGGC CCAGGTGGTC TACCGGGCGG AGGAGCTGGT GGACCCCGCG
CGCGGCGACA CCGCGCAGGC CCGGCGTACC TGCGCCCGCC GGTGGGAGGG GCTCGCGAGC
GCGTGCGAGG TCCTGCACCT GGACACCGCG CACGAGGCCG GGCAGAGGAG GCGCAGGGGC
GGACGGACCT ACCGCGAACC CCTGGTCGAA AGGCTCCAGG ACGACAGGAT CGTCGCCCTC
TCCGTTCCCT CCCATCTGGA TGAATGCCAC ATGTCGGTCT GGTCGGCGCT CATGATCGGG
GTTCCGGTCG TGATCTGGCG TTCCACCGAC AATGGTCCTG GAGCAAGTCC GTGGCTCAAT
CTGGGGAAGG TGGGGCGGGA GGTAATGGTG CCACCCGAAA GAATACGAGC GCTTCCCAAG
GCGCTCCACC AATCGCGTTC CGGATGCGTT TCCCCGGACG ATACCGGGTA CATCGAAGAA
AGCTTCGAGG TCGCCGTCTT CTACCACGAC ACCCTTCCGG TTCTGCCCGC CCCCAGACCG
ATGACCCCCC CACAGCATCC GACAGCCCCC CTTGGGCGCG GCTCGAAGGG CCTGCCATGA
 
Protein sequence
MSETTATAGG QAPIGGRPPS EGGRLLPEGA GRPFPADGGV EDLVDALCAL RCVLLPDARE 
QLAYQLPTRV RRNLRLEGSP VEFARHLVRL CREQVRLGCL VRWILYLEER GRSSLAAARA
AEPLVHAEEW EQLYRLLPEG TTDADVALVR RELAGHSPDL VAEAFETART DQLAPAEAPP
HTAWDRLVDL AEMTVPEGVE LPLRVFCALL PCAVPLRDCA DLLLLWGGGR REGAAAPPGT
RAPARLVVHV TQGRSRDRYD VEYWTVLSER RGRAPDFCGH GHTPHMEAER IGAHVGGLLT
LLEVDHRTGY HEGVRVELVA RLDLLRRLEA ERWQEAGEGD RRLGARAQVV YRAEELVDPA
RGDTAQARRT CARRWEGLAS ACEVLHLDTA HEAGQRRRRG GRTYREPLVE RLQDDRIVAL
SVPSHLDECH MSVWSALMIG VPVVIWRSTD NGPGASPWLN LGKVGREVMV PPERIRALPK
ALHQSRSGCV SPDDTGYIEE SFEVAVFYHD TLPVLPAPRP MTPPQHPTAP LGRGSKGLP