Gene Ndas_3967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3967 
Symbol 
ID9247838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4743404 
End bp4745962 
Gene Length2559 bp 
Protein Length852 aa 
Translation table11 
GC content79% 
IMG OID 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_003681870 
Protein GI297562896 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCG ACGCTCCCGG CCCGTCCGGA CCGGAACTGA GGGAGTACAC GGCACTGCTG 
CGCCGCCGCT GGCGGTTCGT CGGCGCCGGG GTGCTCGGCG GGCTGGCCCT GGCCTCCGCC
GCGGTGGTCG CACTCCCCGC CGCCTACACC TCCGTCTCCG CCGTCCAGGT CCAGCCCAGC
GGGATGGCCG AGTTCACCGG GGAGCGCTCC GGCCGCCTGG CCGGGGACGT CAACCTCGAC
ACCGAGGCAC AGGTCCTGCT GTCGGAGCGC GTGTCGTCCG CCGTGGCCGA GGCCCTGGCG
GAGGAGGGCG GTGCCGCGCC CTCCGTGGCG GACCTGCGCG AGCGGGTGGA CGTGAGCGTT
CCCCCCAACA GCAGCGTCCT GGAGATCAGC TACTCCGCCG GGAGCCCCGA GGCCGCGCGG
GCGGGCGCGC AGGCCTACGC CGACGCCTAC CTCGAACTGC GCCGCGAGCG GATCGACGGG
CTGATCGAGA GCCACCTGGA GGCGCTGCGC GGTGAGCAGG AGGCCCGTTA CGAGGCCCTC
GCCGAGACCG CCGGGGAGTC CGCCGCCCCC GGCGCGGACG CGCGGGTGGA GGCTCTGCGC
GCGGAGATCA CCGAGCTGGG CAACGGCATC AGCCCGCTCA GCGCCCTGGC GGAGACGGTC
GAGCCCGGCA GTGTCATCAC CCCGGCGGGG CTCCCCGAGC GCGCGAGCAG CCCGATACCC
GCCCTGTGGC TGGTCGGCGG CGCCGCGCTC GGCCTGCTGA CGGGGCTGCT GGCGGCCGTG
GTGCGCGACC GCCTCGACCC GAGGCTGCAC GACGCCGAGG AGACCGGGCG GATCGGGGCG
GTGCCGGTGC TGCTCGACCT GTCCGAGCGG GTCGGGCGCG GCCAGCGCTC CCCCGGGCTG
CTGCCGGACG GCGACCGGCG CGGGCAGCGG GTCAACGAGT TCGCGCACCT GGTGCGCGCC
CGCCTCGCCG CGGTCCCCGT TCCCGCGGCG GCCGGCGGAC CCTCGGAGGT CGCTGGGCCG
ACCGGGGCGG GTGGGACGGG GGTGTCCGAC CGCGCCGGAG CGGGCGCGGG CGGCGCCCGG
GGGGAGGGCG AACTGGCCGT GCTCCTCGGG GACGAGGAGG CGGTCCTCGG CCGCGTGCTG
CTCGTCACCG CGACCACCCC GGGCCGCGCC GGTGCGGCGA CCGCGGTGAA CCTGGCCGCC
TCCCTGGCCC GCACGGGCTC GGAGACGCTG CTGGTGTGCG CCGACCCCCG CACCGACGCG
GTCGGCGAGC TGCTGGGGCT GCCGGAGGGC CCCGGCCTGG CCGAGGCGCT GCTGGAGGGT
GAGGACCCCG CCGACCTGGA GGTGCGGCCC GACGCGGTGC CCCGTCTGCG GGTGCTGCGC
TACGGGTCGC CGGGTATGGA CGCGCCGGTG CAGGGCACCG CCATGCCGGA ACTGGTGCGG
CTGCTGCGGG CGGGGGCGGA GTACGTGGTG GTGGCGGTGG CGCCGGTGAG CGAGCGGGCC
GACGCGCACG CGCTGGCCGG TTCGGCCGAC CTGATGCTGC CGGTGGTCGA ACTGGACCGC
ACGCGCCGCG CCGAACTGGG GGAGCTGCTC GTCCTCGCCG ACCGGTTCGG GGTTCCCGTT
CCGGGCACGG CCGTGCTGCC GCGCCAGCCG CTGGCCGGAC CCGCGCCGGT GGCGTCGCCG
TCCGCGGCCG AGCCGGTCGC CGGGGAACAG ACCACCGGGC CCGACCGCAC GCCCGGAAGG
GACGGAGCGG CCGGGGCGGA CGAGGCGGCC GGGGCCGCGA AGGGCCGCGC CGGTGCGGGG
ATCACGCTGA CCGGCATCGT CAGCGAGCTG CCCGCCACCG CCGGGACCGC GGGGACGCGG
GGGCGCGGCG GGGCGGCCCG GCCCCCGGCT CCCAGGGACG CCGAGGGTGT CCCCGGGGTT
CCCGGTCCCC GGCGGGCGGA GGCCGCGGAG GAGCCCGAGG CCGCGGAGAG CGGGAGGAGC
ACGGCCGACG GTGCCGCCGG CACGCCTTCG CGGGCTCGCG ACGGGGACGC GGAGTCCGCG
GTCCCGGGCG GCGGCGAGCG CGCCGGTGAC GAGCGCGCCG GAGGCGACGA CACCGCCCAG
GAGCGCTCCG GGGTGTCCGA AGCCGAGACG GCCGAGCGGA TCGCGGAGGC GGCCGGGGCG
GACGACGGCA CCGCGCAGGA CGCGGCGGCC CCCCGCGACG GCGGGATACC CGCGGGCCTG
GAGGTCCCCC GCGTGCTGGG AGGCGACGGC GCCACCCAGA TCCCGGTGAC GCCCGAGGCC
GCCGGGACCG GCCGGACCGA GGAGGACGCC GCGGCCCCGG GTACGGCCTC GGAGGCGGAG
ATGGCCTTCG GGCTGGCCCG GACCGCCGAG GCGCGGGAGC CCCACGACCC CGAGGCGACC
GTCAGCGGCG CCGAGGCCAC GGAGGCCCTC GTGGCCTTCG CGGCCGAGGG GGCCGAGGGG
GCCGAGGAGT CCCGGCCGAC CGGAGCCGCC GGGGACTCCG ACTCCCCGGA CAGCGCCCCT
GACACCGCCC CGGACACGGC CCCGGGGACA CGGAACTGA
 
Protein sequence
MDTDAPGPSG PELREYTALL RRRWRFVGAG VLGGLALASA AVVALPAAYT SVSAVQVQPS 
GMAEFTGERS GRLAGDVNLD TEAQVLLSER VSSAVAEALA EEGGAAPSVA DLRERVDVSV
PPNSSVLEIS YSAGSPEAAR AGAQAYADAY LELRRERIDG LIESHLEALR GEQEARYEAL
AETAGESAAP GADARVEALR AEITELGNGI SPLSALAETV EPGSVITPAG LPERASSPIP
ALWLVGGAAL GLLTGLLAAV VRDRLDPRLH DAEETGRIGA VPVLLDLSER VGRGQRSPGL
LPDGDRRGQR VNEFAHLVRA RLAAVPVPAA AGGPSEVAGP TGAGGTGVSD RAGAGAGGAR
GEGELAVLLG DEEAVLGRVL LVTATTPGRA GAATAVNLAA SLARTGSETL LVCADPRTDA
VGELLGLPEG PGLAEALLEG EDPADLEVRP DAVPRLRVLR YGSPGMDAPV QGTAMPELVR
LLRAGAEYVV VAVAPVSERA DAHALAGSAD LMLPVVELDR TRRAELGELL VLADRFGVPV
PGTAVLPRQP LAGPAPVASP SAAEPVAGEQ TTGPDRTPGR DGAAGADEAA GAAKGRAGAG
ITLTGIVSEL PATAGTAGTR GRGGAARPPA PRDAEGVPGV PGPRRAEAAE EPEAAESGRS
TADGAAGTPS RARDGDAESA VPGGGERAGD ERAGGDDTAQ ERSGVSEAET AERIAEAAGA
DDGTAQDAAA PRDGGIPAGL EVPRVLGGDG ATQIPVTPEA AGTGRTEEDA AAPGTASEAE
MAFGLARTAE AREPHDPEAT VSGAEATEAL VAFAAEGAEG AEESRPTGAA GDSDSPDSAP
DTAPDTAPGT RN