Gene Ndas_0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0820 
Symbol 
ID9244665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1011303 
End bp1012517 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content76% 
IMG OID 
Productlipid A biosynthesis acyltransferase 
Protein accessionYP_003678770 
Protein GI297559796 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.539919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0772166 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGAAC GCACGGCCGA CCTGGCCTAC ACGGCGGGGT GGGCGATGAT CCGCCGCACG 
CCCGAGAGCG CGGGGCGGGC GCTGTTCCGA CGCCTGGCCG ACCGCTCCTG GCGCGCCCAC
GACGAGAGCA CGCGCGGCCT GGAGCGCAAC CTGAGGCGCC TGGTCGGCCC CGGGGCCACG
GACGCCCAGC TGCGCGCGCT CTCCCGCGCG GGAATGCGCT CCTACATGCG CTACTACTAC
GAGATGTTCC GCCTCCCGGC GATGGGTGAG GAGTACGTCC TGGGCCGGAC CCGCGCCACC
GGGATCGAGG TCCTGGAGGA GCACGTCCGG TCGGGCCGCG GTGTGGTCGC CGCCCTGCCC
CACATGGGCA ACTGGGACCA CGCCGGGGCC TGGATCGCCC TGAGGGGCAC CCCCCTGACC
ACCGTCGCGC AGCGGCTGCG CCCCGAGAGC CTGTTCCAGC GCTTCACCGC CTACCGGGAG
TCCCTGGGCA TGGAGGTGCT GCCGCTGACC GGAGGCTCGA ACACCGTGGG CACCCTGGCC
CGGCGGCTGC GCGGGGGCGG ACTGGTGTGC CTGCTCGCCG ACCGCGACAT CAGCGGCACC
GGCCTGGAGG TGGACTTCTT CGGGGAGCGT GCGCGCGTGC CCGCCGGGCC CGCCGCGCTG
GCCCTCAACA CCGGCGCGGC CCTGATGCCG GTCTCGCTGT GGTACGACGG CCCGTACTGG
AACATCCGGG TCCACGACGA GATCCCCGTC TCCGGGGGAG CCACCCGCGC CGAGCGGGTC
CAGGCCACGA CCCAGGAGCT GGTCCGCGTC TTCGAGGGGG CGATCGCCGA GCACCCCGAG
GACTGGCACA TGCTCCAGCC GGTGTTCAGC GCCGACCACG CGCGTGTCTC GCGCGGCCGC
GGAGCCGACG GCGGCGTTCC GGCCCCGGTC GCCGCCGACC GCGCGCGCGT CCCGCGAGGG
GCGACCGCGG AGACCGCTGT GCCCGCGGTG ACGGACGAGA GCACCGCGTC GGGAGGGGCG
GGCGACGGCA CCGCGCCGGG TGGCGCTGTC CGGGAGGACC CGGTCGGCGG CACGGTACCG
GACGAGGGTG CGCCGGGAGG GGCGGGCGAC GCAACCGCAC CGGGCACTCG GGGCGGGTTC
ACCGCGGCGA ACGGGGTAGG GGCCCCTCAA GGCAGCGGGG CGAGGCCCCC CGGACGAGAC
GAACGGAACG GGTGA
 
Protein sequence
MDERTADLAY TAGWAMIRRT PESAGRALFR RLADRSWRAH DESTRGLERN LRRLVGPGAT 
DAQLRALSRA GMRSYMRYYY EMFRLPAMGE EYVLGRTRAT GIEVLEEHVR SGRGVVAALP
HMGNWDHAGA WIALRGTPLT TVAQRLRPES LFQRFTAYRE SLGMEVLPLT GGSNTVGTLA
RRLRGGGLVC LLADRDISGT GLEVDFFGER ARVPAGPAAL ALNTGAALMP VSLWYDGPYW
NIRVHDEIPV SGGATRAERV QATTQELVRV FEGAIAEHPE DWHMLQPVFS ADHARVSRGR
GADGGVPAPV AADRARVPRG ATAETAVPAV TDESTASGGA GDGTAPGGAV REDPVGGTVP
DEGAPGGAGD ATAPGTRGGF TAANGVGAPQ GSGARPPGRD ERNG