Gene Ndas_0389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0389 
Symbol 
ID9244227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp479333 
End bp480424 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content77% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003678343 
Protein GI297559369 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTCA CCCCCGCGCC CGCGCCCACC GCCGGACCGG CCCCTCACGA CGCCCTGAGC 
CTGCCCGGCG TGCTCGACGC CGGGCAGCTG CACCGCTCGG CCTCCTACCT CGCGCGCTGT
CAGGAGGACG GCGGGGCCAT CCCGTGGTTC CCCGGCGGGC ACACCGACGT GTGGGACCAC
GTGGAGTGCG CCATGGCCCT GACCGTGACC GGCCGCAGCG CGCCCGGACA CGCCGAGGCG
GCCCGACGCG CCTACCTGTG GCTGGCCGAC AGCCGTGCGC CCGGGGGCGG GTGGCCCGCC
AAGTTCCGCC AGGGCGTCCC GGTGACGCGG CTGCGCGAGG CCAACCACGC CGCCTACCCC
GCCGTGGGCC TGTTCCACCA CCTGCTCGTC ACCGGCGACA CCGCGTTCGC CGAGCGGATG
TGGCCGGTGG TCGAGGAGGG ACTGGAGTTC GTCCTGGCCC TGCGCGGCGA GCACGGCGAG
ATCCTGTGGG CCCGCTCGGA GAACGGCGCC CCCGGGGACC ACGCCCTGCT GACGGTGTGC
GCGAGCGTGC ACCACGCGCT GCGCTGCGGC GCCGCCCTGG CCGCGCGCCT GGGCCGCTCC
CGCCCCGCGT GGACGGCCGC CGCCGACCGC CTCGCGGTGC TCATCAACGG GCACGAGGAC
CTGTTCGCCG ACCGCGGGCG CTTCTCGATG GACTGGTTCT ACCCCGTCCT GGGCGGCGCC
GTGCGCGGCG CCGCCGCCAA GGAGCGCATC GCCGAGCGCT GGGACCGCTT CGTGGTGCCC
GGGCTGGGCG TGCGCTGCGT GAGCGACCAG CCGTGGGTGA CCGCGGCGGA GACCTCGGAG
CTGGTGCTGG CCCTGGCCGC CGTCGGCGAC GTGGACGCGG GCGTGCGCCT CCTGCGGGAC
GTGCAGCACC TGCGCGACGC CGACGACGGC GCGTACTGGA CGGGCTACCA GTTCGCCGAG
CAGGTGCGCT GGCCGGTGGA GCGCAGCACG TGGACCTCGG CCGCCGTGAT CCTGGCGGTG
GACGCGCTCA CCGGGACCAC ACCGGGCTCG CGGGTCTTCC TGCACACCTG GGACGGGGAC
CCCGCCGACT AG
 
Protein sequence
MSVTPAPAPT AGPAPHDALS LPGVLDAGQL HRSASYLARC QEDGGAIPWF PGGHTDVWDH 
VECAMALTVT GRSAPGHAEA ARRAYLWLAD SRAPGGGWPA KFRQGVPVTR LREANHAAYP
AVGLFHHLLV TGDTAFAERM WPVVEEGLEF VLALRGEHGE ILWARSENGA PGDHALLTVC
ASVHHALRCG AALAARLGRS RPAWTAAADR LAVLINGHED LFADRGRFSM DWFYPVLGGA
VRGAAAKERI AERWDRFVVP GLGVRCVSDQ PWVTAAETSE LVLALAAVGD VDAGVRLLRD
VQHLRDADDG AYWTGYQFAE QVRWPVERST WTSAAVILAV DALTGTTPGS RVFLHTWDGD
PAD