Gene Ndas_5533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5533 
Symbol 
ID9249436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp724875 
End bp726002 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content76% 
IMG OID 
Productprotein of unknown function DUF58 
Protein accessionYP_003683418 
Protein GI297564445 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0610248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACGA CACGGGGATG GCTGGTCGCC GGCTCGGGCG TGCTCCTGCT CGCGGTCGGC 
GTCCTGTCCC AGTACCAGGA GATCGCGCTG CTCGGCGGTG TCGCCGTGAC CGTGGTCGCC
GTGGCGGTGC TGCTGGTGGG GCGCCCCGCC GGGGTGCGCG TCCAGCGCTC GGCCTCCACC
ACCCGGACCT CTCCCGGCAC CACCGTGCGC GTGCGGGTCG AGGCGAGCAA CACCGGACGG
CGCTCCGTGC AGGTCAGTGA GCGCGTCCTC GGCTCCGACG GCGAGCGCGC GGTGCCGCTG
CGCCCCCTGG CCGCACGCGC CACCGGCGGC TCCGACTACC GGATCGGGGC CCTGCGCCGC
GGGGTGGTCG AGCTGGGTCC CCTGCGGGCC GGGCGCTCCG ACCCGTTGGG ACTGGCCTCG
CTGCACCGCG ACCACGGCGG TACCGAACGG GTCTGGGTGC ACCCCCGCTG GGAGCACCTG
CGCGCCGTAC CGATCGGCCG GGTGGCCGAC CCCGACGGCG CGGCGGACGG CGCGCCCGCG
GGCACCCTGA CCTTCCACGC CCTGCGCGAC TACGTGCCCG GCGACGACCT GCGCCACGTC
CACTGGCGCA GCTCCGCGCG GCTGGACAGG CTCGTCGTGC GCGAGTACAT CGACACCTCC
CAGACCCGGA TCTGCGTCAT CGTCGACGAC CGCCCCACAC CCGGCGGCGA GGCCCGCCTG
GACGAGGTGG CCGGCGCGGC GGCCTCCATC GCGGCCACCG CCGTCCGCTC GTCCCTGCAC
TGCGAACTGC GCCTGGCCAG CGGCAGGGGC AGGGAGAGCA CGGGCGGCCT GCCCCCGCTG
CTCGACCTGC TCTCCGAGGC CCGGAGCACT CCGGGGGCAG ACCTGCACCG CGCCCTGCTC
CTGGCCCGCA CCCGTCCCGC CGGTGACACC GCGGTCCTGG TCAGCGGCGC GCTCACCGCC
GAGGACCTGC GGTCGTTCGG GCGGCTCGGC GACCGCTACG CGGGCCTGAT CGCCGTCGTC
GTCGGATCGG AGGAGCACCC GACCGCGCCC CCGGACGTCA CCCTGCTCAC CGCCGGTGAC
ACCGCCGGGT TCGCCGACCG ATGGAACGAG GCGCCGTGGT CACGCTGA
 
Protein sequence
MPTTRGWLVA GSGVLLLAVG VLSQYQEIAL LGGVAVTVVA VAVLLVGRPA GVRVQRSAST 
TRTSPGTTVR VRVEASNTGR RSVQVSERVL GSDGERAVPL RPLAARATGG SDYRIGALRR
GVVELGPLRA GRSDPLGLAS LHRDHGGTER VWVHPRWEHL RAVPIGRVAD PDGAADGAPA
GTLTFHALRD YVPGDDLRHV HWRSSARLDR LVVREYIDTS QTRICVIVDD RPTPGGEARL
DEVAGAAASI AATAVRSSLH CELRLASGRG RESTGGLPPL LDLLSEARST PGADLHRALL
LARTRPAGDT AVLVSGALTA EDLRSFGRLG DRYAGLIAVV VGSEEHPTAP PDVTLLTAGD
TAGFADRWNE APWSR