Gene Ndas_5530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5530 
Symbol 
ID9249433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp721769 
End bp723244 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content72% 
IMG OID 
ProductNLP/P60 protein 
Protein accessionYP_003683415 
Protein GI297564442 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.479781 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCG TCCCCTCCGT CCGCGACGCC GCGGCGCGCC TGCGGGCGCG GGTACGCGGC 
CTCCTCCTGC CCCGGCGCGG GCGCGACGGC GACCGGGGAG CCGCGAGCGG GACCGCGCGC
GTGGGCGGGG GCGTCACGGC GCTCGCGGTC ATGGGCGGCA TCGCCTACGC CTCCACCATG
TGCGGCCCGG TCGGCGGCGC CACGGCGAGC GGCTTCGGCG GACCGGAATC GTCGGCGGGT
GTGATCCAGG CGTCCGCGTC GTCGTCGCGC GAGACGATCA CCATCAGCTC CCACGGCGAA
CAGGGCTCCT CCACCAAGAC CGTCCGGGTG GACGTACCCG AGAACGTCCT GCGGGCGCAC
CGCACCGCAG CCGAGGCCTA CGGCCTGCCC TGGGAGCTCC TCGCCGCGGT CGGCGCGATC
GAGACCCAGA ACGGCGCCTA CGTCTCCACG GACCCGAACT GGCACTCCGG CCTGAACGAG
GGCGAGCGCA ACCCCTACGG CGCGGCCGGG ATCGTCCAGT TCGGTGTGCA GGACCCGCGG
ACGGGGCAGG TCGGCGGCCG GCTCGGCAGT GCGGGCAACG CGTGGGGAGG CAAGCCGAAG
GAACCGGTCG CGGACCGCCG CTGGCACTAC GACGTCGGCG AGATGCCCGC CAACCCGCGT
TACTTCGGCA TCGACGGCAA CGGCGACGGG ATGGTCAACG TCTGGGACCC GTGGGACAAC
ATCACCTCGG GTGCCTTCCG GCTCGCCTAC TACGCCCAGC AGGCCCGGGA GAACGGGGTC
GCCTCCGTCT GCGGGAGCGG ACGGGGGGAC CTGGACCCCA TCGAGTGCAC GGTCTACAGG
CACAACCCCG CCAGGTGGTA CGTCGCGCAA GTCCTGGAGG TGGCCGAGTA CTACCGGGGT
TCCGGCATCG CCCCGACCTC GCCCTCCCTC AACATCCAGA CCGCGTCCTT CGGCGGCGGC
CGGGACTGCG AGGAGCAGGG GGGCGGCGCC TCCCGGGCCG CGTTCCTGGG CGACGTCAGC
GACAGGCACC GGAGCGCGGT GCAGTTCGCC AGGGACCAGA TAGGCAAACC CTACGTGTGG
GCCGCCACGG GTCCCGACGC CTATGACTGC TCGGGCCTGG TCATGGCAGC CTGGAACTCC
GCGGGCGTCG CGATCCCCCG GACCACCTTC GCCCAGTGGC AGGACGGAGA GCCCGCCGCC
ACCTGGGACG GCGCGCAGGT CGGAGTGGTG GCCGACGGCG TCCTCGACGT CGGCGAGCTG
CAGGCGGGCG ATCTGCTCTA CTTCCACTAC TCCGGGCAGT CCCCGTCCCA CATGGGGATG
TACACCGGCG GCGGCATGAT GATCCACGCA CCGGCGCCGG GCAAGACCAT CACCGAGGTG
TCGATCGAGA CCGAGCACTA CCGGACGGTC TTCGTGGGGG CGATCCGGAT CGATCCGGGA
TCGTCCGGCG ACGAGCCGGA GAGCATGATG GCGTGA
 
Protein sequence
MSRVPSVRDA AARLRARVRG LLLPRRGRDG DRGAASGTAR VGGGVTALAV MGGIAYASTM 
CGPVGGATAS GFGGPESSAG VIQASASSSR ETITISSHGE QGSSTKTVRV DVPENVLRAH
RTAAEAYGLP WELLAAVGAI ETQNGAYVST DPNWHSGLNE GERNPYGAAG IVQFGVQDPR
TGQVGGRLGS AGNAWGGKPK EPVADRRWHY DVGEMPANPR YFGIDGNGDG MVNVWDPWDN
ITSGAFRLAY YAQQARENGV ASVCGSGRGD LDPIECTVYR HNPARWYVAQ VLEVAEYYRG
SGIAPTSPSL NIQTASFGGG RDCEEQGGGA SRAAFLGDVS DRHRSAVQFA RDQIGKPYVW
AATGPDAYDC SGLVMAAWNS AGVAIPRTTF AQWQDGEPAA TWDGAQVGVV ADGVLDVGEL
QAGDLLYFHY SGQSPSHMGM YTGGGMMIHA PAPGKTITEV SIETEHYRTV FVGAIRIDPG
SSGDEPESMM A