Gene Ndas_1696 details

Gene Information

Locus tag: Ndas_1696
Symbol:
ID: 9245546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2067747
End bp: 2068787
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: SMP-30/Gluconolaconase/LRE domain protein
Protein accession: YP_003679631
Protein GI: 297560657
COG category:
COG ID:
TIGRFAM ID:
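
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2067747
end_bp = 2068787

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1041 0 346
```

Under these assumptions the result agrees with the listed gene length of 1041 bp and protein length of 346 aa.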


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.263478
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACAGC ACTGCCCCAG GCCGCGACGA ACGGTGGCCG CCCGCGCACT CGCCGCACTG 
GCCCTGACCG GCGCGGCGGC CGGATGCGCC GCCCCCGCCG CCACCGGCGA GGACGGCCAG
GACGGCGGAA CCGAGCGCAC CGCCGAACTC CTGGTCCAGG TGACCTCCGT CCACGAGGAG
ACGGGGATGA CCCTGCTCGA AGGCCCGACC TTCGACGCCG ACGGACGCCT GCTCGTCGTG
GACGTCACCG CCCCCGCCGG GGAGCCCAAG GTGCTCCGGG TGGACACCGG AACCCGGGAG
GTGACCCCGG TGTTCACCGA CGAGACCGGC GCCTACACCT CCGCGCAGTT CAGTCCGCAC
GACGACCGGC TCTACCTGAC CGACTTCGCC GGTGGGAAGA TCGACAGCAT CACCCCCGAG
GGCGAGGACC ACACCACGTT CTTCTCCGGG GAGGTCGACG GAGCGCCGAT GAACCCCGAC
GACCTGGCCT TCGACGAGGC CGGGAACATG TACGTCAGCG ACTCGGCCGG GTTCGACGGT
CCGGCGTGGG AGGCCCGGGG CAGGGTCGTG CGCGTCGACC GCGACACCGC GGAGGCGACC
GTCCTGGCCG AGGAGCTGCC CGCGCCCAAC GGCATCTCGT TCACCGCGGA CTTCTCGGGG
CTGTGGGTCG GCCAGTACGG CGCCAACCGC GTCGACCACT ACGCGCTGAA CGAGGACGGC
ACCGAGGTGG AGACCTCCCA CGCCGCCCTG TACTTCGACG GAGGCACGAG CCGGATCGAC
TCCATCGCGG TGGACGCCGA CGGCAACCTC TACCAGGCCG TCCACGGCCA GCCGCGCATC
TTCGTGTACA GCCCGCTCGG TGAGCACCTG GCGACGGTCG GCGTCCCGGC CGACGCCGCC
GAGGGGCTGT ACTCGGCCAC CAACGTGGCC ATCGCACCGG GGACGACCGA CGCCTACATG
ACCGTCAGCG GGGACGACGG CGGGTTCGTC TACTCCTTCG ACGCGCTCGC CGAGGGGATC
CGCCAGTCCA ACGGCGGCTG A
 
Protein sequence
MEQHCPRPRR TVAARALAAL ALTGAAAGCA APAATGEDGQ DGGTERTAEL LVQVTSVHEE 
TGMTLLEGPT FDADGRLLVV DVTAPAGEPK VLRVDTGTRE VTPVFTDETG AYTSAQFSPH
DDRLYLTDFA GGKIDSITPE GEDHTTFFSG EVDGAPMNPD DLAFDEAGNM YVSDSAGFDG
PAWEARGRVV RVDRDTAEAT VLAEELPAPN GISFTADFSG LWVGQYGANR VDHYALNEDG
TEVETSHAAL YFDGGTSRID SIAVDADGNL YQAVHGQPRI FVYSPLGEHL ATVGVPADAA
EGLYSATNVA IAPGTTDAYM TVSGDDGGFV YSFDALAEGI RQSNGG
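
The gene and protein sequences can be related to each other, and to the GC content field, with a short script. The following is a rough Python sketch, not the database's own method. It assumes the sequence is pasted as text in the space-separated 10-base blocks shown above. It uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI layout: bases ordered T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence listed above.
print(translate("ATGGAACAGC ACTGCCCCAG GCCGCGACGA"))  # prints: MEQHCPRPRR
print(translate("CGCCAGTCCA ACGGCGGCTG A"))           # prints: RQSNGG
```

The two printed fragments match the first ten and last six residues of the protein sequence above. Passing the complete gene sequence to `gc_percent` gives a value that can be compared against the listed GC content.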