Gene Ndas_2029 details

Gene Information

Locus tag: Ndas_2029
Symbol:
ID: 9245879
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2449082
End bp: 2450113
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: transcriptional regulator, LacI family
Protein accession: YP_003679961
Protein GI: 297560987
COG category:
COG ID:
TIGRFAM ID:
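
The start, end, gene length, and protein length values in this record are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; neither is stated in the record, but both match the listed values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2449082, 2450113)
    print(length)                  # 1032
    print(protein_length(length))  # 343
```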


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.259814
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAGTCCG GTTACGTGAC GCTGGAACAG GTCGCCGAGC ACGCCGGGGT GTCCCTGGCC 
ACGGCGTCCA GGGTGATCAA CGGCAGCACA CGCCAGGTCA GTCAGCGCCT GCGCGACAAG
GTGACCGCCA GCGCGCGCGA ACTCGGCTAC CTGGCCAACG CCTCGGCCCA GACCCTGGCC
CGCAACAGCA GCGCCCTGGT CGGCCTGCTC GTCCACGACA TCTCCGACCC CTACTTCTCC
TCGATCGCCG CCGGGGTCAC CCGCCACACC GAGGAACAGG GACTGGTCCT GGTGCTGGGC
ACCACCAACC GCTCCGCGCA GAAGGAGGGC CGCATCCTGG CCACGCTGCG CGCGCACCGG
GCCCGCGCGG TGGTGCTGGT CGGCTCGCGC TCCACCGACG AGGAGAGCAA CCGGCGCCTG
TCCGAGGAGA TCCGCATGTT CCGCCGCCAG GGCGGGCGGG TGGCGTGCGT GTCCCAGGAG
GGCCTGCCCG CGGACACGGT CACCCCCGAC AACCACACCG GGGCCTCCGA CCTGGCCCGC
CGCCTCATCG CCCAGGGGCA CCGGGAGTTC GCCATCCTGG CCGGGCCCAC CGACCTCCAG
ACCGCCCGCG AACGCCTGGA CGGGTTCCAC TCCGCGCTCT CGGGCGCGGG GCTGGAACTG
GCCCACCACA ACGTGGTGCA CGGCGCCTTC ACCCGCGACG GCGGCTACGA GTCCACCCGC
CGCCTGATGG CGGTGGGCAC CGACGCCACG TGTCTGTTCG CCGTCAACGA CGTCATGGCC
ACGGGGGCGA TGGCCGCGCT GCGCGACCTG GGCCTGCGGG TGCCCACCGA CCTGTCCGTG
GCCGGGTTCG ACGACATCCC CACCCTGCGC GACCTCACCC CCGCCCTGAC CACGGTGCGC
CTGCCGCTGG AGGAGATGGG CGAACGCGCC GCTGTGCTCG CCCTGGACGG CGATCCCAGC
GACCAGCCCC GCGTGGTCAC CGTGCGCGGC GAGGTCGTCG AGCGCGAGAG CACCGCGCCG
CCCACCCGCT GA
 
Protein sequence
MESGYVTLEQ VAEHAGVSLA TASRVINGST RQVSQRLRDK VTASARELGY LANASAQTLA 
RNSSALVGLL VHDISDPYFS SIAAGVTRHT EEQGLVLVLG TTNRSAQKEG RILATLRAHR
ARAVVLVGSR STDEESNRRL SEEIRMFRRQ GGRVACVSQE GLPADTVTPD NHTGASDLAR
RLIAQGHREF AILAGPTDLQ TARERLDGFH SALSGAGLEL AHHNVVHGAF TRDGGYESTR
RLMAVGTDAT CLFAVNDVMA TGAMAALRDL GLRVPTDLSV AGFDDIPTLR DLTPALTTVR
LPLEEMGERA AVLALDGDPS DQPRVVTVRG EVVERESTAP PTR
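
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It is not the pipeline that produced this record. It assumes translation table 11, whose amino acid assignments match the standard code, and it treats GTG, ATG, and TTG in the first position as initiator codons read as M, which is a simplification of the full table 11 start set. The names `translate` and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (TTT, TTC, TTA, TTG, TCT, ...), as used by table 11.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplifying assumption: only these initiators are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at '*'."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("GTGGAGTCCG GTTACGTGAC GCTGGAACAG"))  # MESGYVTLEQ
```

The leading GTG codon would encode V in an internal position; reading it as M at the start is what reconciles the first residue of the protein sequence with the first codon of the gene sequence.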