Gene Ndas_1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1914 
Symbol 
ID9245764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2334172 
End bp2335182 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content75% 
IMG OID 
Producttranscriptional regulator, LacI family 
Protein accessionYP_003679847 
Protein GI297560873 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.04666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.595127 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAAC CGACCAAGCG CGCGACCCTG CGGGACGTCG CGGCGGGGGC GGGCGTCTCG 
GTCGCCCAGG CCAGCTTCGC CCTCAACGGC ACCGGCCGCG TGGCCGCCGC GACCGCCGAG
CGGGTCCGCC GGATCGCCGC GGAACTCGAC TACCAGGCCG ACGGCCGGGC CCGGGCGCTG
CGCACTGGTC GGCGCTCCGC CTACGGCGTG GTGATCCGCA ACATGCGCAA CCCCTTCTTC
CTGGACGTGC TGCGCGGCAT GGAGACCGTC GCGCACCGCG AGGGGGCGCT CCTGCTGATC
ATGAGCTCGG ACTACGACCA GGAGCGGGAG AGCGCCGCGC TGCGCAGGCT CGCCGCCGAG
GCGGTCGCCG GGATCGCCAT CGCGCCCATC GGGCGGCGCG ACCGCCTCCT GGAGTGGATG
GACCGCCACT CCCACGTTCC GGTGGTGGCC TTCAACTGCA CACCCGAACC CGACCGGGAG
GGCACCGCCA GCCGCCTGTC CACGGTCGGC CCCGACGACG AGGAGGCCGT CGCCCGGGCC
GTGGCGCACC TGGCGCAGCG GGGCCACCGC GAGGCCACGC TGCTGATGGC CCCCGAGCAC
CTGGCCGCCG ACTGGGGGCG CGAGGAGGCC TTCCAACGCC ACTGCGCCGA GCACGGGGTG
GCCGGTTCGG TGGCACGCGG ACCCCTGGAC TACGAGGCGG TGGCCCGCAG GTCCGCGGAG
ATGATGGCGC GCCCCGGGCA CCGCGCCCTG GTCGTCAACT CCGACCACCT GAGCGCCGCC
GTCTACGACG CCGCCCGCTC CCTGGGCCTG CGCGTGGGGC GCGACGTCAG CGTGGTGGGC
CACGACGACC TGCCCACCTC GGCCCTGCTG GACCCCGGCC TGACCACGAT CGCCGTGGAG
CGCGAGGTAC TGGGGGAGCG GATCATGAAC CTGCTCGTGG AGGGCCCAGG CGCCGCCGTG
CGGCTGCCCG TGCGCCTGGT GGAACGGGGG TCGGTGGCGG TCCTGGAGTG A
 
Protein sequence
MPEPTKRATL RDVAAGAGVS VAQASFALNG TGRVAAATAE RVRRIAAELD YQADGRARAL 
RTGRRSAYGV VIRNMRNPFF LDVLRGMETV AHREGALLLI MSSDYDQERE SAALRRLAAE
AVAGIAIAPI GRRDRLLEWM DRHSHVPVVA FNCTPEPDRE GTASRLSTVG PDDEEAVARA
VAHLAQRGHR EATLLMAPEH LAADWGREEA FQRHCAEHGV AGSVARGPLD YEAVARRSAE
MMARPGHRAL VVNSDHLSAA VYDAARSLGL RVGRDVSVVG HDDLPTSALL DPGLTTIAVE
REVLGERIMN LLVEGPGAAV RLPVRLVERG SVAVLE