Gene Ndas_0809 details


Gene Information

Locus tag: Ndas_0809
Symbol:
ID: 9244654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 998603
End bp: 999637
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: transcriptional regulator, LacI family
Protein accession: YP_003678759
Protein GI: 297559785
COG category:
COG ID:
TIGRFAM ID:
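
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 998603
end_bp = 999637
gene_length_bp = 1035
protein_length_aa = 344


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Both checks agree with the record: 999637 - 998603 + 1 = 1035 bp, and 1035 / 3 = 345 codons, one of which is the stop codon, leaving 344 residues.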


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.119427
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGCCAAGA CCGGTGCTGG TCGGAAGCGT CCGACCCTGG AGATGGTGGC GCAGCGCGCC 
GGGGTCGGAC GAGGGACCGT CTCGCGGGTG ATCAACGGGT CCGCGCAGGT GAGCCCCCGC
ACCCGGGAGG CCGTGCACGC CGCGATCGCC GAACTCGGCT ACAGCCCGAA CCAGGCCGCG
CGCACCCTGG TCACCAGGCG CACCGACACG ATCGCGCTGG TCGTCTCCGA GCCCCGGGAC
CGGCTGTTCT CCGACCCCTT CTTCGCCGAC ATCATCCGCG GAGTGAGCTC GGTCCTGCAC
GAGCGCGACC TGCAGCTCAT GCTCACCACG GCCCGGACCG AGGCCGAGCA CAAGCGCGTG
GGCGACTACC TCAGCGGCTT CCACGTGGAC GGCGCGCTGC TGATCTCCCT GCACAGCGAC
AATCCGCTCT CGGCCCGTCT GGACGAGGCC GGGGTGCCGG TCGTCCACGG CGGTCGCCCG
CACTCGCCCG AACAGCCCGC GCCCTACTGC GTCGACATCG ACAACATCGG CGGGGCCCGG
ATGGCCATCC GCCACCTCCT GGAGCGCGGA TGCCGACGGG TGGCCGCCAT CACCGGCCCC
CTGGACATGA ACGCCGGTGT GGAGCGCCTG CGCGGCTACC GCGAGGTCAT GGCCGCCGCC
GGACTGGAGG TGGACGACAG GCTCGTCGTG CAGGGCGACT TCAGCGTGGA GGGGGGAGCC
GAGGCGATGG AGCGGCTCCT GGGCACCGGG CTGGAGCCCG ACGCGGTGTT CGCGGCCTCC
GACATGATGG CGCTCGGCGG CCTGCGGGTG CTGCGCGCAC GCGGCCTGAG AGTTCCGGAG
GACGTGGCCC TGGTGGGTTA CGACGACACC GTCATGGCCC AGCACAGCGA CCCGCCGCTG
ACCACCATCC ACCAGCCCAC GGTGCAGATG GGGCAGGAGA TGGCGCGGCT GCTGGTGGAC
GTGGCGATCC CCCGCACGAC GGAGGCCGAG ACCGTCATGC TCGGCACCCA CCTGGTCGTG
CGCGAGTCCG GCTGA
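
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be cleaned before any computation such as the GC content reported in the record. The following is a rough sketch with hypothetical helper names; only the first line of the sequence is pasted in to keep the example short, and the full sequence would be handled the same way.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a sequence printed in 10-base groups."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste all lines for the whole gene.
first_line = "GTGGCCAAGA CCGGTGCTGG TCGGAAGCGT CCGACCCTGG AGATGGTGGC GCAGCGCGCC"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```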
 
Protein sequence
MAKTGAGRKR PTLEMVAQRA GVGRGTVSRV INGSAQVSPR TREAVHAAIA ELGYSPNQAA 
RTLVTRRTDT IALVVSEPRD RLFSDPFFAD IIRGVSSVLH ERDLQLMLTT ARTEAEHKRV
GDYLSGFHVD GALLISLHSD NPLSARLDEA GVPVVHGGRP HSPEQPAPYC VDIDNIGGAR
MAIRHLLERG CRRVAAITGP LDMNAGVERL RGYREVMAAA GLEVDDRLVV QGDFSVEGGA
EAMERLLGTG LEPDAVFAAS DMMALGGLRV LRARGLRVPE DVALVGYDDT VMAQHSDPPL
TTIHQPTVQM GQEMARLLVD VAIPRTTEAE TVMLGTHLVV RESG
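
The protein sequence can be reproduced from the gene sequence with translation table 11 (bacterial, archaeal and plant plastid code). The sketch below is an illustration rather than the database's own procedure: it builds the codon table by hand, assumes the sequence is given in the coding orientation, and reads an alternative start codon such as GTG in the first position as methionine. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical.

```python
from itertools import product

# Amino acids for table 11, with codons ordered by base T, C, A, G
# (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCCAAGA CCGGTGCTGG TCGGAAGCGT"))  # MAKTGAGRKR
```

The output matches the first ten residues of the protein sequence, and translating the last 15 bases (`CGCGAGTCCG GCTGA`) gives `RESG`, the final four residues before the TGA stop codon.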