Gene Ndas_1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1087 
Symbol 
ID9244933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1336321 
End bp1337373 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content75% 
IMG OID 
Producttranscriptional regulator, LacI family 
Protein accessionYP_003679035 
Protein GI297560061 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.978331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCACGA GGCCGCGTAT CAAGGACGTC GCGCGTCAGG CCGGGGTCTC GGAGAAAACC 
GTTTCCAACG TTATTAACGA CCATCCGCAC GTCCGGCCCG CCACGCGCGC CGCCGTGGAG
GCGGCCATCG ACGCGCTGGG CTACCGGGTC AACCTGGCCG GACGCCACCT GCGCCGCGGC
CGGACCGGTG TGATCGCGCT GGTCGTGCCC GAACTCGACC TGGGCTACTT CGCCGAACTC
GCCGACCTGG TCATCCGCGA GGCCGAGCGC CTCTCGCGCA CCGTCCTGGT CCACCAGAGC
GAGGCCCGCC GCGAGCGCGA GGAGTCCGCG CTGGAGGGGT TCGGCGCCGA CTTCGTGGAC
GGGGTCATCC TCAGCCCGCT CGCCATGGAC GACGCCGCCC TGCGCACGCA CCCCTCCCGG
CTGCCGGTGG TGCTCCTGGG CGAGCTGCCC CGCACGGTCC GGCACGGCCA CGTCGCCATC
GACAACGTGG CCGCCGCCCG GGAGGCGACC GAGCACCTGC TCGACGGCGG GCGCACCCGG
ATCGCGGTGG TGGGCGGTCG GCCGCCGGGC CCCTCCGGCA CGGCCGAGCT GCGCACCCGC
GGTTACCGCG AGGCCCTGGA GGCGCGGGGG AGGAGCTACG ACCCCGAGCT GGTGCGACCG
GCCGGGCACT TCCACTGGAG GGACGGGGCG GAGCTGGCCG CCGAACTCGT CGCCGGACCC
AGGCCGCCCG ACGCGCTGCT GTGCATGAAC GACCTGCTGG CGCTGGGGGC GATGCGCGCG
CTGCACGACG CAGGGGTGCG GGTCCCCCGG GACGTGGCGG TGGTGGGGTT CGACGACATC
GCCCCCGGGC GCTACTCCGT GCCGAGCCTG ACCACCGTCG CCCCGGACAA GCCGGGCCTG
GCCAGGGAGG CGGTGCGGCT GCTGCTGGAG GAGGTGGAGG CCCGTCGCGG CGCTCCGGAC
GCGGAGTCCG GGGCCGGGGC CGACCGGGCC TCGGCCAAGG TCGTCGTCGG CCACACCCTC
CTGGTGAGGG AGAGCAGCGC CAGCGTGCTC TGA
 
Protein sequence
MGTRPRIKDV ARQAGVSEKT VSNVINDHPH VRPATRAAVE AAIDALGYRV NLAGRHLRRG 
RTGVIALVVP ELDLGYFAEL ADLVIREAER LSRTVLVHQS EARREREESA LEGFGADFVD
GVILSPLAMD DAALRTHPSR LPVVLLGELP RTVRHGHVAI DNVAAAREAT EHLLDGGRTR
IAVVGGRPPG PSGTAELRTR GYREALEARG RSYDPELVRP AGHFHWRDGA ELAAELVAGP
RPPDALLCMN DLLALGAMRA LHDAGVRVPR DVAVVGFDDI APGRYSVPSL TTVAPDKPGL
AREAVRLLLE EVEARRGAPD AESGAGADRA SAKVVVGHTL LVRESSASVL