Gene Ndas_4931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4931 
Symbol 
ID9248818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp66959 
End bp67981 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content77% 
IMG OID 
Producttranscriptional regulator, LacI family 
Protein accessionYP_003682820 
Protein GI297563847 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.699714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACCA TCAAGGACGT CGCCGCGGCG GCGGGCGTGG CCCCGAGCAC GGTGTCCTAC 
GTGCTCAGCG GCTCCCGCCG CGTCTCCGAA CGGACCCGGT CCGCGGTGCG GACGGCCATC
GACGAACTGG GCTACCACCC CAACGCGGGT GCGCGCTCCC TGCGCAGCGC CCGCACCCGC
GTGATCGCGC TGGCCCTGCC GCTGGCCTCG CCCGGCTACC TGCCGGTCGG CGGGCGGTTC
ATGTACGGGC TGAGCCGCGC CGCCGGGGAG CTGGGCTACG ACCTGCTGCT CCTGACCGTG
CGCGACGACG ACGCCGGGGC GTACGGGCTG GAGCGCGCCG CGCGCAGCAG GCTCGCCGAC
GTCGCGGTGA TCATGGGCGT GGAGATGGAG GACCCCCGCA TCGGCGCGAT GGACGCGCTG
GGGTTCCCCG TGGTGGTCCT CGGCCGCCCC TCCGACGAGG ACGCCGCGCC CTGGGCCGAC
CTCGACTGGG AGGAGGCCGC GGTCGCCTCG CTGCGCCTGC TGCACGGGAC CGGGCACCGC
GACCTGTGCT TCGTGTCGAC TGTGGAGGAG GACATCGCCT CGGGCCGCAG CTACTCCGTG
CGCGGTCTGC GCGGGGCCGA ACGCGCCGCC GCCGAACTCG GCGTCCCCGT GCGCGTCCTG
CCCTCGGCGA AGGACCCCGC CGAACTGTAC CGGCGGCTGG ACGCGCTCCT CGACGGGGAC
CGCCCGCCCA CCGCGCTCGC CCTCCAGCAC CCCGCCGCCG TGCCCGGGGT CCTGCGCCAC
CTCGCAGCGC GCGGCACCGA CGTGCCCGGG GACGTCTCCC TGGTGGCCAT CGGCAGCTTC
CCCGAGGACC TCGCGGGACT GGACGTGACC CGGGTCGAGC TGCCCGTCGA GCGGATGTCG
GCCGCCGTGA CCCGGCTGGC CGCCGAGGCC GCGCGCGGCA GCCCGCCGCT CCCGGGAGGA
CGGCGCGAAC TCATCCCGCC CGAGATCACC CCCGGGGGGA CGGTAGCCGC TCCCCCGCCC
TGA
 
Protein sequence
MVTIKDVAAA AGVAPSTVSY VLSGSRRVSE RTRSAVRTAI DELGYHPNAG ARSLRSARTR 
VIALALPLAS PGYLPVGGRF MYGLSRAAGE LGYDLLLLTV RDDDAGAYGL ERAARSRLAD
VAVIMGVEME DPRIGAMDAL GFPVVVLGRP SDEDAAPWAD LDWEEAAVAS LRLLHGTGHR
DLCFVSTVEE DIASGRSYSV RGLRGAERAA AELGVPVRVL PSAKDPAELY RRLDALLDGD
RPPTALALQH PAAVPGVLRH LAARGTDVPG DVSLVAIGSF PEDLAGLDVT RVELPVERMS
AAVTRLAAEA ARGSPPLPGG RRELIPPEIT PGGTVAAPPP