Gene Ndas_5075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5075 
Symbol 
ID9248964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp219870 
End bp220922 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranscriptional regulator, LacI family 
Protein accessionYP_003682962 
Protein GI297563989 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCG GGAACACGGG TCGCAAGCGC CCCACGATCA GCGACGTCGC CCGCCGGGCG 
GGTGTGAGCA AGGGCGCCGT GTCCCGGGCG TTGAACACCG GTACCGGCAC CAGTGCCAGC
ACCCGTGAAC GCATCCGCCA GGCGGCGGTC GAACTGGGCT GGAACCCGAG TTACGCCGCA
CGCGCGCTCA ACGGCAAGTC GCTGGGCACC ATCGGACTGA TCGTGCGCCG CTCCCCCGAG
ATCCTCGACT TCGACGCGTT CTTCCCCTCG TTCCTGTCCG GTATCGAGTC GGTGCTCTCC
GGCGAGGAGC ACGCCACCGT CATCCGCTTC GTACCCGACG AGCGGACCGA GGCCGCCACC
TACGAGCGGC TGTTCAACGA CCACTTCGTC GACGGCTTCC TGGTCACCGA CCTGCGGACG
GACGACGGCC GTCCCGCGAT GCTGCGCCGC CTGGGCGCGC CCGCCGTGGT GGTGGGCGCG
CCCGAGGGCT CCTCCGACTT CCCCACCGTC ACCAACGACT CCAGGGAGGC GATCCGCGGT
CTGGTGCGCT GCTTCGCCGA GGCGGGGCAC CGGCGCATCG CCCACGTCCA GGGCGACCCC
CACATGCTGC ACGCGCACCA GCGCCGGCGC CACTGGGAGG AGGCCGTGCG CGAGTTCGGC
CTGGAGCCGG GGCCCGTGGA GGAGCACGGC GGCTACACCA TCGAGGGCGG GGCCCGGGCC
ACGGAGCGCA TCCTCGCCAG GCCCGCCGCC GAACGCCCCA CCGCCGTCTT CTACGGCAGC
GACCTCATGG CCATAGGCGG TTACTCCGTG CTGGGGGAGG CGGGTCTGAC CGTCCCCGAC
GACATGGCGG TGGCCGGGTT CGACGACATC CCCCTGGCCT CGTTCGTCAC ACCGCCGCTG
ACGACCGTCC GCAACAGGCA CCGCGCGCTG GGCAGTGTCG GGGCCCGCAT CCTGCTGGAC
ATGCTCAAGG GGCAGGAGCC GCCGCTGTCG ACCGTGCTCG TCGGTGAGCT CCGCCCGAGG
AAGTCGTCCG GACAGCCGAT CCAAGGCTTG TAA
 
Protein sequence
MATGNTGRKR PTISDVARRA GVSKGAVSRA LNTGTGTSAS TRERIRQAAV ELGWNPSYAA 
RALNGKSLGT IGLIVRRSPE ILDFDAFFPS FLSGIESVLS GEEHATVIRF VPDERTEAAT
YERLFNDHFV DGFLVTDLRT DDGRPAMLRR LGAPAVVVGA PEGSSDFPTV TNDSREAIRG
LVRCFAEAGH RRIAHVQGDP HMLHAHQRRR HWEEAVREFG LEPGPVEEHG GYTIEGGARA
TERILARPAA ERPTAVFYGS DLMAIGGYSV LGEAGLTVPD DMAVAGFDDI PLASFVTPPL
TTVRNRHRAL GSVGARILLD MLKGQEPPLS TVLVGELRPR KSSGQPIQGL