Gene Ndas_4294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4294 
Symbol 
ID9248168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5110296 
End bp5111327 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content73% 
IMG OID 
Producttranscriptional regulator, GntR family 
Protein accessionYP_003682189 
Protein GI297563215 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.806766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGTCCA CGACCGGAAA AAAGGAACTG CCGCCCTACG CGCGGGTCGT CACCGACATC 
CGCGCGCGCA TCGGCTCCGG GGAGTTGCGA CCGGGCGAAC GGGTGCCCTC CACCCGGGAG
ATCATGCGCG AGTGGGGGGT GGCCATGGCC ACCGCCACCA AGGCGCTGGC CGCCCTGCGC
CAGGAGGGCC TGGTCGAGGC GGTGCGCGGA GTGGGCACCC TCGTGCGCGG TGCACCCGCC
TCCGCACCGG AGCCGCAGGG ACCACAGCGC CAGCGGGAAC GCCCGCGTCC GGCGCGGACG
GAAGACCCGG GAAGCCACCG TCCGGCCGCC GAGACCGGCG GCCTCGCCCG GGAGGCCATC
GTCCGGGCCG CGATCACCAT CGCCGACGCC GAGGGGATCG ACGGCCTGTC CATGCGCAGG
GTCGCCACCC AGCTGGGGGT GAGCACCATG GCCCTGTACC GCCACGTCGC GAACAAGGAC
GCGCTGGTGA CGGCGATGAT CGACCAGGTC TACACCGAGC ACGCCCTGCC CGACCCGCCG
CCCGCCGACT GGCGCGAGGC GCTCGAACTG GCCCTGCTGA CGGAGTGGGG CATCTACCGG
GCGCACCCCT GGGCCGTCCA GCTCACTCCG CTCGCCGGAG CGGTTCAGTC GCCCGGGCTG
GTGCAGAACG CCGAGTGGAT GATGCGGGTG ATCACCGGCC AGGGGCGCTC GCCGGACGAG
GCCATGGCGA TCCTCACCTT CGTGTCCGCC TACACCTCCG GCATGGCCCT CCAGGGCACG
CGCGCGGTGG TGGAGGGGTA CGAGGCCGGG ATGGACGCCG AGCACTGGTG GAGGTCCCGG
GGCGAGGAGT TCCTGCGGAT CGCCGAGCAG GGCAGGTTCC CCCTGACGTT CAGCGTCTCG
GGGCCGACCG ACGTGCACGC GATCTTCGGC CTCGGCATGA AACACCTGCT GGACGGGCTC
GCGCCGCTGA TCGAACCGGG AGGCCGACCC GTGGACGGGG GCCTCGCGGG CCCCACGGAC
ACAACCCGGT GA
 
Protein sequence
MVSTTGKKEL PPYARVVTDI RARIGSGELR PGERVPSTRE IMREWGVAMA TATKALAALR 
QEGLVEAVRG VGTLVRGAPA SAPEPQGPQR QRERPRPART EDPGSHRPAA ETGGLAREAI
VRAAITIADA EGIDGLSMRR VATQLGVSTM ALYRHVANKD ALVTAMIDQV YTEHALPDPP
PADWREALEL ALLTEWGIYR AHPWAVQLTP LAGAVQSPGL VQNAEWMMRV ITGQGRSPDE
AMAILTFVSA YTSGMALQGT RAVVEGYEAG MDAEHWWRSR GEEFLRIAEQ GRFPLTFSVS
GPTDVHAIFG LGMKHLLDGL APLIEPGGRP VDGGLAGPTD TTR