Gene Ndas_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2043 
Symbol 
ID9245893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2464253 
End bp2465245 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content75% 
IMG OID 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003679975 
Protein GI297561001 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.109712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTTC CGCACCGTGT CGTCATCGGG GTCTTCCCCG ACGTCGACCT CCTTGACGTC 
ACCGGCCCGG CCGAGGTCTT CGCGCTGGCC AACCAGGAGG CCCCGGGCCG TGCCGACTAC
CGGGTCCTCC TCGCCGGGCC GACCCGCGGC GAGGTGCGGA CGTCGGCGGG CGTGCGGCTG
CTCACCGACG TGTCCTTCGA CGACGTCGGC GGACAGGTGG ACACGCTGCT GGTACCCGGC
GCGGTCGACA TGGGCGACGA CGGCCCCGTG GCCCGGATCG ACTCCGACGT CGTGGCGTGG
GTGCGGGAGA CCGCCCCCTG CGCCCGGCGG GTGGCGTCGG TGTGCGTGGG CGCGCACGTA
CTGGCGGCGG CCGGACTGCT GGACGGCAGG ACCGCGACCA CGCACTGGTC GACCGCCGCG
CAGCTCGCCG CCGACCATCC GGCCGTCACG GTCGACCCGG ACCCGATCTT CGTCCGCGCC
GACCGCGGAC GGCTGTGGAC GGGCGCCGGG ATCAGCGCCT GCCTGGACCT CGCACTCGCC
CTGGTGGCCG AGGATCTGGG TGAGGACGTC GCGCTGGCGG TGGCCCGGCA GCTGGTGATG
TACCTCAAGC GGCAGAGCGG GCAGAGCCAG TTCTCCGTGC CGCTCAGCCG GCCCGCCTCC
GCCCGCCGCG ACATCGACGA GCTGCTGCTG TGGATTTCCG ACCACCTCGA CGAGGACCTG
TCCGCGGAGG TGCTGGCGGC CCGGATGCAC CTGAGCGAAC GGCACTTCGC CCGCGTCTTC
GCCCAGGAGA CCGGCACCGG TCCCGCCGCC TACGTCGAGG GCGTCCGGGT CGAGGCCGCC
CGGCGCCTGC TGGAGACCAC CGACGACCCG CTCGACCGGG TCGCGGCCAG GGCCGGGTTC
GGCTCGACGG AGACCCTGCA CCGGGCGTTC CGGCGACAGC TCGCCACCAC CCCCGCCGCC
TACCGCCGCC GCTTCCGCAC CCAGGCCGCC TGA
 
Protein sequence
MSLPHRVVIG VFPDVDLLDV TGPAEVFALA NQEAPGRADY RVLLAGPTRG EVRTSAGVRL 
LTDVSFDDVG GQVDTLLVPG AVDMGDDGPV ARIDSDVVAW VRETAPCARR VASVCVGAHV
LAAAGLLDGR TATTHWSTAA QLAADHPAVT VDPDPIFVRA DRGRLWTGAG ISACLDLALA
LVAEDLGEDV ALAVARQLVM YLKRQSGQSQ FSVPLSRPAS ARRDIDELLL WISDHLDEDL
SAEVLAARMH LSERHFARVF AQETGTGPAA YVEGVRVEAA RRLLETTDDP LDRVAARAGF
GSTETLHRAF RRQLATTPAA YRRRFRTQAA