Gene Ndas_2223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2223 
Symbol 
ID9246073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2655191 
End bp2656753 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content76% 
IMG OID 
Producttranscriptional regulator, CdaR 
Protein accessionYP_003680151 
Protein GI297561177 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.14509 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000308823 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGATCA GCGAGCTGCT CGCGGTCACA CGGCTGCACC TGAGATTCCT GTCGGGGGCC 
GAGCACGCGC ACCGCACCCT GCGGTGGGCC TACACCACCG ACCTGCTCGA ACCCGCACGC
TACCTGCGCG GCGGCGAGTT CGTCCTCACC GGGATGATGT GGCGCACCCG CCCCGAGGAC
TCGCGCACCT TCGTGGCCTC CGTCGCCGGG GCCGGGGCGG TCGCCGTCGG CGCCGGGACC
GCCCTGGGCG AGGTCCCCGC CGATCTGGTG GAGGCCTGCC GCGAGCACGA CCTGCCCCTG
GTGGAGGTGC CCCCGGAGAC CTCCTTCGGC GCGGTCACCG AGGAGGTGCT GCGCTCCCTG
ACCCAGCACC GCTTCACCAC CATCGCCGAG ACCCGCGACC GCCACCGCCG CCTCATGGCC
GACGTCGCCG CCGGGGCCGA CTTCCCCCGG GCCTTCGCCG AGGCCGCCGC GCAGACCGGG
CGGGCCGCCT GGGTCCTGTC CTGCACCGGC CGCCACATCG CCGCCTCCGG CGGGCCCCTG
CCCGAGGACG AGCGCGGGTG GATCGCCGCC CGCGCCCTGA CCGGCCCCGC CCTGCCCCAC
ACCGTCCGCA CCCCCCAGCG GGAGGACCGC GCCCTCACGC TCCTGCCCGT CCAGGCCCGC
GAGTCCCACC CCCTGGCCAC CTGGCTGGTG GTCTGCGAGG GCGACCACGC CTCCTGGACC
GAGGAGGAGC ACGAGTCGGT GGCCGAACTC GTCTCCATCG CCGGGCTCGC CCGCAGCCGC
GCCGAGGAGC GCGCCCTGAC CGACGCCCGC CACCTGGAGG GACTGCCCCG GCTGCTGGCC
GCCCAGCGCT TCGACGAGGT CACCGAACTC CTGCGCGGCA CCGACCCGAC CGGCGGCCAG
GGCAGCCACG TCGTGGTCAG CGCCGTCATG CTGCCCGAGC CGCGCGTGCC CGACCTGGCC
CGGCGCGTGC TCTTGGAACT GGTCGCCGAC CGCCCCGGCG CCGTCGTCAC CGGCGACGAG
GACGCCCTGG CCGTCGTCCC GGTGGCCGGG ACCGACGCCC GCGCACGCGC CGAGGAGGTC
CGCTCCGCGC TGCTGCACCG CGCCCGCGTC CTGGAGGGCG GCCTGCTCGA CCACCGGCTG
GCCATCGGCC TCAGTTCGGC GGTGCGGGGC GTGCCCGACC TGCGCGGCGC CGCCGTGGAG
GCCCGCCACG CCCGCCGCCT GGCCGAGCTG CGCGGCGGCC GGTCCCGGGT GATCGCCGGA
GCCGAGATCG ACTCCCACGA ACTGCTGCTG GCCTCGGTCC CCGAGGAGGT GCAGTCCTCC
TACCGCGAGC GGCTGCTGGG CCCGCTGCTG GCCTACGACC GCGACCACCG CTCGGAGCTG
GTGCGGACCC TGGAGCAGTT CCTGGCCCAC TCGGGGTCCT GGCAGCGCTG CGCCGCCACG
ATGCACGTGC ACGTCAACAC GCTGCGGTAC CGGATCGGCC GCGTGGAGGA GCTGACCGGA
CGGGATCTGA GCAGCCTGGA GCACCGGGTG GACCTGTTCC TGGCGCTCAA GCTCCGGGAC
TGA
 
Protein sequence
MRISELLAVT RLHLRFLSGA EHAHRTLRWA YTTDLLEPAR YLRGGEFVLT GMMWRTRPED 
SRTFVASVAG AGAVAVGAGT ALGEVPADLV EACREHDLPL VEVPPETSFG AVTEEVLRSL
TQHRFTTIAE TRDRHRRLMA DVAAGADFPR AFAEAAAQTG RAAWVLSCTG RHIAASGGPL
PEDERGWIAA RALTGPALPH TVRTPQREDR ALTLLPVQAR ESHPLATWLV VCEGDHASWT
EEEHESVAEL VSIAGLARSR AEERALTDAR HLEGLPRLLA AQRFDEVTEL LRGTDPTGGQ
GSHVVVSAVM LPEPRVPDLA RRVLLELVAD RPGAVVTGDE DALAVVPVAG TDARARAEEV
RSALLHRARV LEGGLLDHRL AIGLSSAVRG VPDLRGAAVE ARHARRLAEL RGGRSRVIAG
AEIDSHELLL ASVPEEVQSS YRERLLGPLL AYDRDHRSEL VRTLEQFLAH SGSWQRCAAT
MHVHVNTLRY RIGRVEELTG RDLSSLEHRV DLFLALKLRD