Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2223 |
Symbol | |
ID | 9246073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2655191 |
End bp | 2656753 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | transcriptional regulator, CdaR |
Protein accession | YP_003680151 |
Protein GI | 297561177 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.14509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000308823 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGGATCA GCGAGCTGCT CGCGGTCACA CGGCTGCACC TGAGATTCCT GTCGGGGGCC GAGCACGCGC ACCGCACCCT GCGGTGGGCC TACACCACCG ACCTGCTCGA ACCCGCACGC TACCTGCGCG GCGGCGAGTT CGTCCTCACC GGGATGATGT GGCGCACCCG CCCCGAGGAC TCGCGCACCT TCGTGGCCTC CGTCGCCGGG GCCGGGGCGG TCGCCGTCGG CGCCGGGACC GCCCTGGGCG AGGTCCCCGC CGATCTGGTG GAGGCCTGCC GCGAGCACGA CCTGCCCCTG GTGGAGGTGC CCCCGGAGAC CTCCTTCGGC GCGGTCACCG AGGAGGTGCT GCGCTCCCTG ACCCAGCACC GCTTCACCAC CATCGCCGAG ACCCGCGACC GCCACCGCCG CCTCATGGCC GACGTCGCCG CCGGGGCCGA CTTCCCCCGG GCCTTCGCCG AGGCCGCCGC GCAGACCGGG CGGGCCGCCT GGGTCCTGTC CTGCACCGGC CGCCACATCG CCGCCTCCGG CGGGCCCCTG CCCGAGGACG AGCGCGGGTG GATCGCCGCC CGCGCCCTGA CCGGCCCCGC CCTGCCCCAC ACCGTCCGCA CCCCCCAGCG GGAGGACCGC GCCCTCACGC TCCTGCCCGT CCAGGCCCGC GAGTCCCACC CCCTGGCCAC CTGGCTGGTG GTCTGCGAGG GCGACCACGC CTCCTGGACC GAGGAGGAGC ACGAGTCGGT GGCCGAACTC GTCTCCATCG CCGGGCTCGC CCGCAGCCGC GCCGAGGAGC GCGCCCTGAC CGACGCCCGC CACCTGGAGG GACTGCCCCG GCTGCTGGCC GCCCAGCGCT TCGACGAGGT CACCGAACTC CTGCGCGGCA CCGACCCGAC CGGCGGCCAG GGCAGCCACG TCGTGGTCAG CGCCGTCATG CTGCCCGAGC CGCGCGTGCC CGACCTGGCC CGGCGCGTGC TCTTGGAACT GGTCGCCGAC CGCCCCGGCG CCGTCGTCAC CGGCGACGAG GACGCCCTGG CCGTCGTCCC GGTGGCCGGG ACCGACGCCC GCGCACGCGC CGAGGAGGTC CGCTCCGCGC TGCTGCACCG CGCCCGCGTC CTGGAGGGCG GCCTGCTCGA CCACCGGCTG GCCATCGGCC TCAGTTCGGC GGTGCGGGGC GTGCCCGACC TGCGCGGCGC CGCCGTGGAG GCCCGCCACG CCCGCCGCCT GGCCGAGCTG CGCGGCGGCC GGTCCCGGGT GATCGCCGGA GCCGAGATCG ACTCCCACGA ACTGCTGCTG GCCTCGGTCC CCGAGGAGGT GCAGTCCTCC TACCGCGAGC GGCTGCTGGG CCCGCTGCTG GCCTACGACC GCGACCACCG CTCGGAGCTG GTGCGGACCC TGGAGCAGTT CCTGGCCCAC TCGGGGTCCT GGCAGCGCTG CGCCGCCACG ATGCACGTGC ACGTCAACAC GCTGCGGTAC CGGATCGGCC GCGTGGAGGA GCTGACCGGA CGGGATCTGA GCAGCCTGGA GCACCGGGTG GACCTGTTCC TGGCGCTCAA GCTCCGGGAC TGA
|
Protein sequence | MRISELLAVT RLHLRFLSGA EHAHRTLRWA YTTDLLEPAR YLRGGEFVLT GMMWRTRPED SRTFVASVAG AGAVAVGAGT ALGEVPADLV EACREHDLPL VEVPPETSFG AVTEEVLRSL TQHRFTTIAE TRDRHRRLMA DVAAGADFPR AFAEAAAQTG RAAWVLSCTG RHIAASGGPL PEDERGWIAA RALTGPALPH TVRTPQREDR ALTLLPVQAR ESHPLATWLV VCEGDHASWT EEEHESVAEL VSIAGLARSR AEERALTDAR HLEGLPRLLA AQRFDEVTEL LRGTDPTGGQ GSHVVVSAVM LPEPRVPDLA RRVLLELVAD RPGAVVTGDE DALAVVPVAG TDARARAEEV RSALLHRARV LEGGLLDHRL AIGLSSAVRG VPDLRGAAVE ARHARRLAEL RGGRSRVIAG AEIDSHELLL ASVPEEVQSS YRERLLGPLL AYDRDHRSEL VRTLEQFLAH SGSWQRCAAT MHVHVNTLRY RIGRVEELTG RDLSSLEHRV DLFLALKLRD
|
| |