Gene Information |
Locus tag | Ndas_2621 |
Symbol | |
ID | 9246472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3125385 |
End bp | 3126788 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | putative transcriptional regulator, Crp/Fnr family |
Protein accession | YP_003680544 |
Protein GI | 297561570 |
COG category | |
COG ID | |
TIGRFAM ID | |
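The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates (the listed values agree with that reading) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Values taken from the record for Ndas_2621.
length = gene_length_bp(3125385, 3126788)
print(length, protein_length_aa(length))
```

With the record's values this reproduces the listed Gene Length and Protein Length. The strand does not affect either calculation, since both depend only on the span of the feature.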
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.167094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0008696 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCGATCG ACTCGGCTCC TGAGGTCCGG AGCAGCAAGC AGCAGAGCCT GGGTACCGCG GCCGCGCGGA ACCTGGCGAC CACCACCAAG TCCACGCCGC AGATGCAGGG CATCAGCTCC CGCTGGCTGA CCCGGATGCT CCCGTGGGTG CACACCGACG GCGGCGCCTA CCGGGTCAAC CGGCGGCTGA CCTACACCGT CGGCGACGGG CGGATCGAGT TCGAGCAGAC CGGCGCCCGG GTCCGGGTCA TCCCCCACGA ACTCGGCGAG CTGGCCCTGC TGCGCGGCTT CGACGACGAG GAGGTGCTGG CGGCCCTGGC CAACCGCTTC GTCCAGCGCG ACTTCGAGAA CGGCCAGGTC CTGGTGGAGG AGGGCACGGC GGCCGACAGC CTGTTCCTGC TGGCGCACGG GCGCGTGCAC AAGACCGGCA CCGGCCCCTA CGGCGAGGCC GTCCGGCTGG GCGTGCTGGC CGACGGCGAC AGGTTCGGCG ACCAGCACCT GCTCGGCACC GAACCGGCGT GGGAGTACAC CGTCAAGGCC GCGACGGCGG GCACGCTGCT GGAGCTGCCC CGCCGCGACT TCATCGCGAT CCTGGACGAC TCGCCCGCCC TCCAGGCCCA CGTCCAGCAG TACCTGTCCC TGCCCGGCGA GCGGCAGAAC AAGCACGGCG AGGCCGAGAT CGCCCTGTCC TCGGGCCACG TCGGCGAGGC CGAGCTGCCC AGCACCTTCG TGGACTACGA GCTCAGGCCG CGCGAGTACG AGCTGAGCGT GGCCCAGACC GTGCTGCGGG TGCACAGCCG GGTCGCCGAC CTCTACAACA AGCCGATGAA CCAGACCGAG CAGCAGCTGC GGCTGACCAT CCAGGCCCTG CGCGAGCGCC AGGAGCACGA ACTGGTCAAC AACCGCGAGT TCGGCCTGCT CCACAACGCC GAGTTCAAGC AGCGCATCCA GACCCACTCG GGGCCGCCCA CCCCCGACGA CCTGGACGAC CTGCTGAGCA TGCGCCGCAA CACCCAGTAC ATGTTCGCCC ACCCCCGCGC CATCGCCGCC TTCGGCAAGG AGTGCAACAG CCGGGGCCTG AACATCGGCA CCGTCGAGGT CAACGGCCAC CACCTGCCCG CCTGGCGCGG GGTGCCCCTC CTGCCCTGCG GCAAGATCCC GGTCACCGAG CACCAGACCT CCTCGATCAT CGCGGTCCGC ACCGGCGAGG ACAACGAGGG CGTCATCGGC CTGTACCAGA CCGGCCTGCC CGACGAGGTC GAGCCCGGCC TCAACGCGCG CTTCATGGGC ATCGACGACA AGGCCGTCAT CTCCTACCTC GTCAGCACCT ACTACTCCGC CGCGGTGCTC GTCCCCGACG CCATCGGGAT CCTGGAGAAC GCCGAGGTCC ACCCGCGTGG CTGA
|
Protein sequence | MSIDSAPEVR SSKQQSLGTA AARNLATTTK STPQMQGISS RWLTRMLPWV HTDGGAYRVN RRLTYTVGDG RIEFEQTGAR VRVIPHELGE LALLRGFDDE EVLAALANRF VQRDFENGQV LVEEGTAADS LFLLAHGRVH KTGTGPYGEA VRLGVLADGD RFGDQHLLGT EPAWEYTVKA ATAGTLLELP RRDFIAILDD SPALQAHVQQ YLSLPGERQN KHGEAEIALS SGHVGEAELP STFVDYELRP REYELSVAQT VLRVHSRVAD LYNKPMNQTE QQLRLTIQAL RERQEHELVN NREFGLLHNA EFKQRIQTHS GPPTPDDLDD LLSMRRNTQY MFAHPRAIAA FGKECNSRGL NIGTVEVNGH HLPAWRGVPL LPCGKIPVTE HQTSSIIAVR TGEDNEGVIG LYQTGLPDEV EPGLNARFMG IDDKAVISYL VSTYYSAAVL VPDAIGILEN AEVHPRG