Gene Information |
Locus tag | Ndas_0057 |
Symbol | |
ID | 9243887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 74282 |
End bp | 76021 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, AraC family |
Protein accession | YP_003678015 |
Protein GI | 297559041 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
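The coordinate and length fields in the record above agree with one another, and a short sketch can confirm this. It assumes (the record does not say so) that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(74282, 76021)
print(length)                     # 1740
print(protein_length_aa(length))  # 579
```

Both values match the Gene Length and Protein Length fields reported for Ndas_0057.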
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000194766 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCGAAA ACCGCCAGCT TCGGTGCGCC GCGTCGGCGA ACATGGATGG CGTGATGGAC GACGACCAGC GCTACCGCGC CGTACACAGC AGGGACGCCC GCTTCGACGG CGTCTTCTAC ACAGCCGTGC GCACCACGGG TATCTACTGC CGTCCGAGCT GCCCGGCCGT CACCCCGAAA CGGGCCAACG TCCGCTTCTA CCCGACCGCC GCGGCCGCCC AGGAGTCGGG GTTCCGCGCC TGCAAGCGGT GCCGTCCCGA CCTCACCCCC GGTTCGCCGG AGTGGAACCT GCGGGCCGAC GTGGTGGGCC GCGCGATGCG CCTCATCCAG GACGGCGCCG TGGACCGGGG CGGCGTCAGC GCCCTGGCCT CGGCGGTCGG CTACAGCGAA CGCCAGCTCA ACCGGCTGCT GTCGGCCGAG GTGGGCGCGG GGCCGCTGGC CCTGGCCCGG ACCGAGCGCG CCCAGACGGC ACGGGTGCTG GTGGAGACCA CCGACATGCC GATGGCGGAC GTCGCCTTCG CCGCGGGGTT CGCGAGCGTA CGCCAGTTCA ACGAGACGAT GCGCGCGGTG TTCGACCGCT CCCCCACCGA GATGCGGACC ATGGGCGGTC GGCGTGCGTC GGCTTCCGGG GGGTCGCGTC CGGGGGACGC CCGCCCTCCG GCCACCGAGC CGGGGACGGT CACGCTGCGG CTGCCCTACC GGGAGCCCAT CGACCTGGCC CGGATGCTGA GGTTCCTCGG AGACCGTGCG GTTCCGGGCG TGGAGGAGTA CCGGGACGGG GTCTACCGCA GGACGCTGAT GCTGGCGCAC GGTCCCGCCG TGGTGGAGTT GTCCGAGGGG TCCGGGACCG GCAGGGCGGG CAGGACCGGC CGCGCTGGTG CCACGGGCGG CGTTCGTCCG GCGGACGCTG TGGACGGCGG GGTGTCCGTG AGCGGTGGGG GGCACGTGCT GTGCCGCCTG CGGTTGTCGG AGGCGCGCGA CCTGACCAGC GCGGTGCGCA GGTGCCGCAG GCTGCTCGAC CTGGACGCCG ACCCGGGTGC GGTGGCCGAG GCCCTGGGCG GGGACCCCCT CCTGGGACCG ATCGTGGCCG CCCACCCGGG ACTGCGATCG CCGGGGCACG TGGACCCGGC CGAACTGGCG GTCCGGGCGG TCCTCGGCCA GCAGGTGTCG GTGCGTGCGG CCCGCACGCT GGCGGGGCGG CTGGTCGAGC GGTTCGGCGA ACCGCTCGCT CCGGGCCTGG AAGCGCCGGG CGGAGGACTC ACCCACGTGT TCCCCTCCCC TGACGCGCTC GCCGCGGCCG ACCCGGCCGG TTTCTCCGTC CCGGTCGCGC GGGGACGCGC CCTGGCGGGG CTGTGCGAGG CGATCGCCTC GGGGTGGATC GACCTGGGGC CGGGATGCGA CCGGGACGAG GCCGAACGGC GTCTGGTGGA GCTGCGCGGC ATCGGTCCGT GGACCGCCGG TTACGTGCGC ATGCGGGGTC TGGGCGACCC GGACGTGTTC CTGCACGGCG ACCTGGGCGT CCGGATGGCG CTGGAGGCGG GGGGCAGACG GGCGACCCCC GCGGCGGCCG CGCGCGAGGC ACGGGAGTGG AGCCCGTGGC GGTCCTACGC CAACCACGCG CTGTGGGCGT CGTTGGCCGA CCGTGAGCGG GAGAGCACGG CCGTTCGGGC GGACGTGGTC GTGCGGGACG GCGTGCGGGA TGCTTCGAAG GAACGTCAGG AACGGAAGGA ATCGGCATGA
Protein sequence | MSENRQLRCA ASANMDGVMD DDQRYRAVHS RDARFDGVFY TAVRTTGIYC RPSCPAVTPK RANVRFYPTA AAAQESGFRA CKRCRPDLTP GSPEWNLRAD VVGRAMRLIQ DGAVDRGGVS ALASAVGYSE RQLNRLLSAE VGAGPLALAR TERAQTARVL VETTDMPMAD VAFAAGFASV RQFNETMRAV FDRSPTEMRT MGGRRASASG GSRPGDARPP ATEPGTVTLR LPYREPIDLA RMLRFLGDRA VPGVEEYRDG VYRRTLMLAH GPAVVELSEG SGTGRAGRTG RAGATGGVRP ADAVDGGVSV SGGGHVLCRL RLSEARDLTS AVRRCRRLLD LDADPGAVAE ALGGDPLLGP IVAAHPGLRS PGHVDPAELA VRAVLGQQVS VRAARTLAGR LVERFGEPLA PGLEAPGGGL THVFPSPDAL AAADPAGFSV PVARGRALAG LCEAIASGWI DLGPGCDRDE AERRLVELRG IGPWTAGYVR MRGLGDPDVF LHGDLGVRMA LEAGGRRATP AAAAREAREW SPWRSYANHA LWASLADRER ESTAVRADVV VRDGVRDASK ERQERKESA
| |
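The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any downstream use. The sketch below shows one possible way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, so its result is not expected to equal the 75% reported for the whole gene.

```python
def clean_sequence(spaced: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(spaced.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "ATGTCCGAAA ACCGCCAGCT"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55
```

Passing the full Gene sequence field through the same two helpers would give a value to compare against the reported GC content.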