Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0476 |
Symbol | |
ID | 9244315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 573439 |
End bp | 575358 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003678429 |
Protein GI | 297559455 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCTGA GCGACCCGGC GAGGCGGTGG CACGACACGG CCGGACAGGC CCGGCAGGGC GCGTCCGGGC ACACCGCCCG CGGGCTGCGC GTCGGACTGC TCGGGCAGTT CGGCGTCGAG ATGGACGGGG CCCCGCTGCG CCTGCGCGGG GACAAGCGCC GTGCCCTGCT GGCCACGCTC CTGCTCAACA CCGGGCGCAC CGTCCCCACC GCGCACCTCA TCGAACGCGT CTGGGGGAGC CCGGCCTCCC CGTCCGCGCG CAGCGCCCTC CAGGTCCACG TGACCCGCGT GCGCGCGGTC CTGGACCAGC ACTGCGGCAC CCCGCTCATC ACGGGCGGCG ACGGCGGCTA CCGGGTCGAC CTCGCCGAGG ACCAGTGCGA CCTGCTGCGC TTTCGCTCCC TGGTCCGCCG CGCCGACGGG GGCGCCGACC CCTCCGACCG CGCCGACCTG CTCATCAGCG CCCTGCGCCT GTGGCGTGGC CCGGTCCTGG CCGACATCGT CAGCCCCGTC CTGCACGAGC GCGACATCCC GCCCCTGAAC GAGGAGCTCC TGCGGGCCGC CGAGGAGGGG TTCGGCGCCG CCCTGGCCCG GGGCGACCAC GAGCGGGTGG CCGACCAGAT CGGCCCCATC GCCACCGACC ACCCCGAGCG CGAACCCCTC ATCCGCGTCC AGATGACCGC CCTCTACCGC TGCGGGCGCC CCAGCGAGGC CCTGCGCGTG TACGCCCGCA CCCGCGACGC GCTGGCCGAG CACCTGGGCG CGGACCCGGG CCGCGAACTC CAGGAGACCT TCCACGGCAT CCTGCGCGGC GACCTCGACC GCGCCCCGGG CACCAGCGTC CCGCGCCAGC GCGCCTCCGT CGAGGAGGGC GCCGGGGTCC GGCCCCTCTC CGCGGACACC GGACCGAACA CCGGGTCAGC CGACGCGCCG GGCACCGAAC CGGAGGCGGG ATCCCACGCC GGACCCGGGC CGGGCGCCGA TCCCGACGTC CCGGCGGGGG CGGAGGAACG CGACGACCGC GGCGACGGTC CGGTCCACGC GCCCGCCCCG GTGTCGGCCG CCCTCGCCCC GGCGGAGCTG CCCGCCGCGC CCTCCGCGCT GCTGGGTCGC GAGGAGGCCC TGGCCGAACT GGACCGGCTG GTGGACCCCT CCAGCACCGC CCCCGGCAGC GCGCTGGTGC GCGGCCCCGC GGGCGCCGGG GCCAGCGCCC TCGCCCTGAG CTGGGCCCGC GCGGCCGCGC CGCACTTCCC CGACGGACAG CTCTACGTCG ACCTCCGCGG CGGCGACGGC AGCCCCCGAG ACCCGGTCGA GGTGCTGCGC CGTCTGGTGC GCTCCCTGTC CACCGGCACC CGCGGCACCG AGGCCATGGA CGCCGACGAG GCCGCCGCGC GGGTCCGGAC GCTGTTGGCG CACCGGCGCG TCCTGCTGGT CCTGGACAAC GCGGCCTCCG TGCGCCAGGT GCGCCCCCTG CTCCCCGGCG GCACCGGGTG CGCGGCGCTC GTCACCAGCC GCTACTGGCT GACCGACCTG CTGGTGCGCG ACGGTCTGCG CGCCCTGCCC GTGGGGCCGC TCCCCCCGGA CGCCGCGGTG GACCTGTTGC GCCCGCGGGA GGGGCGGGAC CGGCGCGCCG AGTCGGTGCT GCGCCGCCTC GCCCAGGTCC TGGGACACCT GCCCCTGGCG CTGCGCATGG CGGCGGTCTG GCTCGACGAC ACCCGCCCGG ACCGGTCCGC CGCGGAGCTG GTGCGCCGAC TGGAGGGGGC GGACCCGGCA CGCGGGAGCA CGCCCACGGC CCGGATGGCC GCCGTGCTGC GTGCCGGGCC CCGGGAGGGA CGGCACGGCC GCGGGGAAGT ACCGGCCGAG CCCCGTCCTC CCGACACCAG TGTCGAACGG GGATCTTACG TTCGGCTTTC ACCCAGGTGA
|
Protein sequence | MSLSDPARRW HDTAGQARQG ASGHTARGLR VGLLGQFGVE MDGAPLRLRG DKRRALLATL LLNTGRTVPT AHLIERVWGS PASPSARSAL QVHVTRVRAV LDQHCGTPLI TGGDGGYRVD LAEDQCDLLR FRSLVRRADG GADPSDRADL LISALRLWRG PVLADIVSPV LHERDIPPLN EELLRAAEEG FGAALARGDH ERVADQIGPI ATDHPEREPL IRVQMTALYR CGRPSEALRV YARTRDALAE HLGADPGREL QETFHGILRG DLDRAPGTSV PRQRASVEEG AGVRPLSADT GPNTGSADAP GTEPEAGSHA GPGPGADPDV PAGAEERDDR GDGPVHAPAP VSAALAPAEL PAAPSALLGR EEALAELDRL VDPSSTAPGS ALVRGPAGAG ASALALSWAR AAAPHFPDGQ LYVDLRGGDG SPRDPVEVLR RLVRSLSTGT RGTEAMDADE AAARVRTLLA HRRVLLVLDN AASVRQVRPL LPGGTGCAAL VTSRYWLTDL LVRDGLRALP VGPLPPDAAV DLLRPREGRD RRAESVLRRL AQVLGHLPLA LRMAAVWLDD TRPDRSAAEL VRRLEGADPA RGSTPTARMA AVLRAGPREG RHGRGEVPAE PRPPDTSVER GSYVRLSPR
|
| |