Gene Information |
Locus tag | Namu_5104 |
Symbol | |
ID | 8450735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 5690520 |
End bp | 5693504 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645044139 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003204363 |
Protein GI | 258655207 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
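The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the terminal stop codon; both assumptions match the reported values but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Namu_5104 record.
length = gene_length_bp(5690520, 5693504)
print(length)                     # 2985
print(protein_length_aa(length))  # 994
```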
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCGAGC CGGACTCCGC CACCGGACCG ATCGTCGACC TGCTCGGACC CGTGCGCATC CAGGGGCCCG CCGAACCGCA GATCCTGACC CGGCGCATGG AGATCGGCGT GCTGGGCATG CTCGCGCTGC ACGCCGGGAC CCCCGTCACC GGGTCCAACC TGATCGACCT GCTGTGGCCG CAGGACCCGC CCCGCACGGC CGCCAAGACG CTGCAGGGGT ACGTCAAGCG CGTCCGCGGC CTACTGGCCG GCGCCCGGGT CGAGCTGTCC TACGTCGGGC CGGCCGGGTA CCTCCTGGCC CTGGCCCCGG AGCAGGTCGA CGCCCTCCGG TTCGAGTCGC TGGTCGCGGC CGCCCGGACC TGCCCCGACG ACTCGCTGCG AGTCCAGCAG CTGGACGAGG CGCTGGCGCT CTGGCGCGGC GAGCCGTTTG CCGGCTGCGA CCTGGAGGGT CTGGGCCCCT TCCGCCAGTG GCTGGAGCGG CTGCGTAGCG GGGCCCGGTT GGAGCGGGCG ACCGCGGACA TCCGGCGCGG GGTCACGGAG CAGACGATCA GCGCGCTGCG CGTGCTGATC GCTCAGGAAC CGACCAACGA GCGGCTGTGG CTGCACCTGA TCGCGGCGTT GTACCTGGTC GGCAACCCGG TCGCGGCGCT GGAGGCCAGC GCCGAGGCCC GGCGGGAACT GGACGAGCGG GTCGGGGTCG CCCCCGGACC CGAGTTGATG GAGATCGAGC ACCGGATCGG CGTGCACGAC GACGTCGAGG GGTGTTACGT CCGGCTCACC GGCATCGAGC TGCGGCCGGC CACGGTCACC CCGGCTCCCG CCTCGGGTTC GACCCCGGGG TCCGGATCCG GCGCGGATCG CGAACGAGCC AATGCCGCCC TGCCGATGCC CGTCTGGGCC GGCGAATTGG TGGACCGCGA GGACACCGTC GCCGACATCG TCCGGCAGAT CGACCGCGGC GTGCCGGTCG TCACCATCGT CGGGCCGGGC GGGATGGGCA AGACGCGGTG CGCCGTGGAG GCGGCCCGAC AGGCCGGCAC GGTCTGCGTG GGATTCCTGG ATCTGAGCGG CATCGTCTCC GCCGATGCGC TGGCCGTGCA CCTGGCCGGC ACCATCGGCA CCCCCGGCCA CGACGACCCG TTCGTCGCGA TCGCCGAGAG GCTGGCCGGG CATCGCTGCT GCGTCCTGCT CGACAACGCC GAGCAGATCG TCGGCGGCGC CGACGTGATC GGCCGGCTGG CCAGTCTCTG CCCGTCGGTC ACCTGGCTGG TCACCTCGCG CCTGGAGCTG GGGCTGGAGG CCGAGCGGGT CACCCGCCTG CAGCCGCTGC CGTTGGACCC GGTCGCCGGC GGGATGAGCC TGTCGGCGGC GCTGCTGCTC TCCGCCGCCC AGCGCCGCGG CGTCAGCCTG CCGGCCGCGG CCCATCCGGT GATCGAACAG ATCGCCGCCG CAATCGGCGG CATCCCGCTG GCCCTGGAAC TGGCCGCCTG CCAACTGCAG TCGCTGGACC CGGCCACCCT GCTGCGCGCG CTGGACGATC CCCTGGTCAC CCTGGTCGAC CGGCGTCGGG CCATCGACCG GCACCGGTCC ATGCGCGCCT GCTTCCAGCT CGGCCTCGAC CAACTCTCCC CCGACGGCCT CGTCCTGCTG GCGCTGCTGT CCGGGCGCCC CAACGGCGCC CGCTACGACG ACCTGGCCGC CGTCTGGCCG GCCGAGTCCG CCGAGCCGCT GCCCGCGGCT CTGGCCGAGC TGGTCGAACT CGGTTTCGCC GCCGGCGCCA CCGATACTGC CGGGGCGACC 
CGGATCACCC AGCTGCCGGT CGTGCGCGCA CTGGGCCGGG AACTGACCGT CGCCGACCAA CTCGGCCCGC TGCCCGCCGC GCTGGACGCG GTGGTGATGG GCCGGGTGGC CGCCGCATCG GCCGGCGGCC ACACTTCCGA CGTCGAACCC GATCTGGCCG ATGTGCGCCG CCTGCTCCAG TTCGGGGTCG ATGACGACGC CGGGCTGGAA CCGGCGCTGG GCTTGGCCGC CGCCCTGGTC ATCTACTGGT GGTCGCATCG AATGGTCGAG GGTCGTGGTT GGCTCCACGC CCTTCTGCGC CGCCCGGTCC GGCAACAGCC CTCGGTCAAT CGTCTGCTAG CGCTCAACTC GGCGGTGTTT CTGGACTACG GCGTCGGCGA CAGCGATTCG GCTCGGCGGC ACGCAGATGA GGCGCTCGCC ACCGGCGCCG CGATGGTTCC GCCGGTCCAC TCCATGCTGC TGTCCCGTAC TGCCATGCTG GACGCCGCCG AAGAGTCGAT CGACCTGGCG CGATCGCGGG CCGCTGAGTC GTTGGCCATT GCCCGGGCAA TCGGTGACGG GCAGGTCTTG TGGCTGACGC TGGGGAATTG TGGTGACGTT GCCATCGCGA CCGGCGACCG GCACCAGGCT CGTGAGCTGT ATCTGGAATG CATCGATCAG CTCCGGCGGA GCGGAATCTC CTGGTTGGCG GCCGCGCCAT GCGGGAGGCT GGGCGACCTG GAGCTCTCCG CCGGTCGTCT CACGGAGTCT CGAATGTGGT TCGACCGAGC GGTGTCACTG TGGCTGGATC GTGAACTCGG AGCAGGTGCC GGACAGACCC TGGCCGGAGC TGCTCGATTG GACGTGATCG AGGGGCGGTT GGCCGATGCG CGCACTCATC TCGACGCCGG GCTGCTCGCC GCTGAGAAGT CCGGAAGCCG AGTTGAGTAC CCGTTCCTGA CGATCGGCTA CGCAGCCCTC GCGGCGGCTC GTGACGATGA CGACACGGCT CGAGCCCTGT TCGCCCTTGC TCTGTCTCAC GGACGGCGGG CGGGTGTCGC GTTGCGTCCG ATGATTGATG GAGAGCTTGC GTCGCTCTAC CGGTCAAAGG TCGACCGTTC CTCTCGTGAA AATGATGAGG CCCTGGCGCT GACGACTTCA CTCGAGGACC TCCCGACGAT CATCCGGCGC TTGATCGGTC CGTGA
Protein sequence | MSEPDSATGP IVDLLGPVRI QGPAEPQILT RRMEIGVLGM LALHAGTPVT GSNLIDLLWP QDPPRTAAKT LQGYVKRVRG LLAGARVELS YVGPAGYLLA LAPEQVDALR FESLVAAART CPDDSLRVQQ LDEALALWRG EPFAGCDLEG LGPFRQWLER LRSGARLERA TADIRRGVTE QTISALRVLI AQEPTNERLW LHLIAALYLV GNPVAALEAS AEARRELDER VGVAPGPELM EIEHRIGVHD DVEGCYVRLT GIELRPATVT PAPASGSTPG SGSGADRERA NAALPMPVWA GELVDREDTV ADIVRQIDRG VPVVTIVGPG GMGKTRCAVE AARQAGTVCV GFLDLSGIVS ADALAVHLAG TIGTPGHDDP FVAIAERLAG HRCCVLLDNA EQIVGGADVI GRLASLCPSV TWLVTSRLEL GLEAERVTRL QPLPLDPVAG GMSLSAALLL SAAQRRGVSL PAAAHPVIEQ IAAAIGGIPL ALELAACQLQ SLDPATLLRA LDDPLVTLVD RRRAIDRHRS MRACFQLGLD QLSPDGLVLL ALLSGRPNGA RYDDLAAVWP AESAEPLPAA LAELVELGFA AGATDTAGAT RITQLPVVRA LGRELTVADQ LGPLPAALDA VVMGRVAAAS AGGHTSDVEP DLADVRRLLQ FGVDDDAGLE PALGLAAALV IYWWSHRMVE GRGWLHALLR RPVRQQPSVN RLLALNSAVF LDYGVGDSDS ARRHADEALA TGAAMVPPVH SMLLSRTAML DAAEESIDLA RSRAAESLAI ARAIGDGQVL WLTLGNCGDV AIATGDRHQA RELYLECIDQ LRRSGISWLA AAPCGRLGDL ELSAGRLTES RMWFDRAVSL WLDRELGAGA GQTLAGAARL DVIEGRLADA RTHLDAGLLA AEKSGSRVEY PFLTIGYAAL AAARDDDDTA RALFALALSH GRRAGVALRP MIDGELASLY RSKVDRSSRE NDEALALTTS LEDLPTIIRR LIGP
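The sequence fields above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation of the kind summarized in the GC content field. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input, so the printed value describes that fragment and not the whole gene.

```python
def clean_sequence(field: str) -> str:
    """Join a block-formatted sequence field into one uppercase string."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the Namu_5104 gene sequence.
fragment = clean_sequence("ATGAGCGAGC CGGACTCCGC CACCGGACCG")
print(len(fragment))                    # 30
print(round(gc_fraction(fragment), 3))  # 0.733
```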