Gene Namu_5104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5104 
Symbol 
ID8450735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5690520 
End bp5693504 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content72% 
IMG OID645044139 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003204363 
Protein GI258655207 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC CGGACTCCGC CACCGGACCG ATCGTCGACC TGCTCGGACC CGTGCGCATC 
CAGGGGCCCG CCGAACCGCA GATCCTGACC CGGCGCATGG AGATCGGCGT GCTGGGCATG
CTCGCGCTGC ACGCCGGGAC CCCCGTCACC GGGTCCAACC TGATCGACCT GCTGTGGCCG
CAGGACCCGC CCCGCACGGC CGCCAAGACG CTGCAGGGGT ACGTCAAGCG CGTCCGCGGC
CTACTGGCCG GCGCCCGGGT CGAGCTGTCC TACGTCGGGC CGGCCGGGTA CCTCCTGGCC
CTGGCCCCGG AGCAGGTCGA CGCCCTCCGG TTCGAGTCGC TGGTCGCGGC CGCCCGGACC
TGCCCCGACG ACTCGCTGCG AGTCCAGCAG CTGGACGAGG CGCTGGCGCT CTGGCGCGGC
GAGCCGTTTG CCGGCTGCGA CCTGGAGGGT CTGGGCCCCT TCCGCCAGTG GCTGGAGCGG
CTGCGTAGCG GGGCCCGGTT GGAGCGGGCG ACCGCGGACA TCCGGCGCGG GGTCACGGAG
CAGACGATCA GCGCGCTGCG CGTGCTGATC GCTCAGGAAC CGACCAACGA GCGGCTGTGG
CTGCACCTGA TCGCGGCGTT GTACCTGGTC GGCAACCCGG TCGCGGCGCT GGAGGCCAGC
GCCGAGGCCC GGCGGGAACT GGACGAGCGG GTCGGGGTCG CCCCCGGACC CGAGTTGATG
GAGATCGAGC ACCGGATCGG CGTGCACGAC GACGTCGAGG GGTGTTACGT CCGGCTCACC
GGCATCGAGC TGCGGCCGGC CACGGTCACC CCGGCTCCCG CCTCGGGTTC GACCCCGGGG
TCCGGATCCG GCGCGGATCG CGAACGAGCC AATGCCGCCC TGCCGATGCC CGTCTGGGCC
GGCGAATTGG TGGACCGCGA GGACACCGTC GCCGACATCG TCCGGCAGAT CGACCGCGGC
GTGCCGGTCG TCACCATCGT CGGGCCGGGC GGGATGGGCA AGACGCGGTG CGCCGTGGAG
GCGGCCCGAC AGGCCGGCAC GGTCTGCGTG GGATTCCTGG ATCTGAGCGG CATCGTCTCC
GCCGATGCGC TGGCCGTGCA CCTGGCCGGC ACCATCGGCA CCCCCGGCCA CGACGACCCG
TTCGTCGCGA TCGCCGAGAG GCTGGCCGGG CATCGCTGCT GCGTCCTGCT CGACAACGCC
GAGCAGATCG TCGGCGGCGC CGACGTGATC GGCCGGCTGG CCAGTCTCTG CCCGTCGGTC
ACCTGGCTGG TCACCTCGCG CCTGGAGCTG GGGCTGGAGG CCGAGCGGGT CACCCGCCTG
CAGCCGCTGC CGTTGGACCC GGTCGCCGGC GGGATGAGCC TGTCGGCGGC GCTGCTGCTC
TCCGCCGCCC AGCGCCGCGG CGTCAGCCTG CCGGCCGCGG CCCATCCGGT GATCGAACAG
ATCGCCGCCG CAATCGGCGG CATCCCGCTG GCCCTGGAAC TGGCCGCCTG CCAACTGCAG
TCGCTGGACC CGGCCACCCT GCTGCGCGCG CTGGACGATC CCCTGGTCAC CCTGGTCGAC
CGGCGTCGGG CCATCGACCG GCACCGGTCC ATGCGCGCCT GCTTCCAGCT CGGCCTCGAC
CAACTCTCCC CCGACGGCCT CGTCCTGCTG GCGCTGCTGT CCGGGCGCCC CAACGGCGCC
CGCTACGACG ACCTGGCCGC CGTCTGGCCG GCCGAGTCCG CCGAGCCGCT GCCCGCGGCT
CTGGCCGAGC TGGTCGAACT CGGTTTCGCC GCCGGCGCCA CCGATACTGC CGGGGCGACC
CGGATCACCC AGCTGCCGGT CGTGCGCGCA CTGGGCCGGG AACTGACCGT CGCCGACCAA
CTCGGCCCGC TGCCCGCCGC GCTGGACGCG GTGGTGATGG GCCGGGTGGC CGCCGCATCG
GCCGGCGGCC ACACTTCCGA CGTCGAACCC GATCTGGCCG ATGTGCGCCG CCTGCTCCAG
TTCGGGGTCG ATGACGACGC CGGGCTGGAA CCGGCGCTGG GCTTGGCCGC CGCCCTGGTC
ATCTACTGGT GGTCGCATCG AATGGTCGAG GGTCGTGGTT GGCTCCACGC CCTTCTGCGC
CGCCCGGTCC GGCAACAGCC CTCGGTCAAT CGTCTGCTAG CGCTCAACTC GGCGGTGTTT
CTGGACTACG GCGTCGGCGA CAGCGATTCG GCTCGGCGGC ACGCAGATGA GGCGCTCGCC
ACCGGCGCCG CGATGGTTCC GCCGGTCCAC TCCATGCTGC TGTCCCGTAC TGCCATGCTG
GACGCCGCCG AAGAGTCGAT CGACCTGGCG CGATCGCGGG CCGCTGAGTC GTTGGCCATT
GCCCGGGCAA TCGGTGACGG GCAGGTCTTG TGGCTGACGC TGGGGAATTG TGGTGACGTT
GCCATCGCGA CCGGCGACCG GCACCAGGCT CGTGAGCTGT ATCTGGAATG CATCGATCAG
CTCCGGCGGA GCGGAATCTC CTGGTTGGCG GCCGCGCCAT GCGGGAGGCT GGGCGACCTG
GAGCTCTCCG CCGGTCGTCT CACGGAGTCT CGAATGTGGT TCGACCGAGC GGTGTCACTG
TGGCTGGATC GTGAACTCGG AGCAGGTGCC GGACAGACCC TGGCCGGAGC TGCTCGATTG
GACGTGATCG AGGGGCGGTT GGCCGATGCG CGCACTCATC TCGACGCCGG GCTGCTCGCC
GCTGAGAAGT CCGGAAGCCG AGTTGAGTAC CCGTTCCTGA CGATCGGCTA CGCAGCCCTC
GCGGCGGCTC GTGACGATGA CGACACGGCT CGAGCCCTGT TCGCCCTTGC TCTGTCTCAC
GGACGGCGGG CGGGTGTCGC GTTGCGTCCG ATGATTGATG GAGAGCTTGC GTCGCTCTAC
CGGTCAAAGG TCGACCGTTC CTCTCGTGAA AATGATGAGG CCCTGGCGCT GACGACTTCA
CTCGAGGACC TCCCGACGAT CATCCGGCGC TTGATCGGTC CGTGA
 
Protein sequence
MSEPDSATGP IVDLLGPVRI QGPAEPQILT RRMEIGVLGM LALHAGTPVT GSNLIDLLWP 
QDPPRTAAKT LQGYVKRVRG LLAGARVELS YVGPAGYLLA LAPEQVDALR FESLVAAART
CPDDSLRVQQ LDEALALWRG EPFAGCDLEG LGPFRQWLER LRSGARLERA TADIRRGVTE
QTISALRVLI AQEPTNERLW LHLIAALYLV GNPVAALEAS AEARRELDER VGVAPGPELM
EIEHRIGVHD DVEGCYVRLT GIELRPATVT PAPASGSTPG SGSGADRERA NAALPMPVWA
GELVDREDTV ADIVRQIDRG VPVVTIVGPG GMGKTRCAVE AARQAGTVCV GFLDLSGIVS
ADALAVHLAG TIGTPGHDDP FVAIAERLAG HRCCVLLDNA EQIVGGADVI GRLASLCPSV
TWLVTSRLEL GLEAERVTRL QPLPLDPVAG GMSLSAALLL SAAQRRGVSL PAAAHPVIEQ
IAAAIGGIPL ALELAACQLQ SLDPATLLRA LDDPLVTLVD RRRAIDRHRS MRACFQLGLD
QLSPDGLVLL ALLSGRPNGA RYDDLAAVWP AESAEPLPAA LAELVELGFA AGATDTAGAT
RITQLPVVRA LGRELTVADQ LGPLPAALDA VVMGRVAAAS AGGHTSDVEP DLADVRRLLQ
FGVDDDAGLE PALGLAAALV IYWWSHRMVE GRGWLHALLR RPVRQQPSVN RLLALNSAVF
LDYGVGDSDS ARRHADEALA TGAAMVPPVH SMLLSRTAML DAAEESIDLA RSRAAESLAI
ARAIGDGQVL WLTLGNCGDV AIATGDRHQA RELYLECIDQ LRRSGISWLA AAPCGRLGDL
ELSAGRLTES RMWFDRAVSL WLDRELGAGA GQTLAGAARL DVIEGRLADA RTHLDAGLLA
AEKSGSRVEY PFLTIGYAAL AAARDDDDTA RALFALALSH GRRAGVALRP MIDGELASLY
RSKVDRSSRE NDEALALTTS LEDLPTIIRR LIGP