Gene Namu_3887 details

Gene Information

Locus tag: Namu_3887
Symbol:
ID: 8449506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4287505
End bp: 4289505
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 73%
IMG OID: 645042935
Product: transcriptional regulator, SARP family
Protein accession: YP_003203171
Protein GI: 258654015
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
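
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the gene length includes one stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4287505
end_bp = 4289505
gene_length_bp = 2001
protein_length_aa = 666


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_len, stop_codons=1):
    """Nucleotides needed to encode a protein plus its stop codon(s)."""
    return 3 * (protein_len + stop_codons)


print(span_length(start_bp, end_bp))      # 2001
print(coding_length(protein_length_aa))   # 2001
```

Both values agree with the listed gene length of 2001 bp: 666 codons for the protein plus one stop codon.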


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.000943372
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.402378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGAA CCCGCTCGGA ATTGTTCGCG ATTATCGGCA ATGATCTCGA GAGCGGCCGT 
TCCGGGTCCG GTCTCGAACC GACGCGCCGG GACGACAGCA ATCCGAGATC CGTCCTGCAA
TTCCGCATCC TCGGCCCCGT CCAGGTCTTC TGCCACGGCA GCGAGGTCCA GCTGGGGGGC
TCGAAGCAAC GCACGGTGCT GGCCACCCTG CTGCTGGCCC GGGGCCGGGT GGTCACCGAC
GCGGCCCTGA GCACGGTGTT GTGGGGTGAG GATCCGCCGC GGACCTCCAG CGCGCAGATC
TACACCTACG TCTCCCGGTT GCGCCGCAGT CTGGGCGAAC AGGTCGAGCT GGTGCGCAGC
GGCCAGGGCT ACGCGCTGCG GGCGCCGCAC GCCTGGTTCG ACCTCGACGA GTACCTCAGC
CTCACCCGGC TGGGTCGGGC GACCCTGGAA CAGGGACGGC CCGACGTCGC CACCTTGCAC
CTGGCCGCCG CGTTGGCCCT CTGGCGGGGC GCGGCCCTGG GATCGGGCAC CGAGTTCCTG
GCCGAGACCG AGGTCGCGGC GTTGGAGGAG AGCCGGCTGA GCACCCAGGA ACTGTGGGTC
GAGGCCGAAC TGTCGCTGGG CCGGTGCCGC GGCCTGATCG CCGAACTCAC CTCGCTGGTG
GCCGCCCATC CCCTGCGGGA ACGCTTCCGG GCCCAGCTGA TGACCGCCCT GTGGCGCTCG
CACCGGCGGG CCGACGCGCT GCGGACCTTC TTCGAGGGCC GGGAGCTGCT GGCCGACGAG
CTCGGCGTCG ACCCGAGCCC ACTGCTCACC GAGCTGTACG AGGAGATCGT CGCCGAACCG
GCGGACGGCC CGACGGTGCC TGGCGCGTCC GCCGACGGCC CGACCGGTCC CGTCGGCCCC
CGAGCGCCGG CGCCGGCGCC GGCGCCGGCC ATGCTGCCGC CCGATCTGGC CGACTTCACC
GGGCGGCGCA CCGAGGCGGC CCGGCTCAGC AGCTGGCTGG GATTCCAGCA CCCGGCGACC
CCGCCCCGGC CGCACACCCC GGCCCCGTGC GCGGTGGCCT GCGACGGGCG CCCGCGGATG
GCGCTGATCT CCGGTCCCCC GGGTGTCGGC CGGTCGTCGC TGGCCATCCA CGTCGCCCAG
TTGGGCCGCC AGCTGTATCC CGACGGGCAG CTGTTCGTCG ATCTGGGCGG GCCCGGCCGG
CCGCGGGTGG ACGTGCGCGA CGTGCTGGCC TGGTTCCTGC GGGCCCTGGG CGCCACCGCC
GATCAGATTC CCGGCGACAC CCAGGAACGC GCGCAGGTGT ACCGGAGCAT GCTGGCGCGC
CGGCGGGTGC TGGTGGTGCT GGACAACGCC GTCTCCGACG AACAGGTCCA CCTGCTCCTG
CCGGCCGGCG CGGGCTGCGG AGTGCTGATC ACCAGCACCG AACCACTGGC CGCGGTACCG
CTGAACCGGC AGATCGATCT CGGTCCGTTC GGCATGGACG AGGCGCTGGC GTTCCTGGGC
CGCGCGGGCG GCCAGGACCG GGTACGAGTG GAGCGGCCGG CGGCGGTGGA GCTGGTCAAC
AGCTGCGGCC GGTTCCCCCT GGCCCTGCGG ATCCTGAGCC TGCAGTTGAG CCGCAAGCCG
CACTGGTCGC TGCGGCAGAT GGTCACGCAC CTGCACGCCG ACGCAACCCG GCTGGACCGG
CTGCAGGCCG GGGCGCTGCA CATCCGACCG GCCCTGGACC GGTTGTTCGA CGCGATCGAG
GAGGGCCGGC TCACCCAGAT CCGGCTGCTG GCCGACCTGC CAACCCCGAC CTTCACCGCG
GACACGGTCG GCCGGTTCCT GGGGATGCCC GAGAGCCTGG CCGAGCACGT CTTGGAACAG
CTGCTCGACC GGCGGCTGCT GGAGGTCATC GGGCTCGATG CCGGCCGCCG CCCGCTCTAC
ACCTTCCCGC CGCTGACCCG GTTGGCCGCC CGCGAGCTGC GGCGCGGGGC CGGAACCCGG
CCGGTGGTCG AGGGCGCCTG A
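
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. A minimal sketch of that clean-up and of a GC-fraction calculation follows. The helper names are hypothetical, and how the database computed its reported GC content is not stated here, so the result for the full sequence may differ slightly from the listed figure.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Demonstration on the first two blocks of the gene sequence.
excerpt = clean_sequence("ATGACGGGAA CCCGCTCGGA")
print(len(excerpt), gc_fraction(excerpt))
```

Pasting the whole listing into `clean_sequence` should give a string of 2001 bases if the record is complete.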
 
Protein sequence
MTGTRSELFA IIGNDLESGR SGSGLEPTRR DDSNPRSVLQ FRILGPVQVF CHGSEVQLGG 
SKQRTVLATL LLARGRVVTD AALSTVLWGE DPPRTSSAQI YTYVSRLRRS LGEQVELVRS
GQGYALRAPH AWFDLDEYLS LTRLGRATLE QGRPDVATLH LAAALALWRG AALGSGTEFL
AETEVAALEE SRLSTQELWV EAELSLGRCR GLIAELTSLV AAHPLRERFR AQLMTALWRS
HRRADALRTF FEGRELLADE LGVDPSPLLT ELYEEIVAEP ADGPTVPGAS ADGPTGPVGP
RAPAPAPAPA MLPPDLADFT GRRTEAARLS SWLGFQHPAT PPRPHTPAPC AVACDGRPRM
ALISGPPGVG RSSLAIHVAQ LGRQLYPDGQ LFVDLGGPGR PRVDVRDVLA WFLRALGATA
DQIPGDTQER AQVYRSMLAR RRVLVVLDNA VSDEQVHLLL PAGAGCGVLI TSTEPLAAVP
LNRQIDLGPF GMDEALAFLG RAGGQDRVRV ERPAAVELVN SCGRFPLALR ILSLQLSRKP
HWSLRQMVTH LHADATRLDR LQAGALHIRP ALDRLFDAIE EGRLTQIRLL ADLPTPTFTA
DTVGRFLGMP ESLAEHVLEQ LLDRRLLEVI GLDAGRRPLY TFPPLTRLAA RELRRGAGTR
PVVEGA
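
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the compact base-order string used for NCBI genetic codes. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard assignments are used here. The `translate` helper is illustrative, not the database's own tooling.

```python
from itertools import product

# Codon order: first, second, third base each cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence.
print(translate("ATGACGGGAA CCCGCTCGGA ATTGTTCGCG"))  # MTGTRSELFA
print(translate("CCGGTGGTCG AGGGCGCCTG A"))           # PVVEGA
```

The two outputs match the first and last residues of the listed protein sequence, and the final codon TGA is the stop.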