Gene Namu_2193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2193 
Symbol 
ID8447804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2421407 
End bp2422564 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content73% 
IMG OID645041315 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003201559 
Protein GI258652403 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00163999 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00133858 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACGCCCC CGCACCCCGG CCCGCCCGAG GCCGGTGGCG TGTCCCCGAC GCTGGACGTG 
CGACTGCTCG GACCGCTGGA GCTGCGCCTG GACGGCCGGC CGATCCCGTT GCCGGGCGGC
AAACCCAAGG CCGTGCTGGC CGGGTTGCTG GTCAGCCGCA ATCGCGTGGT GCCGGCCGAC
TCTTTGGCCG ACGCGATCTG GGACGGTGAG GTGCCGGCCA ACTTCCTGGC CACCCTGCAG
GTCCACGTGT CCGCCCTGCG CCGGGCGCTG CGCCCGGTGT CCGACCCGGG GCTGCTGACC
GTCACCACCC AGTCGCCCGG CTACCGCGTC GTCGTCGACG ACGCACTGGT CGACGTCGGC
CGGTTCGGCC GGTGGGCGCG GGCCGGCAGC GACCTGCTGA CCGCCCGCCG CTACGCCGAG
GCGGCCGACC GGCTACGGGC CGCCCTGGCC GAGTGGTCCG GGTCCGCGCT GGCCGACCTG
CAGGGCCTGC GGTTCGCCGA CGACTTCGCC GCCGCCGTGG AGGAGGAGCG GCTGGTCGCG
TTGCAGGCCC GGATCGAGGC CGACCTGGCC TGCGGGATGG AGTCGGCGGT GGTCGGCGAA
CTGGTCACCC TCACCGGCCA GTACCCGTTG CGCGAGCCGT TCTGGATCCA GCTGATCACT
GCCCTGTATC GCTCGGGCCG GCAGGCGGAC GCGCTGGACG CGGCCCGCCG CATCCGGACC
CTGCTCGACG ACGAACTCGG CATCGATCCC AGCCCGGCGC TGCGGGACCT GGAACGGCAG
GTGCTGCGCC AGGAACTGGC AGCGCCCGGA CCGGCCCCCG TGCCGTCGAT GCAGCGCACC
GTGGCCGAGA CCGCGGTTGT GCTGTCCAAG GCCCGGGTGC GACTGCCGTC CGGGGAATCG
TTGCCGGTGC CCAGCCGGGG TCTGCGCCTG GGCCGGATGG ACGACAACGA CCTGGTGATC
GCCGGGGAGA AGGTCAGTCG CTACCACGCG GTGATCGGCG AATCCGCGAA CGGCTTCACG
GTGACCGACC TGCGCTCCAC CAACGGCACC CACGTCAACG ACGAGCGGGT GGTGGAGAGC
CATCTGCTGC GCGACGGGGA TCGGATCCGC ATCGGCGGCA CCGAATTGAC CTTCCAGCTC
GACGCCGAGC CCGCCTGA
 
Protein sequence
MTPPHPGPPE AGGVSPTLDV RLLGPLELRL DGRPIPLPGG KPKAVLAGLL VSRNRVVPAD 
SLADAIWDGE VPANFLATLQ VHVSALRRAL RPVSDPGLLT VTTQSPGYRV VVDDALVDVG
RFGRWARAGS DLLTARRYAE AADRLRAALA EWSGSALADL QGLRFADDFA AAVEEERLVA
LQARIEADLA CGMESAVVGE LVTLTGQYPL REPFWIQLIT ALYRSGRQAD ALDAARRIRT
LLDDELGIDP SPALRDLERQ VLRQELAAPG PAPVPSMQRT VAETAVVLSK ARVRLPSGES
LPVPSRGLRL GRMDDNDLVI AGEKVSRYHA VIGESANGFT VTDLRSTNGT HVNDERVVES
HLLRDGDRIR IGGTELTFQL DAEPA