Gene Namu_3961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3961 
Symbol 
ID8449580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4371040 
End bp4372044 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content72% 
IMG OID645043006 
Producttranscriptional regulator, LacI family 
Protein accessionYP_003203242 
Protein GI258654086 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0946705 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGCGCA AGCCGACGAT GAACGACGTG GCGCACCGTG CCGGCGTCGC GCTGAAGACC 
GTCTCCCGGT ATGTCAACGG CGACCCGACG ATCGGTGCGG ACTATGCCGA CCGCATCCGG
GAGGCGATCG CGGAACTGGG CTACCGGCGC AACATGGCCG CCGCCCGGAT CCGGCCCGGG
CAGAGCGCGA AGATGATCGG GCTGATCATC AGCGACCTGT CCAACCCCTA CTTCGCGACC
CTGGCCCGGG CCATCGAACT GGGGGCCGCC GCGGCCGGCT ACATGCTGAC CATCGCCAGC
TCGGAGGAGG ACGGAGCGCG GCACGACCTG CTGGTCGACC GGCTGCTGGA GCAGCAGGTG
GACGCGATCA TCGACGTCCC GCCGCGCGCG CCGGGCCGGG CCTGGCGGGA CATCCCGCCG
CCGCTGCCGC CGCTGGTGTT CGTCGACCGG CCGTCCGACT GGGCCGCTGC CGATACGGTG
CTGGCCGACA ACGCCGGAGG TGCCCGGTCT GCCACCCGGG CGTTGCTGCA CGCCGGTGCC
GGCACCGTCG CCTTCGTCGG CGACTCGGTG GAGATCTTCA CGATGGGGGA GCGGCTGACC
GGCTACCGGC AGGCCCTGGT CGAGGCCGAC CGGCCGGTCG ACGACGACCT GGTGCGGGAC
ACCGTGCACA CGGTCGACGA CGCGATGCGG GTGGTGCTCG ATCTGCTCGC GGGCGGACGG
GCGCAGGCGG TGTTCGCGGC CAACAACCGC GCCGCCCTGG GCGCGTTGCG CGCCTTCCGG
TTGGCCGAGA CGTTCCTGCC GATGATCGGC TTCGACGAGT TCGAGGCCGC CGCGCTGATC
AACCCGCCGA TCTCGGTGGT CAGCCAGGAC ATCCAGGCGA TGGGCAGGGC CGCCGCCGAC
CTCGCCGTGG CCCGGCTCAA CGGGAGCGAT ATCCCCTGCA CCACCACGGT TTTGCCGACG
TCGCTGATCC TGCGGGGGTC GGAACGGCTG CTCCCGGCGT TCTGA
 
Protein sequence
MQRKPTMNDV AHRAGVALKT VSRYVNGDPT IGADYADRIR EAIAELGYRR NMAAARIRPG 
QSAKMIGLII SDLSNPYFAT LARAIELGAA AAGYMLTIAS SEEDGARHDL LVDRLLEQQV
DAIIDVPPRA PGRAWRDIPP PLPPLVFVDR PSDWAAADTV LADNAGGARS ATRALLHAGA
GTVAFVGDSV EIFTMGERLT GYRQALVEAD RPVDDDLVRD TVHTVDDAMR VVLDLLAGGR
AQAVFAANNR AALGALRAFR LAETFLPMIG FDEFEAAALI NPPISVVSQD IQAMGRAAAD
LAVARLNGSD IPCTTTVLPT SLILRGSERL LPAF