Gene Namu_3846 details


Gene Information

Locus tag: Namu_3846
Symbol:
ID: 8449465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4217138
End bp: 4218184
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 71%
IMG OID: 645042895
Product: transcriptional regulator, LacI family
Protein accession: YP_003203131
Protein GI: 258653975
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
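
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; neither is stated in the record, although both are consistent with the listed lengths. The function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, ASSUMING one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = span_length(4217138, 4218184)
print(length_bp)                  # 1047
print(protein_length(length_bp))  # 348
```

Under these assumptions, both results match the Gene Length and Protein Length fields.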


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.00500845
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.20128
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAAGC CCACTCTCCG CGACGTCGCG GACCGTTCCG GCTTCTCCAT CACCACCGTC 
TCCCAGGTTC TCAACGATGT GCCGGGCAAG CGCATCCCGG ACGCCACCCG CGACCGGGTC
CGGGCCGCCG CGACCGAGCT GGCCTACCGG CCGAACCGGT TGGCCCAGGG CCTGCGGCTG
CAGCGGTCCA ACACCCTGGG GTTCGTCAGC GACAAGATCG CGACCACGCC CTACGCCGGT
GAGGTGATCC TGGGCGCGCA GGACGCGGCC GCCGAGCACG GCGACCTGTT GTTGCTGATG
AACTCCAACA GCGACCCGGG TCTGGAGGAA CGCGAGATCC GCGCCCTGCA GGAGCGCCAG
GTGGACGGCA TCATCTTCGC TTCGGAATAC CACCGGGTGA TCACGCCGCC GGACGCGTTG
CAGGGCACCC CGGCGGTGCT GCTGGACGCC CGCTCGGTCC GTGGCGATGT CAGCTCGGTC
GTCCCCGACG AGGTCGGCGG CACGTTGGCC GCGGTGCGCG AACTGATCGC CGCCGGCCAC
CAGCGCATCG CCTTTCTCAA CAACGTCGAC GACATTCCCG CCACAGCCCT GCGTTTGCAG
GGATTCCGGC AGGGTCTGAA CGAGGCCGGC CGGCGCCTGC GCGCGGGCAT GGTGGTGACC
GCCGCCTCGA CCCCGGGCGC CGGTTATGAC GCGGCCCGGC AGTTGCTCGA CCAACCCCGG
GCCGGCCGAC CGACCGCGAT CTTCTGCTTC AACGACCGGA TGGCGATGGG CACCTACCAG
GCGGCCGCCG AACTGGGCCT GCGCATCCCG GACGACCTGT CGGTCGTCGG CTTCGACAAC
CAGGAACTGA TCGCGGCGAA CCTGCGCCCC GGTCTGACCA CGGTGGCCCT GCCCCATTAC
GCGATGGGCC AGTGGGCGGT CGCCGCCCTG CTGGACCTGA TCGACGCCCA GGCCGACCCA
TCCCAGAAGC GACAGCCGAT CCGCGAGGAG AAGCTGCCCT GCCCACTGGT GCGCCGTGCA
TCGGTGGGCC CGCCGCCCCG GTCGTGA
 
Protein sequence
MGKPTLRDVA DRSGFSITTV SQVLNDVPGK RIPDATRDRV RAAATELAYR PNRLAQGLRL 
QRSNTLGFVS DKIATTPYAG EVILGAQDAA AEHGDLLLLM NSNSDPGLEE REIRALQERQ
VDGIIFASEY HRVITPPDAL QGTPAVLLDA RSVRGDVSSV VPDEVGGTLA AVRELIAAGH
QRIAFLNNVD DIPATALRLQ GFRQGLNEAG RRLRAGMVVT AASTPGAGYD AARQLLDQPR
AGRPTAIFCF NDRMAMGTYQ AAAELGLRIP DDLSVVGFDN QELIAANLRP GLTTVALPHY
AMGQWAVAAL LDLIDAQADP SQKRQPIREE KLPCPLVRRA SVGPPPRS
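
The sequences above are printed in blocks of 10 residues separated by spaces and line breaks. As a minimal sketch (the helper names are hypothetical and are not part of any database tool), the grouping can be removed and the GC fraction computed in Python as follows. Only the first line of the gene sequence is used here, so the printed GC value describes that line alone and not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(block.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases in the DNA string that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGGGCAAGC CCACTCTCCG CGACGTCGCG GACCGTTCCG GCTTCTCCAT CACCACCGTC"
dna = clean_sequence(first_line)
print(len(dna))                   # 60
print(round(gc_content(dna), 2))  # GC fraction of this first line only
```

Passing the full gene sequence block to `clean_sequence` in the same way gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields.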