Gene Namu_3369 details

Gene Information

Locus tag: Namu_3369
Symbol:
ID: 8448984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3707590
End bp: 3708798
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 75%
IMG OID: 645042446
Product: putative transcriptional regulator, GntR family
Protein accession: YP_003202686
Protein GI: 258653530
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
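
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon encodes one residue, except the final stop codon.
    return gene_length // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(3707590, 3708798)
print(length)                     # 1209
print(protein_length_aa(length))  # 402
```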


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0184281
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0139033
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGCTG ACCCCGCAGC GCCGGCGTCC CGACGGATCG CCGCCCGGTT GGCCGGCGTG 
CGCAGCTCGC CGGTGCGCGA CCTGCTCGCG CTGGCCGACC GCCCCGAGAT CATCTCCTTC
GCCGGGGGCC TACCCGCGCC GGAACTGTTC GACCTCAACG GGTTTCGCGA TGCCTTCGGC
CAGGCCCTGG CGGCCGCGCC GGGCCCGGGG AACCTGCAGT ACGCGGCCAC CGAGGGCAAT
CCGCGGCTGC GGCAGCAGGT GGCCGACCGG CTGACCGGCC GCGGCGTACC GACCGGCGCG
TCCGACGTGT TGATCACCAG CGGCTCGCAG CAGGCGCTGA CCCTGATCAC CACCGCCCTG
CTGGATCCGG GTGGCGTCGT GGCCGTGGAG AGCCCCACCT ACCTGGCCGC GCTCCAGGCG
TTCCGGCTCG CCGACGCGCG GGTCGTGCCC GTGCCCGGCG ACGACGACGG CCTGGATCCC
GACGCGCTGC AGGCGACGAT CCGCGAGCAC CGGCCGACCC TGCTGTACCT GGTGCCGACC
TTCGCCAACC CGACCGGCCG GACGATGAGC CGGGCCCGCC GGCAGGCCGT GGTCGACATC
GCCGCCGCCC ACGGCCTGTG GGTGATCGAG GACGACCCGT ACGGCGAGCT GCGTTACGAC
GGGCCGGCGG TCCCCCTGAT GGCCGGCCTG CCCGACGCGC GCGAATGCGT GCTGCATCTG
GGTTCGTTCT CCAAGATCGG CGCCCCGGGC CTGCGCTTGG GCTGGGTGCG CGCGCCGCAG
TCGCTGCGCC CGTCCCTGGT GGTGGCCAAG CAGGCCGCGG ACCTGCACTC CTCGACGATC
GATCAGGCCG CCGCGGCGAT CTATCTGGAT ACCGGTGCCC TGGACGAGCA CGTCACCGGC
CTGCGCCGGG TCTACGGGCA ACGCCGCGAC GCGATGCTGG CCCGGCTGCC CAGCGCCCTG
CCGGCCGGCG CAACCTGGAC CCGACCGGCC GGCGGCATGT TCGTCTGGGT CACCCTGCCC
GGCGGCCGGG ACGCGGCCGC CGATCTGCCG GCCGCCCTGG ACGGTGGGGT CGCCTTCGTC
CCGGGTGCCG CGTTCTTCGC CGCCGACCCC GATCCCGCCA CCCTGCGCCT GTCGTTCACC
ACCCACGCAC CGGCGGTGAT CGAGGAGGGT CTGGGCCGGC TGGCCGGCGT TCTGCGCAGC
CGCTCCTGA
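
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to run basic sanity checks (GC percentage, start codon, stop codon). It is a hedged illustration: the helper names are hypothetical, the demo string is a made-up fragment rather than this gene, and only ATG is checked as a start codon even though translation table 11 also permits alternative starts.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3, starts with ATG, ends with a stop."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Hypothetical short fragment, used only to demonstrate the helpers.
demo = join_blocks("ATGCCG GCT\nTGA")
print(demo)
print(looks_like_complete_cds(demo))
```

Pasting the full gene sequence into `join_blocks` and passing the result to `gc_percent` gives a value that can be compared against the GC content field in the Gene Information table.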
 
Protein sequence
MPADPAAPAS RRIAARLAGV RSSPVRDLLA LADRPEIISF AGGLPAPELF DLNGFRDAFG 
QALAAAPGPG NLQYAATEGN PRLRQQVADR LTGRGVPTGA SDVLITSGSQ QALTLITTAL
LDPGGVVAVE SPTYLAALQA FRLADARVVP VPGDDDGLDP DALQATIREH RPTLLYLVPT
FANPTGRTMS RARRQAVVDI AAAHGLWVIE DDPYGELRYD GPAVPLMAGL PDARECVLHL
GSFSKIGAPG LRLGWVRAPQ SLRPSLVVAK QAADLHSSTI DQAAAAIYLD TGALDEHVTG
LRRVYGQRRD AMLARLPSAL PAGATWTRPA GGMFVWVTLP GGRDAAADLP AALDGGVAFV
PGAAFFAADP DPATLRLSFT THAPAVIEEG LGRLAGVLRS RS