Gene Namu_4212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4212 
Symbol 
ID8449838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4656883 
End bp4658373 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content70% 
IMG OID645043261 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_003203490 
Protein GI258654334 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.134321 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGCGG CCGCGATCGT GCTGATCGGG TCCGGAGTGG CCTACGCCTT CACCGACACC 
CTGCAATCGG TCAGCGGCAC CTCCGACGCC GCCGGCAACG CCGGTGACGC CGGCGTCATC
TCGGCGCAGG GCCTGACCTT CCTGCTGGTC GGCTCGGACG CGCGGACCGA CGCCGACGGC
AACCCGCTCT CCGCCGAGGA ACTCGCCCAG GTCGGGACCG AGGACGACGG CGGCGGCATC
AACACCGACA CGATCATGCT GGTGCACGTG CCGCAGGGCG GTGGCCGGGC GACCGCCGTC
TCGATCCCCC GCGATACCTG GATCGGTAGC CGGGTGACCA GCCAGATCAC CGGCCCGTAC
GCCAACGGCT CGGAAGGGCC CTACCAACCG AACAAGGTCA ACTCCTTCTA CGCCACGGCC
AAGGCCTACA CCGAGCAGTA CCTGGTCAGT CAGGGTGTGA CCGACAAGGC CCAGATCGAG
CGGGAGTCGA ACGAGGCGGG CCGGACCCAG CTGATCCGGG TGATCCAGGC GTTCACCGGC
ATGAAGATCG ACCACTACGC CGAGGTCAAC CTGCTCGGCT TCTACCTGCT GTCCAACGCC
ATCGGCGGGA TTCCGGTGTG CCTGAACGCG GCTGTCGACG ATCCCTGGTC GGGCGCGAAC
TTCCCGGCCG GCGAGCAGGA GGTGCAGGGC ACCGCCGCGT TGGCCTTCGT CCGGCAGCGG
CATGGCCTGC CGGCCGGCGA CCTGGACCGG GTGAAGCGGC AGCAGGCCTT CCTGCGCGGC
GCCGCCGACA AGATCCTGTC GGTGGGCACC CTGACCAGCC CGACCAGGCT GAACGACCTG
GTCAGCGCGG TCGACCGCTC GGTCGTCTTC GACAAGGGCT TCGACGTGCT CACCTTCGCC
GAGCAGATCA GCAACCTGTC CTCGGGCAAC ATCGACTTCC AGACGCTGCC GACCACCGGG
CCCGAGTCCT CTACCGACAA GGACGCCCTG GCCACCGACC CGGCCCAGAT CAAGGCCTTC
TTCCAGGCGA TCGCCGGCGG CTCGAGCAGT GGCTCGGGCG GTGGGTCGAC CCAGGCCCCG
CCGTCGGTCG ACGCCGCGTC CATCACCGTG GACGTCAACG ACGGCACGAT CGCCGACGGC
GTGACCGCGC GCGCATCGGA ACTGGTGACC GACGGTGGTT TCACCCTCGG CGCGCTCGGG
GTGATCGCCG GCACCCGCAA GGGCAACGAG CAGACGACCA CCGAGTTCCA TTACTCCGGT
GACGAGGCCG CCGCCCAGCA GGTGCAGTCG GCCTTCGGCG GGATCGGCAA GCTGGTCAAC
GATTCCGCGG TCAAGGCCAA CCACGTGCTG GTGCTGGTCG GCACCGACCT GACCTTGCCC
TCCGGACTGC GCGGGGCGGC GTCCGCCTTC GCCACCGCGG CGGCGGCCCC GGCCGTCGCG
CCCGCGCCGG CTGCCTTCAC ACTGGCCGCG TCGGTGCCAT GCGTGAACTG A
 
Protein sequence
MVAAAIVLIG SGVAYAFTDT LQSVSGTSDA AGNAGDAGVI SAQGLTFLLV GSDARTDADG 
NPLSAEELAQ VGTEDDGGGI NTDTIMLVHV PQGGGRATAV SIPRDTWIGS RVTSQITGPY
ANGSEGPYQP NKVNSFYATA KAYTEQYLVS QGVTDKAQIE RESNEAGRTQ LIRVIQAFTG
MKIDHYAEVN LLGFYLLSNA IGGIPVCLNA AVDDPWSGAN FPAGEQEVQG TAALAFVRQR
HGLPAGDLDR VKRQQAFLRG AADKILSVGT LTSPTRLNDL VSAVDRSVVF DKGFDVLTFA
EQISNLSSGN IDFQTLPTTG PESSTDKDAL ATDPAQIKAF FQAIAGGSSS GSGGGSTQAP
PSVDAASITV DVNDGTIADG VTARASELVT DGGFTLGALG VIAGTRKGNE QTTTEFHYSG
DEAAAQQVQS AFGGIGKLVN DSAVKANHVL VLVGTDLTLP SGLRGAASAF ATAAAAPAVA
PAPAAFTLAA SVPCVN