Gene Namu_0093 details


Gene Information

Locus tag: Namu_0093
Symbol:
ID: 8445673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 104183
End bp: 105820
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 60%
IMG OID: 645039241
Product: putative transcriptional regulator
Protein accession: YP_003199516
Protein GI: 258650360
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
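
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 104183
END_BP = 105820


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of gene_len bp.

    Assumes an unspliced CDS whose length is a multiple of three and
    whose final codon (if has_stop_codon is True) is a stop codon.
    """
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of three")
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1638
print(protein_length(length_bp))  # 545
```

Both results agree with the Gene Length (1638 bp) and Protein Length (545 aa) fields listed in the record.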


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATATC CACTGTTCCA TGAGCTCGGA CCGGCCCAGG CTCCAATGGT GCTCAGTCAG 
GCCGACTTCG CTCGGGCGTT CCCCAGCGAG GGGGATTACG TCGAGTTCAA GCAAGGGATC
CCGGAGTCCA AGGTCGCAGA GGCCGTCGCT GCGTTCTCCA ACGCGAACGG AGGTGTCGTC
CTCCTTGGTG TCAACGACAC GGGCCGAGTA ACCGGCATTG GCTCCGACGG CGAGACACAG
GCACGCATCC ATCGGGCGGT TGCAGCCGTT CGCGACCCAG GACGATACGA ACTACATGTG
CTCCAAGTCG AAGATCGTCA CGTTCTCGCC CTCGCCGTCC ACCGACGGCG CGAAGGCTTT
GCGCAGATGC ATGACGGACG ACTTCTCATA CGTCGCGGAG CGATGAACTC TGCCCTCATG
GGGAATGAGC TCGCCGTCTT TGTTTCCGGA CGCGCTCTGA CCCGCTTCGA GCAAACACCT
GTCAACACGC TGATCGGAGC GGCCGATCCC AATCTAGTCA CGAAACTTAT CGAGACCTTC
GGCTGGGGAT CCGAGGGCAC ATTGGCGCGA TTGTTCGAAG GCGGTTTGAT TGACACACCC
ACCGAACGGA GCCCGCTGAC GGTAGCCGGC GCTCTCTACC TCCTTCCCCG GCCCGCCGAC
GTTCTCGGAA AAGCCTACAT CGAGATCTTC CGCTATAGAA ATCCGGGCGA GGAGTATGAT
TCTCGAATAG AATTCTCCGG CCCTGCGGAC CAACAGGTCG GAGACGCCAC AGAACACATC
ATGAAAGAAC TCGGCTCGGA TATCGTCATC CTTGGACTTT ACCGCCATGA ACTACCACGT
ATTCCACAAC CAGTACTCCG TGAGGCTTTG GCAAATGCAG TTGCCCATCG ATCGTACGAA
AGCGCACGTC AGTCCATCCG AGTCGAGATT CATCGGGATC GAGTAACCAT CAAGTCACCA
GGCGGCCTAC CCGAACCAGT GACGATCGCC AATATGCGAC AACAGAACGC CTCGAGGAAC
GCTACTGTCA TCAGAATCCT ACGAGCGATG CGTCTGGCTG AGGACGCTGG GCGCGGTGTC
GATGTCATGC AAGATGAGAT GGCTGCTGCG ATGCTCAACC AGCCGATCTT CGATACCGAT
GGGCACCATG TCGAGGTTGT ACTGCCCTTA GGGAGTGCTG TTTCGCCCCC TGAACGTGCA
TGGCTGGCAG AGATCGAACG CCGCGGATCC ATCCGACCCG ACGATCGCCT TTTGCTTGTC
CATGCCGCCC GAGGTGAGTT GCTCACCAAC ACCGTCGCGC GGGAATTGTT AGGGGTGGAC
AGCACTCATG CGCGAGCGTC TCTGGGTCGA TTGAAGGCCA TGGGATACGT ACAGCAGCAC
GGAGAGCGCG GGGGCGCCAC CTACTCGTTA GCCCACGAGT TGGCACCCCC TCCCGGACTA
GGTCTTGCCC AGGACGACCT GCGATCTTTG GTCATTGGCC TTGCCAAGTC GGCTCCAATT
ACCAACGAAT CGGTCCGCGA GCGCACAGGA CTTGACCGCG CAGCCGCGCT TCGGCTCTTG
TCGGAGCTCA CCAACGACGG CCTTCTTGTT CGGCATGGTT CCCGTCGTGG AACCTTCTAC
ACGCTCGCCG AGAAGTAG
 
Protein sequence
MAYPLFHELG PAQAPMVLSQ ADFARAFPSE GDYVEFKQGI PESKVAEAVA AFSNANGGVV 
LLGVNDTGRV TGIGSDGETQ ARIHRAVAAV RDPGRYELHV LQVEDRHVLA LAVHRRREGF
AQMHDGRLLI RRGAMNSALM GNELAVFVSG RALTRFEQTP VNTLIGAADP NLVTKLIETF
GWGSEGTLAR LFEGGLIDTP TERSPLTVAG ALYLLPRPAD VLGKAYIEIF RYRNPGEEYD
SRIEFSGPAD QQVGDATEHI MKELGSDIVI LGLYRHELPR IPQPVLREAL ANAVAHRSYE
SARQSIRVEI HRDRVTIKSP GGLPEPVTIA NMRQQNASRN ATVIRILRAM RLAEDAGRGV
DVMQDEMAAA MLNQPIFDTD GHHVEVVLPL GSAVSPPERA WLAEIERRGS IRPDDRLLLV
HAARGELLTN TVARELLGVD STHARASLGR LKAMGYVQQH GERGGATYSL AHELAPPPGL
GLAQDDLRSL VIGLAKSAPI TNESVRERTG LDRAAALRLL SELTNDGLLV RHGSRRGTFY
TLAEK
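
The sequences above are printed in blocks of ten characters, so the spaces and line breaks must be removed before any analysis. The sketch below shows one possible way to do that, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions, not the method used to generate this record: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs only in its permitted start codons).

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order; "*" marks a stop.
# Translation table 11 uses these same assignments.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the blocked layout."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = clean_sequence(
    "ATGGCATATC CACTGTTCCA TGAGCTCGGA CCGGCCCAGG CTCCAATGGT GCTCAGTCAG"
)
print(translate(first_row))  # MAYPLFHELGPAQAPMVLSQ
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.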