Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3812 |
Symbol | |
ID | 8449431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4183901 |
End bp | 4186684 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645042862 |
Product | transcriptional regulator, LuxR family |
Protein accession | YP_003203098 |
Protein GI | 258653942 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.142204 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0901624 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGATC ATTTCGTCGG CCGGGACACC GAGCTGGACA CGCTCGACGA GCTTCTGGTC CAGGCCCGCG CCGGAGCCCC GCACGTCGTC CTGCTGGTCG GTGAGCCCGG GATCGGCAAG ACCACGCTGA TCGACGGGTT CCTGCGCCGG CATCGTGACG TCACGTCCCT GCGCGCGGGC GGGGACGACA GCGAAATGCT CTACTCCTAC GGAATCGTCC GGCAGCTCGC GGCGTCGGCC GGGCCGGCCG GCCTCGAGCT GGCCGGCCAG CTGGCCCAGG TCGCTCCGGT GCCCGACCCG ATCGCCATCG GCAGCCAGCT GCTGGCCCTG CTCGGCGAGC GGCAACAGGC CGGGCCGGCC GTCCTGGTCA TCGACGACAT CCAGTGGGCC GACGACCCTT CGGTCAAGTC GATCACCTTC GGGTTGCGCC GGCTTCAGGC CGACCAACTG CTGGTGATCC TCGCGGTTCG CGAGGAGTCG ATCAACGAGC TGCCGGACGG GCTGCTGCGC ATCGTCCGCG GCGAGCTGGC CACGGAGATC CGGTTGGGGG GGCTGGCCGA GCACGAACTG GCCGAGCTGG CCGGAAAGCT CGGCATCGGC AGCTTCCCGG CTCGGGCGGC CAGGCGGCTG CGATCGGGCA CGGACGGGCA TCCGTTGTTC GCCCGCGAGC TGCTCCGGGA GTTCCCGCCC GAATCCTGGG GAGCACAGGA CGTCCTGCCG CCGCCCCGGT CGTTCCGGCA CCTGGTCCGG CAACGTTACC GGCGGTGCAG CACCGACGGG CGCCGGCTGG TGGACGCGGC GGCCGTCATC GGCCTGTCCG CCCCGCTGGC CCTGACCGCC CAGTTGGCCG ACCTCCCGCA CGCGCTGGAA GCGATCTCCG ATGCCGAGCG GGCCGACCTG CTGCAGCTGG TCGACGCCTC GTATCCCGGA TCGGTCCGCT TTCCCCATCC GCTGGTTCGG TCGGCCGTGT ACTCGGCGAT CGACCCCCCG CACCGCTCCG CGCTGCACGC TCGCGCGGCC CAACTGCTGG ACGACACCAG AACGGTGCTG CAGCACCGGT TCGCGGCGGC CCGGCAACCG GACGAGGGGC TGGCCGCCGA CCTGTCCGCG TTCGCCGCCA CCGAGATTCA CAACGGCGAG TGGGTCAGTG CCGCCACGCA CTTGGTCCGC AGTAGCCGGC TCAGCCCGCA GCCGGCGGAT CGGCAACGCC GGCTCCTGCG GGCGGTCAAT TTTCTGGTCA TCGCCGGCGC GGCGTCCCAG GCCGCGCACC TGGCCGAGGA GGTCGCCTCC TTTCCGCCCG GCCCGTTGCG GGACAGCACC CTGGGGTACC TGGCGACGGC CACCCGGGGG CCCGCCGAGG CGGGGCGGCT GCTCAGCAGT GCCTGGCAGC AGGTCGATCC GGCCGCCGAC CGGGAGTTGG CGGCCACCAT CGCGCTGCAG AGCGCGATCC ATTTCCACGG CCGGCTGGAC GGCCCGGCCA CCGCCACCTG GAGCGCCAAG GCCATCGACC TCACCGAACC CGACGACCCG ATGCGGTTCG TGGCCGAGAC GCATCGAGCC TTCGGCCTTG GCTACTCCGG TCGATTCGGC GAGGCGCTGC CCGTGGTCGA AGACGTGACC GGCACGATCG ACGGCAGTCC GGACGCCCGC CGGGTCCAGC ACGGAGCCGC GCACGGCTGG CTCCGGTTAA TCAACGACGA TCTCGTCACG GCCCGCAGCA TGTTGCACGC GGTTGCGGTC GACGCCGCCC GACTGGTCAC GCTGAACACC GCGGCGTTCA GTTACGCCCA CCTCGCGCGG GCCGAGTACC TCGCCGGGGC CTGGCACGAG GCCCTGGTCC ACATCGAGAA CGCCATCGCG CTGACCACCG AATCGGAGTC GGACTACCTG CGCTCGTTGG TGTTCGGGAT GGCCGTCGTC ATTCCGGCCG CGCGGGGCGA GTGGGCGGCC GCCCGCACCT ACGCCGATCT CGCGACCGCC GAGGCCAGCT CCTACGAGCG GGCAGTGGCC GCGGCCGCCC TGGCCGGGGC ACAGCTGGCC GCCGCCCGGG ACGAGCCCGA GCGGGTCCTG GACCTGCTCG ACCCCATCCG CGCGATGCAC CCCCGTCAGG GCATCGACGA GCCGGGGATC TGGCCCTGGC CCGAGCTCTA CGCCGACGCC CTGACTGCCC TGGGTCGCAC CGCCGAGGCG GACGCGTTCC TGATCCCCCA TGAGCTGCTG GCCCACCGCC GCCGGCGCGC CTCCGCGATT GCCCGACTCG CCCGGGCGCG CGGTTCGCTG GAGTCCCGGG CGGGCCGGGA CAGCGTGGCG GACCAGGCAT TCCAGCTGGC CGCCGGGCAG ATCCAGCGGT TGGCCATGCC TTACGAGCAG GCTCTGATCG AGCTCGCCCA TGGCCGGCAC CTGGTCCGGA CCAACCGGCG TAAGGCCGCC GCCAAGCTGC TCACCGCCGC CGGCGACCGG TTCACCGCTT TGGGTGCGGC CCCGTTCCTG GCCCAGACTG TCGCCGAGCT GCGCGCCTGC GGCATCACCG TCGAAGGCAC CACCCGGCAA CGCCTGCGCA TCGGCCTGAC CTCCCAGGAA CTGGTGGTCG CTCGCCTCAT CGCCGATGGC AAGACCAACC GTGAGGCCGC CGCGGATCTC GTCGTCAGTG TGAAGACCAT CGAGTACCAC CTGAGCAACG TCTACGCCAA GCTGGGCGGC ATCACCCGGC GGCAGCTCCG CAGCGCCCTG GCCGACCACG ACCAGCAGGC TCCTCTGCAG CACCGCCGAA CCGCGTCATC GTGA
|
Protein sequence | MSDHFVGRDT ELDTLDELLV QARAGAPHVV LLVGEPGIGK TTLIDGFLRR HRDVTSLRAG GDDSEMLYSY GIVRQLAASA GPAGLELAGQ LAQVAPVPDP IAIGSQLLAL LGERQQAGPA VLVIDDIQWA DDPSVKSITF GLRRLQADQL LVILAVREES INELPDGLLR IVRGELATEI RLGGLAEHEL AELAGKLGIG SFPARAARRL RSGTDGHPLF ARELLREFPP ESWGAQDVLP PPRSFRHLVR QRYRRCSTDG RRLVDAAAVI GLSAPLALTA QLADLPHALE AISDAERADL LQLVDASYPG SVRFPHPLVR SAVYSAIDPP HRSALHARAA QLLDDTRTVL QHRFAAARQP DEGLAADLSA FAATEIHNGE WVSAATHLVR SSRLSPQPAD RQRRLLRAVN FLVIAGAASQ AAHLAEEVAS FPPGPLRDST LGYLATATRG PAEAGRLLSS AWQQVDPAAD RELAATIALQ SAIHFHGRLD GPATATWSAK AIDLTEPDDP MRFVAETHRA FGLGYSGRFG EALPVVEDVT GTIDGSPDAR RVQHGAAHGW LRLINDDLVT ARSMLHAVAV DAARLVTLNT AAFSYAHLAR AEYLAGAWHE ALVHIENAIA LTTESESDYL RSLVFGMAVV IPAARGEWAA ARTYADLATA EASSYERAVA AAALAGAQLA AARDEPERVL DLLDPIRAMH PRQGIDEPGI WPWPELYADA LTALGRTAEA DAFLIPHELL AHRRRRASAI ARLARARGSL ESRAGRDSVA DQAFQLAAGQ IQRLAMPYEQ ALIELAHGRH LVRTNRRKAA AKLLTAAGDR FTALGAAPFL AQTVAELRAC GITVEGTTRQ RLRIGLTSQE LVVARLIADG KTNREAAADL VVSVKTIEYH LSNVYAKLGG ITRRQLRSAL ADHDQQAPLQ HRRTASS
|
| |