Gene Information |
Locus tag | Namu_1911 |
Symbol | |
ID | 8447518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2104202 |
End bp | 2107174 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 645041041 |
Product | transcriptional regulator domain protein |
Protein accession | YP_003201289 |
Protein GI | 258652133 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
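The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon; the record does not state either convention. The function names `feature_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Namu_1911 record above.
gene_len = feature_length(2104202, 2107174)
aa_len = protein_length(gene_len)
print(gene_len, aa_len)
```

Under these assumptions the results agree with the Gene Length (2973 bp) and Protein Length (990 aa) fields of the record.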
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.345318 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000588756 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGACCAGAC CCGTGGTGCT GACCGAGAAG CTGCGCCGCC CCGAACCGGT GGGGTTGGCC CGGCGCCGGC TCGAGGAGCA GTTGCTCGCG CCCTCGACCA GCCGGTTGGA CCTGGTGGTG GCGCCGCCCG GATCGGGCAA GACAACCCTG TTGGCCCGGG TGGCCGCGGC CGCCGAGAGC GCGGGCAGTG CGGTTGGCTG GTACCGGGTG ACCGCCGACG ACGCCACCGA GTCGGCCTTC GTGGCCCACC TGGCTCGGGC GCTGCACCTG GACACCGCCG CCGACGAGTC GTCGGTGACC GACATGCTCG AGTTGCTCGA TCGGTCGCCG ACCGCGCCCA CCCTGGTGGT GTTCGACGAC GTCCATGAGA TCGCCGGCTC GGTCGCCGAA CGGGCCCTGG ACCAGTTCGT GCAGCTGCGG CCGCCGCACC TGCGGGTGCT GGTCGGTTGC CGCCGGCCCC CGGAACTGAA CATCCCGCGG CTGCGGGTGT CCGGCCAGTT ACGCGAGATC ACCAGCGACG ACTTGCGTTT CCGCTCCTGG GAGGTCGAGG AGCTGTTCCT GTCGGTGTTT CGGGAACCGC TCTCGCCGGA GTCGGCGGCC GCGCTGACCC GGCGCACCGG CGGCTGGGCG GCCGGCCTGC AACTGTTCCA CCTGGCCACC TCCGGACGCA GCGCGGTCGA CCGCGGGCGG GCCGTCGACG ACCTGGGTGG CCGGTCCAAG CTGGTGCGCT CGTACCTGGC CCGCAACGTG TTGGCCGAGC TGCCCGAGGA ACGGCGGCAC TTCCTGCTGC GCACGTGCAC GTTGGGCACG CTGACCGGAC CGCTGTGCGA CGCGCTGCTC GGGGTGACCG GCAGTTCCAC CATCCTGGAC GAGCTGGAAC ACCGGCAACT GTTCACCTCC TCCGACGACG ACGGCCACAC CTTCCGGTAC CACGAGGTGC TGCGCACGCA TCTGGAGTGG GCCCTGGTCC AGGAGTACGG GGCGGCCGGC GCGCGCGAGT GGTACTCGCG CAGCGCCGCG TTGTTGGAGT CGGTCGGCGA CCAGCGGGCG GCGGTGCGGG CCTACGCGCG GGCCGAGGAC TGGGGTGCGG TCGGCCGGCT GTTGCAGACC CGCGGCACCG GCCCGGATGC CGGGGCCGCC GGCACCGACC TGCTGCTGCC GCCGGCGGTG GTCGAGCACG ATCCGTGGTT GTCCCTGGCC CAGGCCCGCC GCCGGGTCCG CGAGGGCGCC CTGACCGCCG CGGTCGACCG TTTCCGCCGG GCCGAGAGCC TGCTCGACGA ACCGGATTTC CGCGACGACT GCCAGCGGGA ACGATCGGTC GCGGCCATGT GGGCCTCGAC CGACACCACC GGCATGAGCT GGGCCGTGGA CGGGCGCGGG TCGGCCCGGC ACTGGTCCAT CCCGATCCGC CTGGCCACCC GGCGGCTGGG CCCGGCCCGG GTCGCTGCCC CGGCGCTGAC CGGATCGACC CCGGCGCCGC ACGAACTGGC CCGCGGCATC ATCAGCCTGC TGGCCGGCGA CTTCCGCGAC GCCCGCCGCA CGCTGGACGC CGTCGTGGCC GACCGGGCGG CCGAGTCGAC CCACCGGTTG CTGGCCGACC TGACCATTGC CGTCATCGAC CTGATCCGCG ACCGCCCCGG GGATCCGGCG AGCCGGTTGG GCCAGATCGC GCTCGACGCC GAGGTGGCCG GGTTGCCGTG GATCGCCCGG CTCGCCCGCG GGCTCGGCGA GTGCGTGCTG GCGGCCACCG ACCCGGCCGC GTGGCGGCTG GCCGCCTGCG TCGATCAGAT CGACGAGTGT 
GACGAGGCCG GCGACGCCTG GGGCGCCGCC CTGCTGACCT TGGCCGCCGC CGTCACCGGC CTGCTGGCCG ACCCAGTGGC CGAGGCGCAT GGATTTGCCG ACGCGGCCGC GCGCTTCCGC CGGCTGGATG CCCCGGTCCC GGCCCTGTGG GCGCAGGCGT TGCAGGCCTG CGCCCTGGCC GGCCGGCGCC GCCCGGGCGC GGCCGAGTTG GCCGGCCGGG TGGCCACCGA GGCGCGCGCC GCGCAGGTCA CCGGGGCCCA GGCCCTGGCC CAGATCGCCG CGGCGATGGC CGGGGCGCCG GTCGGCACCG CGCCGGAAGC GACCGGCAGC GAACTGGACC GGATGATCGC CCTGTTCACC GGGCCGGCCG CGCCCACCCC GCCCGTGGGT GAGCCGGCCG CCGCGACCTC CTCCATTCCC GCGGACCAGC CGGCGCCGCT CGCCGCGAGC CGGACCGTCA CGATCCGCTG CCTGGGCGGG TTCAGCGTCG ACATCGACGG TCAGAGCGTC GATCTGGCTC CGCTTCGGCC GCGGGCCCGC GCGCTGCTGC GGCTGCTGGC GATGACGCCG AACCGGGACG TCCACCGCGA GCATCTGGTC GACGCGCTGT GGCCGGGCAC GGATCTGACC GTCGGCACCC GGCGGTTGCA GGTGGCCGTG TCCAGCGTGC GACAGCTGCT GGAACAGCGT GGCCTGCCCG GCGGCGAGGT GGTGCTGCGC CACGGTGACG CCTACCGGCT GGCCCTGCCC CCGGGGTCGG TCGTGGACAC GGACGCCTTC GAGCGGGGCG TGCGGGACGC CGAGACCGCC GCCGCCCGTG GTGATGTCAC CGCCGCCGCG GCGCTGCGCG GCTCGGCCCT GTCCTGGTAC CGGGGCGACC TGCTGCCCGA GGACGGTCCG GCCGAGCACG TGGTCGGCGA ACGCGACCGG CTCCGGCTGC TGGCCGCGAC CACGGCCGGC ACCCTGGCCC AGGACTACCG GACGCTGGGG CAGCTGCGGC AGGCGGTGGC CGCCGCCCGG CAGTCCGTGC AACTCGACCG CTATCAGGAC CTGGCCTGGG AGCTGCTGGC CGACCTGCAC CGCGACGCCG GCGACGACAG CGCCGCCGCC CGCACCCGGC GCGAGCACGC CGCCGCCCAG GCCGAACTGG AGCTGGATTC CGTGCGGCCC TGA
|
Protein sequence | MTRPVVLTEK LRRPEPVGLA RRRLEEQLLA PSTSRLDLVV APPGSGKTTL LARVAAAAES AGSAVGWYRV TADDATESAF VAHLARALHL DTAADESSVT DMLELLDRSP TAPTLVVFDD VHEIAGSVAE RALDQFVQLR PPHLRVLVGC RRPPELNIPR LRVSGQLREI TSDDLRFRSW EVEELFLSVF REPLSPESAA ALTRRTGGWA AGLQLFHLAT SGRSAVDRGR AVDDLGGRSK LVRSYLARNV LAELPEERRH FLLRTCTLGT LTGPLCDALL GVTGSSTILD ELEHRQLFTS SDDDGHTFRY HEVLRTHLEW ALVQEYGAAG AREWYSRSAA LLESVGDQRA AVRAYARAED WGAVGRLLQT RGTGPDAGAA GTDLLLPPAV VEHDPWLSLA QARRRVREGA LTAAVDRFRR AESLLDEPDF RDDCQRERSV AAMWASTDTT GMSWAVDGRG SARHWSIPIR LATRRLGPAR VAAPALTGST PAPHELARGI ISLLAGDFRD ARRTLDAVVA DRAAESTHRL LADLTIAVID LIRDRPGDPA SRLGQIALDA EVAGLPWIAR LARGLGECVL AATDPAAWRL AACVDQIDEC DEAGDAWGAA LLTLAAAVTG LLADPVAEAH GFADAAARFR RLDAPVPALW AQALQACALA GRRRPGAAEL AGRVATEARA AQVTGAQALA QIAAAMAGAP VGTAPEATGS ELDRMIALFT GPAAPTPPVG EPAAATSSIP ADQPAPLAAS RTVTIRCLGG FSVDIDGQSV DLAPLRPRAR ALLRLLAMTP NRDVHREHLV DALWPGTDLT VGTRRLQVAV SSVRQLLEQR GLPGGEVVLR HGDAYRLALP PGSVVDTDAF ERGVRDAETA AARGDVTAAA ALRGSALSWY RGDLLPEDGP AEHVVGERDR LRLLAATTAG TLAQDYRTLG QLRQAVAAAR QSVQLDRYQD LAWELLADLH RDAGDDSAAA RTRREHAAAQ AELELDSVRP
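The sequence fields above are printed in space-separated blocks of ten characters and may wrap across lines, so they need cleaning before any further use. A minimal sketch follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first three blocks of the gene sequence are used here, for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence from the record above.
prefix = clean_sequence("GTGACCAGAC CCGTGGTGCT GACCGAGAAG")
print(len(prefix), round(gc_fraction(prefix), 3))
```

Applying the same two functions to the full Gene sequence field would give a value to compare against the GC content reported in the record. The short prefix is not expected to match that figure.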