Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4693 |
Symbol | |
ID | 8450323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 5216783 |
End bp | 5219614 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 645043733 |
Product | transcriptional regulator, winged helix family |
Protein accession | YP_003203958 |
Protein GI | 258654802 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.493374 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGATTC GGGATCTCGG CCCACTCGTC GTGGAGCGAG ACGGATTGCC GGTCGGCCTC GGCGCCGGCC GGCTGGCGGC GGCGCTGAGC CCGCTGGCCA ACCGGGTCGG CGAGGTGGTC GGGACCACGG CGCTGGTCGA GGCGGTGTGG GGTCCCCACG CCCCACCACG GGCGCCGCAG CTGCTCGAAT CGGTGATCTG GCGGCTGCGC AAGGAGCTCG AACCGGGCCG GGCCGCCCGG GCCGCGCCGG TGCTGCTGCG CCGCGAAACC CTGGGGTATC GGCTGGATCT GCCGCCCGAC GCGGTCGACT CCACCGAGCT GCGGACGGCC GCCCCGCAGA TCCGCGGGTG GGCCGCGGAC GGTCGATCGG AGCAGGTGCT GGAGCGCTCG GCCGGGGTGC TGAGCCGGTG GCGGGGCGAG CCCTACGCCG ACCTGACCGA CCACGGCTGG CTCGGGCCGG CCCGTCAGCA GCTGGTCGAC GCCCGCATCG ACATCGCCGA GCTGCGGGTG CAGGCCCTGC TCGACCTCGG CCGCCCGGAG GACGCGATCG GCGAGCTCGA CCCGCTGCTG GCCGAGCATC CGCTGCGCGA ACGACTGTGG GCCCAGCGCA TCGCCGGCCT GTACCGGGCC GGCCGGCCGG CCGACGCGTT GGCCGACTTC ACCCGGGCCC GCGCGGTGCT GGCCGACGAA CTGGGCATCG ATCCGGGCCG GGAGTTGCGC GAGCTGCACC GGCGCATCCT CGAGCAGGAC CCCGCGCTGG ACCTGCGGCC GGCCACCCGA CCCACCGTCG TCGTGTCCGA CCTGCCACGC GGCCGCACGT CGCTGATCGG GCGCGACGAC GACCTGGCCA CCCTCACCGG GGAGCTGGGC CAGGTGCGGC TGGTCACCCT GGCCGGGCCG GGTGGCGCCG GCAAGACGCG ACTGGCCGTG GAGGTCGGCC ATGCGGCCAC CCGATTCCCC GACACCCGAT TCCCCGACGG TGTGCACTTC GTCGACCTGG CTCCGGTCCG CGACCGGGAC CTGCTCGTCG TCGCGATCGC CGGCACCCTG GAACCGGCCG GGCAACCCGG TCGGCGGCCG ATCGAGGTCG TGACCGCCCG GCTCGCCGAC GCGGACGCCC TGCTGATCCT GGACAACTGC GAGCAGCTGA TCGACGCGTG CGCCGAGGTG GTCCCGGAGA TCCTCGACCG CTGTCCGCGG GTCCGGGTAC TGGCCACCAG CCGCGAGCCC CTGGAGCTGC CCGGCGAGTA CGTGCACCGG CTGGGCCCGC TGCCGGTGGC GCCGGCCGGC GCCGTTCCCG GCCCGGCCCA GGAACTGTTC CTGGCCCGCA CCGGCGGCAC CACCGGGCCC GACCCGGCCG ATCCGGACGG CGAGCTGGTC CGCCGGATCT GCCTCGCGGT CGGCGGTCTG CCGTTGGGGA TCGAGCTCGC CGCCGCCCAG GCCGACACGT TCGCACTGAG CGAGATCGTC GAGGCGCTGG AACACAATCC GGCCGAGTTG GCCCGCCGAG GCACCGGACC CCCGCGTCAG GCCTCGCTGC GGGAGACCGT CGACTGGGGC TACCGGCTGG CCCGTCACGA CGAGCAGGTC CTGCACCGCC GGCTGGCGGT CATTCCCGGA CCGTTCACCC TGGACGTGGC CACCGCCCTG TGCGACCTGG CCCCGCTCCG GGCCGATCGG GCCATGAGCC TGGTCGGTGG CCTGGTGCAC CGGTCACTGC TGGTCGCCGC CCGCCCGACC CGCGGAGCGT CCTCGTTCCA CCAACTGGCC CCGATCCGGG CCCATGCCGC GTCCGTGCTC GACGACGCCG AGCGCGCGGC CATCGAGGCG GTCCGCGACC GGTGGCTCAA CGGACGGATC GCCGCCGCAC CGGTCGACGG GGCGGGCCAG GCGGCGTTCC TGGACTGGCT GGAGGGCAAC GCCGCCACGC TGCGGGCCAG CCTGGATTCG ACCCTGCACC GCGGCGGGGA TCGCACCGCC CCGTCCATGG TGCTGGCCCT GCTCGGCGGT TGGTTCGAAC GGGGCCGGCT GACCGAGGCC GCGCACTGGG TCGAGCGGCT GCGGGCCCGG CCGCGCGGGC GGCACCCCCT TGACGACGCC CTGGTCGACG TGGCCGCCGG CGCCGTGCTG GCCCTGGAGC ATCACCGGGA CCCGGCGGCC GAGCTGCTGC GCTCCGCGCT GCCCCGGCTC GAGTCCGCCC CGGCCGACTC GACCGCACAG GTGGCCTCGG CGCTGCGGAT CGGGGCGGTG GCCGCGTGGA CCGGCGATCT GTGGGACATT GCCGCCGACT ACCAGGACGC GGGATTGCGG TTCGGCCACG TGGCCGGCCT CCCCCACCTG GAGCTGGCCT GCCGGGCCAT CCGCGCCGCC AACTGGTCCT TCGCGGGCGA CCGGGCCGCC GGGATCGCCG AAGCCGGCGA GGTGCTCCAG ACGGCCAAGA CGACGGGCAA CGACCTGGCC GCGCTGTTCG CGCTCGTCGC CCTGACGGTG ACGGCACTGA CCCAGGGTGA GCCGCAGGTC GCCCTGGGCT ATTCCGACCA GCTGCTGCTG ACCCATCGCC GGATGGGCAC CCTGGCGGTC AGCGACACCA TCGAGACCCG GGCCTCGATC CGCCTCGGCG CCGGTGACCT CCCGGCGGCC GTCCGCTGCC TGGGGGCCTC GGCCGGCCTG AACCGGCGGT TGGGTCGGGA CTGGCCCTGG CACGAGTTCA CCCCCGCGGT GCTCGACGAG CTGCGGCAGC GGCTGGAGCC GGCCGAGTTC GACCGGCACT GGGCCAGCGG GGAACGGCTC GGGCGGGGCG ACCCGGAGCG CTTCACGCCG GACTGGATCT GA
|
Protein sequence | MQIRDLGPLV VERDGLPVGL GAGRLAAALS PLANRVGEVV GTTALVEAVW GPHAPPRAPQ LLESVIWRLR KELEPGRAAR AAPVLLRRET LGYRLDLPPD AVDSTELRTA APQIRGWAAD GRSEQVLERS AGVLSRWRGE PYADLTDHGW LGPARQQLVD ARIDIAELRV QALLDLGRPE DAIGELDPLL AEHPLRERLW AQRIAGLYRA GRPADALADF TRARAVLADE LGIDPGRELR ELHRRILEQD PALDLRPATR PTVVVSDLPR GRTSLIGRDD DLATLTGELG QVRLVTLAGP GGAGKTRLAV EVGHAATRFP DTRFPDGVHF VDLAPVRDRD LLVVAIAGTL EPAGQPGRRP IEVVTARLAD ADALLILDNC EQLIDACAEV VPEILDRCPR VRVLATSREP LELPGEYVHR LGPLPVAPAG AVPGPAQELF LARTGGTTGP DPADPDGELV RRICLAVGGL PLGIELAAAQ ADTFALSEIV EALEHNPAEL ARRGTGPPRQ ASLRETVDWG YRLARHDEQV LHRRLAVIPG PFTLDVATAL CDLAPLRADR AMSLVGGLVH RSLLVAARPT RGASSFHQLA PIRAHAASVL DDAERAAIEA VRDRWLNGRI AAAPVDGAGQ AAFLDWLEGN AATLRASLDS TLHRGGDRTA PSMVLALLGG WFERGRLTEA AHWVERLRAR PRGRHPLDDA LVDVAAGAVL ALEHHRDPAA ELLRSALPRL ESAPADSTAQ VASALRIGAV AAWTGDLWDI AADYQDAGLR FGHVAGLPHL ELACRAIRAA NWSFAGDRAA GIAEAGEVLQ TAKTTGNDLA ALFALVALTV TALTQGEPQV ALGYSDQLLL THRRMGTLAV SDTIETRASI RLGAGDLPAA VRCLGASAGL NRRLGRDWPW HEFTPAVLDE LRQRLEPAEF DRHWASGERL GRGDPERFTP DWI
|
| |