Gene Namu_4693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4693 
Symbol 
ID8450323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5216783 
End bp5219614 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content76% 
IMG OID645043733 
Producttranscriptional regulator, winged helix family 
Protein accessionYP_003203958 
Protein GI258654802 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.493374 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATTC GGGATCTCGG CCCACTCGTC GTGGAGCGAG ACGGATTGCC GGTCGGCCTC 
GGCGCCGGCC GGCTGGCGGC GGCGCTGAGC CCGCTGGCCA ACCGGGTCGG CGAGGTGGTC
GGGACCACGG CGCTGGTCGA GGCGGTGTGG GGTCCCCACG CCCCACCACG GGCGCCGCAG
CTGCTCGAAT CGGTGATCTG GCGGCTGCGC AAGGAGCTCG AACCGGGCCG GGCCGCCCGG
GCCGCGCCGG TGCTGCTGCG CCGCGAAACC CTGGGGTATC GGCTGGATCT GCCGCCCGAC
GCGGTCGACT CCACCGAGCT GCGGACGGCC GCCCCGCAGA TCCGCGGGTG GGCCGCGGAC
GGTCGATCGG AGCAGGTGCT GGAGCGCTCG GCCGGGGTGC TGAGCCGGTG GCGGGGCGAG
CCCTACGCCG ACCTGACCGA CCACGGCTGG CTCGGGCCGG CCCGTCAGCA GCTGGTCGAC
GCCCGCATCG ACATCGCCGA GCTGCGGGTG CAGGCCCTGC TCGACCTCGG CCGCCCGGAG
GACGCGATCG GCGAGCTCGA CCCGCTGCTG GCCGAGCATC CGCTGCGCGA ACGACTGTGG
GCCCAGCGCA TCGCCGGCCT GTACCGGGCC GGCCGGCCGG CCGACGCGTT GGCCGACTTC
ACCCGGGCCC GCGCGGTGCT GGCCGACGAA CTGGGCATCG ATCCGGGCCG GGAGTTGCGC
GAGCTGCACC GGCGCATCCT CGAGCAGGAC CCCGCGCTGG ACCTGCGGCC GGCCACCCGA
CCCACCGTCG TCGTGTCCGA CCTGCCACGC GGCCGCACGT CGCTGATCGG GCGCGACGAC
GACCTGGCCA CCCTCACCGG GGAGCTGGGC CAGGTGCGGC TGGTCACCCT GGCCGGGCCG
GGTGGCGCCG GCAAGACGCG ACTGGCCGTG GAGGTCGGCC ATGCGGCCAC CCGATTCCCC
GACACCCGAT TCCCCGACGG TGTGCACTTC GTCGACCTGG CTCCGGTCCG CGACCGGGAC
CTGCTCGTCG TCGCGATCGC CGGCACCCTG GAACCGGCCG GGCAACCCGG TCGGCGGCCG
ATCGAGGTCG TGACCGCCCG GCTCGCCGAC GCGGACGCCC TGCTGATCCT GGACAACTGC
GAGCAGCTGA TCGACGCGTG CGCCGAGGTG GTCCCGGAGA TCCTCGACCG CTGTCCGCGG
GTCCGGGTAC TGGCCACCAG CCGCGAGCCC CTGGAGCTGC CCGGCGAGTA CGTGCACCGG
CTGGGCCCGC TGCCGGTGGC GCCGGCCGGC GCCGTTCCCG GCCCGGCCCA GGAACTGTTC
CTGGCCCGCA CCGGCGGCAC CACCGGGCCC GACCCGGCCG ATCCGGACGG CGAGCTGGTC
CGCCGGATCT GCCTCGCGGT CGGCGGTCTG CCGTTGGGGA TCGAGCTCGC CGCCGCCCAG
GCCGACACGT TCGCACTGAG CGAGATCGTC GAGGCGCTGG AACACAATCC GGCCGAGTTG
GCCCGCCGAG GCACCGGACC CCCGCGTCAG GCCTCGCTGC GGGAGACCGT CGACTGGGGC
TACCGGCTGG CCCGTCACGA CGAGCAGGTC CTGCACCGCC GGCTGGCGGT CATTCCCGGA
CCGTTCACCC TGGACGTGGC CACCGCCCTG TGCGACCTGG CCCCGCTCCG GGCCGATCGG
GCCATGAGCC TGGTCGGTGG CCTGGTGCAC CGGTCACTGC TGGTCGCCGC CCGCCCGACC
CGCGGAGCGT CCTCGTTCCA CCAACTGGCC CCGATCCGGG CCCATGCCGC GTCCGTGCTC
GACGACGCCG AGCGCGCGGC CATCGAGGCG GTCCGCGACC GGTGGCTCAA CGGACGGATC
GCCGCCGCAC CGGTCGACGG GGCGGGCCAG GCGGCGTTCC TGGACTGGCT GGAGGGCAAC
GCCGCCACGC TGCGGGCCAG CCTGGATTCG ACCCTGCACC GCGGCGGGGA TCGCACCGCC
CCGTCCATGG TGCTGGCCCT GCTCGGCGGT TGGTTCGAAC GGGGCCGGCT GACCGAGGCC
GCGCACTGGG TCGAGCGGCT GCGGGCCCGG CCGCGCGGGC GGCACCCCCT TGACGACGCC
CTGGTCGACG TGGCCGCCGG CGCCGTGCTG GCCCTGGAGC ATCACCGGGA CCCGGCGGCC
GAGCTGCTGC GCTCCGCGCT GCCCCGGCTC GAGTCCGCCC CGGCCGACTC GACCGCACAG
GTGGCCTCGG CGCTGCGGAT CGGGGCGGTG GCCGCGTGGA CCGGCGATCT GTGGGACATT
GCCGCCGACT ACCAGGACGC GGGATTGCGG TTCGGCCACG TGGCCGGCCT CCCCCACCTG
GAGCTGGCCT GCCGGGCCAT CCGCGCCGCC AACTGGTCCT TCGCGGGCGA CCGGGCCGCC
GGGATCGCCG AAGCCGGCGA GGTGCTCCAG ACGGCCAAGA CGACGGGCAA CGACCTGGCC
GCGCTGTTCG CGCTCGTCGC CCTGACGGTG ACGGCACTGA CCCAGGGTGA GCCGCAGGTC
GCCCTGGGCT ATTCCGACCA GCTGCTGCTG ACCCATCGCC GGATGGGCAC CCTGGCGGTC
AGCGACACCA TCGAGACCCG GGCCTCGATC CGCCTCGGCG CCGGTGACCT CCCGGCGGCC
GTCCGCTGCC TGGGGGCCTC GGCCGGCCTG AACCGGCGGT TGGGTCGGGA CTGGCCCTGG
CACGAGTTCA CCCCCGCGGT GCTCGACGAG CTGCGGCAGC GGCTGGAGCC GGCCGAGTTC
GACCGGCACT GGGCCAGCGG GGAACGGCTC GGGCGGGGCG ACCCGGAGCG CTTCACGCCG
GACTGGATCT GA
 
Protein sequence
MQIRDLGPLV VERDGLPVGL GAGRLAAALS PLANRVGEVV GTTALVEAVW GPHAPPRAPQ 
LLESVIWRLR KELEPGRAAR AAPVLLRRET LGYRLDLPPD AVDSTELRTA APQIRGWAAD
GRSEQVLERS AGVLSRWRGE PYADLTDHGW LGPARQQLVD ARIDIAELRV QALLDLGRPE
DAIGELDPLL AEHPLRERLW AQRIAGLYRA GRPADALADF TRARAVLADE LGIDPGRELR
ELHRRILEQD PALDLRPATR PTVVVSDLPR GRTSLIGRDD DLATLTGELG QVRLVTLAGP
GGAGKTRLAV EVGHAATRFP DTRFPDGVHF VDLAPVRDRD LLVVAIAGTL EPAGQPGRRP
IEVVTARLAD ADALLILDNC EQLIDACAEV VPEILDRCPR VRVLATSREP LELPGEYVHR
LGPLPVAPAG AVPGPAQELF LARTGGTTGP DPADPDGELV RRICLAVGGL PLGIELAAAQ
ADTFALSEIV EALEHNPAEL ARRGTGPPRQ ASLRETVDWG YRLARHDEQV LHRRLAVIPG
PFTLDVATAL CDLAPLRADR AMSLVGGLVH RSLLVAARPT RGASSFHQLA PIRAHAASVL
DDAERAAIEA VRDRWLNGRI AAAPVDGAGQ AAFLDWLEGN AATLRASLDS TLHRGGDRTA
PSMVLALLGG WFERGRLTEA AHWVERLRAR PRGRHPLDDA LVDVAAGAVL ALEHHRDPAA
ELLRSALPRL ESAPADSTAQ VASALRIGAV AAWTGDLWDI AADYQDAGLR FGHVAGLPHL
ELACRAIRAA NWSFAGDRAA GIAEAGEVLQ TAKTTGNDLA ALFALVALTV TALTQGEPQV
ALGYSDQLLL THRRMGTLAV SDTIETRASI RLGAGDLPAA VRCLGASAGL NRRLGRDWPW
HEFTPAVLDE LRQRLEPAEF DRHWASGERL GRGDPERFTP DWI