Gene Namu_4706 details


Gene Information

Locus tag: Namu_4706
Symbol:
ID: 8450336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5235226
End bp: 5236563
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 71%
IMG OID: 645043746
Product: major facilitator superfamily MFS_1
Protein accession: YP_003203971
Protein GI: 258654815
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
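
The length fields above follow from the coordinates. A minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 5235226
end_bp = 5236563

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1338
print(protein_length)  # 445
```

Both results agree with the Gene Length and Protein Length fields of the record.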


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.93667
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCCG GGCCGCCGAA CCCGGACCCC GCCCCCGACG TCGACGCGCG GACGGCGTCC 
GCCTCCGAAC CCGCTGCCGA CGCGGTGACC GATCGGCCGA ACCGGAACTG GCGACGGATC
GGACCGGACC TGCGGCCGCT GTCCATCCCG GCCTACCGGC GGCTGTTCCT GGGTCAGGTC
TTCACCGTGG TCGGGGCCAT GGTGACCACC GTCGCGGTGC AGCAGCAGGT GTTCGACCTG
ACCGGCAGCT CGGCCTGGGT CGGCATTGCC TCGCTGGTCG CATTGGTCCC GCTGGTCGTC
TTCGGGCTGC TCGGCGGCGC GATCGCGGAC ACCTACGACC GGCGCAAGCT GCTGATGGTG
ACCTCGGTCG GGATCGCGGT GACCAGCATC GGCCTGTGGC TGGCCGCGCT GATCGGCAAC
GAGTCGGTGT GGACGGTGTT CGCCCTGCTG GCCGTCCAGC AGGGCTTCTT CGCCGTGAAC
CAGCCGACCC GCAGCGCGAT CATCCCGCGG ATCGTGCCGG CCGAGCTGGT GCCGTCGGCC
AACGCGCTGG GCATGACCGT GTTCAGCATC GGCGTCATCG TCGGCCCCTT GCTGGTGGGC
ATGCTGATTC CGATCATCGG GGTGCCCTGG CTGTACTTCC TGGACGCGTT GACCCTGGTC
GGCATCCTGT ACGCGGTGAT CAAGCTGCCG CCGGTGCCGC CGCTGGGGGA GCGGCGCGGC
CGGGCCAAGG TGATCGACGG CCTGGCCTAC CTGCGGCTCA AGCCGCTGCT GCTGATGACC
TTCGTGGTGG ACATCATCGC GATGGTCTGC GGCATGCCCC GCGCGCTGTT CCCGCAGATG
GCCCAGGAGA CCTTCGGCGG CGCCGTCGGC GGAGGCTTCG AACTGGGCGT GCTCAACGCC
GCGCTGGCCG TCGGCGCGCT GATCGGTGGC CTGACCGGCG GCTGGATCCA CCGGGTGCAC
CGCCAGGGCA TTGCGATCAT CGCCGCCATC GTGGTGTGGG GCGCGTCGAT GGCCCTGTAC
GGCACCACCT CGATCCTCTG GCTGGCCGCG ATCTACTTGG CCGTCGGTGG CTGGGCCGAC
CTGGTCAGCG CGGTCTACCG ATCGACGATC CTGCAGGTCA ACGCCACCGA CGAGATGCGC
GGCCGGATGC AGGGGGTGTT CACCGTGGTG GTGGCCGGCG GGCCCCGGGT CGCCGACTTC
GTGCACGGGT TGGTCGCCGC GGCCACCTCG ACCACGTTCG CCGTGGTCGC GGGCGGGATC
GCGACCATCG TGCTGACCCT GCTGGCGGCC ACCCTCGGCC GGTCCCTGGT GCGCTACGAC
ACCCGCCGAC ATGACTGA
 
Protein sequence
MTAGPPNPDP APDVDARTAS ASEPAADAVT DRPNRNWRRI GPDLRPLSIP AYRRLFLGQV 
FTVVGAMVTT VAVQQQVFDL TGSSAWVGIA SLVALVPLVV FGLLGGAIAD TYDRRKLLMV
TSVGIAVTSI GLWLAALIGN ESVWTVFALL AVQQGFFAVN QPTRSAIIPR IVPAELVPSA
NALGMTVFSI GVIVGPLLVG MLIPIIGVPW LYFLDALTLV GILYAVIKLP PVPPLGERRG
RAKVIDGLAY LRLKPLLLMT FVVDIIAMVC GMPRALFPQM AQETFGGAVG GGFELGVLNA
ALAVGALIGG LTGGWIHRVH RQGIAIIAAI VVWGASMALY GTTSILWLAA IYLAVGGWAD
LVSAVYRSTI LQVNATDEMR GRMQGVFTVV VAGGPRVADF VHGLVAAATS TTFAVVAGGI
ATIVLTLLAA TLGRSLVRYD TRRHD
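
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the database's own pipeline. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments (it differs from the standard code in which codons may act as start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last sequence lines are used as input; the full sequence can be pasted in the same way, spaces and line breaks included.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 18 bases of the gene sequence above.
first_bases = "ATGACCGCCG GGCCGCCGAA CCCGGACCCC"
last_bases = "ACCCGCCGAC ATGACTGA"

print(translate(first_bases))  # MTAGPPNPDP
print(translate(last_bases))   # TRRHD
```

The two outputs match the first ten and last five residues of the protein sequence. Applying `gc_fraction` to the full gene sequence gives the value to compare with the GC content field of the record.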