Gene Namu_4290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4290 
Symbol 
ID8449916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4772292 
End bp4773698 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content69% 
IMG OID645043338 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003203567 
Protein GI258654411 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.720009 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA TCGCACCCCG GGTCGGCAAG CCCGGCGAAG TGGCGCACGC CGATCCCAAG 
AACGTCCGGC GGGCCGCCTG GGCCGGCCTG GTCGGCACCG CCCTGGAGCA GTACGACTTC
ATCATCTACG GCACCGCCTC CGCGCTGATC TTCAGCAAGC TGTTCTTCCC GAGCATCTCG
CCGGTCGCCG GGATGATCGC CGCGTTCTCG GCCTACGCGA TCGGCTTCCT GGCCCGCCCG
CTGGGCGGCC TGTTCTTCTC CCACTTCGGG GAGCGGTACG GCCGCAAGTG GGTGCTGGTC
AGCACCCTGT TCCTGATGGG CGCGGCCACC TTCCTGATCG GCTGCCTGCC CACCTACGAG
ACGGCCGGGG TGCTGGCCCC GATCCTGCTG GTGCTGCTGC GCTTCCTGCA GGGCTTCGGC
GCCGGCGCCG AGCAGGCCGG TGGCGCCACC CTGCTCACCG AGACCGCACC GCTGGGCAAG
CGCGGCCGGC TGGCCTCGTT CGTCATGGTC GGCGCCGCAT TCGGCACCGT GCTGGGCGCC
CTGGCCTGGG TGCTCGCGCA GCTGCTGCCG GACGACGTCC TGCTGTCCTG GGGCTGGCGA
ATGATCTTCT GGGCCAGCCT GTTCGTCACC GTGGGCGCCT GGATCATCCG GATGAAGATG
GCCGAGAGCC CGATCTTCGT CGAGCTGAAG AAGTCGGTCG ACGTCGAGCA CGCGGCCCCG
CTGAAGGAGG TCGCCAAGCA CGGCACCAAG AACGTGCTCA AGGTCATCTT CATGAACTGG
GGCATCAGCA CGCAGTCCTA CACCTACCAG GTCTTCATGG CCTCCTACCT GATCACCTTC
GTCGGGGTGG ACAAGCATTT CGTGCCCAAC GTGCTGCTCT ACGGCGCGCT GTTCGGCTCG
GCCGCGGCCT ACCTGATGGG TCTGCTGTCG GACCGGTTCG GCCGCCGGCG GATGTTCCTG
GTGCTGGCCG GCGCGGCCAT CCTGATCCAG TTCCCGGCGT TCATGGCGGT CAACACCGGC
TCGCACTTCT GGATCATCGT GGTGATGGCG CTGGGCTTCA TCACGGCCGC CCAGGGCATC
ACCGCGGTCA CCATGAGCTT CTTCCCGGAG ATGTTCGGCG CCCGCTACCG CTACGCCGGG
GTCACCCTGG GCCGCGAGTT CTCCTCGATC ATCGGCGGCG GCATCGCCCC GTTGGTCGCC
GCCGGCCTGA TGGCCTGGTT CTTCAACTCC TGGATCCCGG TCGCCGGCTA CATGGTGCTG
ACCATGGTGG TCAGCTTCCT GGTCGCCCGC ACCGTCCCCG AGACGGTCAA CCGCGACCTG
CAGATCCTGA CCGACGCCCG GCCCGGCGAG GCCCGCCCGG GCCTGACCGC GGCGAACGAC
GCGGCCGCCA GCCGGGTCGC CGCCTGA
 
Protein sequence
MTEIAPRVGK PGEVAHADPK NVRRAAWAGL VGTALEQYDF IIYGTASALI FSKLFFPSIS 
PVAGMIAAFS AYAIGFLARP LGGLFFSHFG ERYGRKWVLV STLFLMGAAT FLIGCLPTYE
TAGVLAPILL VLLRFLQGFG AGAEQAGGAT LLTETAPLGK RGRLASFVMV GAAFGTVLGA
LAWVLAQLLP DDVLLSWGWR MIFWASLFVT VGAWIIRMKM AESPIFVELK KSVDVEHAAP
LKEVAKHGTK NVLKVIFMNW GISTQSYTYQ VFMASYLITF VGVDKHFVPN VLLYGALFGS
AAAYLMGLLS DRFGRRRMFL VLAGAAILIQ FPAFMAVNTG SHFWIIVVMA LGFITAAQGI
TAVTMSFFPE MFGARYRYAG VTLGREFSSI IGGGIAPLVA AGLMAWFFNS WIPVAGYMVL
TMVVSFLVAR TVPETVNRDL QILTDARPGE ARPGLTAAND AAASRVAA