Gene Namu_0688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0688 
Symbol 
ID8446274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp755961 
End bp757196 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content74% 
IMG OID645039822 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003200091 
Protein GI258650935 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCAGCG GGGAAGGCGC GGAGGTGACG CTGCGCGACA TCGCGCCCGC CGCGTTCCTG 
CCGCCCGCGG TGTTCGCCGT CGGCCAGGGC GCGATCGCGC CGGTCATCGT CATCTCGGCG
ACCGACCTGG GCGCCAGTCC GGCCATGGCC GCGCTGGTGG TGGCGCTGGC CGGCCTGGGC
CAGGTCTGCG CGGACATCCC GGCCGGGTCG CTGGTGCACC GGTTCGGCGA GCGACCGACG
ATGATCGGCG CCGCCCTGCT GACCGTGCTC GCCCTCGTCG GCTGCATGGT CGCCGGCCAG
CTGATCCTGT TCGCCGCGGC GATCTTCGTC ACCGGTGCGT GCACCGCGGT CTGGCTGCTG
GCCCGGCAGG CCTTCGTCAC CGAGGTGGTG CCCTACCGGC TGCGGGCCCG GGCGATGTCC
ACCCTCGGCG GGGTGTTCCG GATCGGACTG TTTATCGGCC CGTTCGTCGG CTCGGCCGCG
GTGCACGCGG TCGGCCTGTG GGGCGCCTAC GCCGTGCACG TGGTCGCCGC CCTGGTCGCC
GCCGGCACCC TGCTGGTCGT GGGTGACCCG AAGGTCGACC CGGCCGGCCC GGGCGGGACG
GCCCAGCCCC GCGCGAGGTT CCCGGCCGTC CTGAGCGAAC AGCGGTCGGT GTTCGCCACG
CTGGGTGTGG GGGTGCTGCT GGTCAGCGCG ATCCGCGCGG TGCGGCAGGT CGCGTTGCCG
CTGTGGGGTC AGGAACTCGG GCTGTCCCCG GCGGCCATCT CGCTGATCTT CGGGGTGTCC
GGCGCCATCG ACATGCTGCT GTTCTACCCG GCGGGCAAGG TGATGGACCG GTTCGGGCGG
ATGTGGGTGG CCGTTCCGGC GATGACGGTC CTCGGGCTGT CACTGATCGT CCTGCCGCTG
ACCGACACCG CCACCGGCCT GATGGTGGTC GGCCTGATCA TGGGTATCGG CAACGGGATG
AGCGCCGGCC TGGTGATGAC GCTGGGCGCC GATCTCGCCC CGCCCGGGCA GCGGCCCGTC
TTCCTGGGCA TCTGGCGGGT GTTCTCCGAC TCCGGGAACG GAGCCGGACC CTTCGTCATC
GCCGGGGTGA CCGCGCTGGC CTCGCTCGGG GCGGGCATCG TCGCGATGGG CATCGTCGGG
CTGCTGGGCG GGGGCTGGCT CGGCTACTGG ATCCCGCGCC GGGTGCCGCC ACCCACCCCG
CGGGACAAAG TCGTGGCCGA CGGCGGGCCC CGTTGA
 
Protein sequence
MPSGEGAEVT LRDIAPAAFL PPAVFAVGQG AIAPVIVISA TDLGASPAMA ALVVALAGLG 
QVCADIPAGS LVHRFGERPT MIGAALLTVL ALVGCMVAGQ LILFAAAIFV TGACTAVWLL
ARQAFVTEVV PYRLRARAMS TLGGVFRIGL FIGPFVGSAA VHAVGLWGAY AVHVVAALVA
AGTLLVVGDP KVDPAGPGGT AQPRARFPAV LSEQRSVFAT LGVGVLLVSA IRAVRQVALP
LWGQELGLSP AAISLIFGVS GAIDMLLFYP AGKVMDRFGR MWVAVPAMTV LGLSLIVLPL
TDTATGLMVV GLIMGIGNGM SAGLVMTLGA DLAPPGQRPV FLGIWRVFSD SGNGAGPFVI
AGVTALASLG AGIVAMGIVG LLGGGWLGYW IPRRVPPPTP RDKVVADGGP R