Gene Namu_4898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4898 
Symbol 
ID8450528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5464315 
End bp5465721 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content72% 
IMG OID645043936 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003204161 
Protein GI258655005 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.724169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCATTC CCGTCGACCA ATCCCGCGCG CCCGGCTGGC GCACCGAGAT CACCCGGGTC 
CAGTGGCTGG TGCTGCTGGG CACCACGCTG GGCTGGGCCC TGGACGGCTT CGCCGGCAGC
CTGTACGCGC TGGTGCTCGG CCCGGCGATG ACCGAGCTGC TGCCCAACAG CGGCATCACC
CCGGCCCCGG CGTCCATCGG CCTGTACGGC GGCCTGACCG TCGCCCTGTT CCTGGCCGGG
TGGGCCACCG GCGGCATCCT GTTCGGCGTG CTGGCCGACT ACTTCGGTCG CACCAAGGTG
CTCTCCATCG GCATCCTGAC CTACGCCGTG TTCACCGCGG CCGCCGCCTT CGCCGACACC
TGGTGGCAGC TGGGCATTCT CCGGTTCATC GCCGGCCTGG GCTCGGGGGT GGAAGCCCCG
GTGGGCGCCG CCCTGGTCGC CGAGGTCTGG CGAAACCGCT ACCGGGCCAA GGCCTGCGGC
GTCATGATGT CCGGCTACGC GGCCGGCTTC TTCATCGCCG CCCTGGCCTA CGCGGTCCTG
GGCAGCCACG GCTGGCGGAT CATGCTGGGG CTGGCCGTCA TCCCGGCGGT GCTGGTCTGG
TTCATCCGCC GCTACGTGCC CGAGCCGGCG GAGATCACCT CGGCGATCAG CGCCCGCCGG
CGGCGCCGGG AGGCCGGCAA GCGTGACGAG CAGGACCGTT TCGTCCTCGG CCGGCTGGTC
CGCCCGCCGC TGCTGCGCAA CACCCTGATC TGCACGGCCC TGGCCACCGG TTCGCTAATC
GCGTTCTGGA GCGTGTCCAC CTGGTACCCG CAGATCATCC GGCTGGCCAC CGCGGCCGAG
TCGCTGCCGG TGGACGTCGG CAACAGCCGG GTCGCCCTGG CCTCCATGCT GTTCAACGCG
GGCGGCGTCG CCGGTTACGC CTCCTGGGGC TTCCTGGCCG ACGCGATCGG CCGGCGCAAG
GCCTTCGCCA TCAGCTTCGC GGTGTCCGCG GTCAGCATCG CGTTCCTGTT CCCGTTCGAG
CACAGCTTCA CCACGTTCCT GGTGATGATG CCGGTGCTGG GCTTCGGCCT GTTCGGCGCG
CTGTCCGGAA CCTTCGTCTA CGGTCCCGAG ATCTTCCCGC CGAGCGTGCG GGCCACCGGC
ATGGCCCTGG CCAACAGCGT CGGCCGCTAC ATCACCGCGG CCGGCCCGCT GATCGCCGGC
GTCATCGCCG CCAGCTGGTT CGGCGGCGAC CTGGGCCTGG CCACCACCTG CGTGGCCGCA
TTCGGGCTGA TCGCCCTGGT CGGCCTGGCC TTCGCGCCGG AGACCAAGGG CGCCGCGCTG
CCCACCGATC CCGGCGTCAC CCTCCCCCCG CCCGCCGCAC CTGCCCCCGT CCAGGCGGCC
GCCACCACCC AGGAGCACAC GTCATGA
 
Protein sequence
MSIPVDQSRA PGWRTEITRV QWLVLLGTTL GWALDGFAGS LYALVLGPAM TELLPNSGIT 
PAPASIGLYG GLTVALFLAG WATGGILFGV LADYFGRTKV LSIGILTYAV FTAAAAFADT
WWQLGILRFI AGLGSGVEAP VGAALVAEVW RNRYRAKACG VMMSGYAAGF FIAALAYAVL
GSHGWRIMLG LAVIPAVLVW FIRRYVPEPA EITSAISARR RRREAGKRDE QDRFVLGRLV
RPPLLRNTLI CTALATGSLI AFWSVSTWYP QIIRLATAAE SLPVDVGNSR VALASMLFNA
GGVAGYASWG FLADAIGRRK AFAISFAVSA VSIAFLFPFE HSFTTFLVMM PVLGFGLFGA
LSGTFVYGPE IFPPSVRATG MALANSVGRY ITAAGPLIAG VIAASWFGGD LGLATTCVAA
FGLIALVGLA FAPETKGAAL PTDPGVTLPP PAAPAPVQAA ATTQEHTS