Gene Namu_4783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4783 
Symbol 
ID8450413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5318791 
End bp5320215 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content70% 
IMG OID645043823 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003204048 
Protein GI258654892 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGC AACCCTCGTC AGCAGTCCGC GGCAGTGACG ATCCCGAGTA CGCCGCGAAC 
CTCAAACGCG CCACCCTGGC GGCCTCCGTG GGCTCGGCCC TGGAGTACTA CGACTTCGCC
CTCTACTCGC TGGCCTCGGC CCTGATCTTC GGGAAGATCT TCTTCCCGGG CCTGGGCGAC
GCCGCCGGCA CCGTGGCCAG CCTGGCCACC CTGGCCATCG GCTTCCTGGC CCGACCGATC
GGCGGGCTGT TCTTCGGCAC GCTGGGCGAC AAGCTGGGGC GCAAGTGGGT CCTGATGATC
ACCATCGCGC TGATGGGCGG CTCCAGCTTC CTCATCGGCG TGCTGCCGAC CGCCGACCAG
ATCGGCGTCT GGGCGCCGCT GCTGCTGGTC TTCCTGCGCA TCTGCCAGGG CTTCGGAGCC
GGTGCCGAGC AAGCCGGGGC GACCGTGCTG ATGGCCGAGT ACTCCCCGGT CCGGCGGCGC
GGCTTCTTCT CGGCCCTGCC GTTCGTCGGC ATCATGCTCG GCACCCTGCT GGCCTCCGTG
GTTTTCGTCG GCCTGGGCCA GGTGCCCCAG GAGGCGCTGC TGAGCTGGGT CTGGCGGATC
CCGTTCCTGG CCTCGATCCT GCTGATCGGC GTGGCCGTGC TCATCCGGCT CCGGCTGCAG
GAGAGCCCCA CCTTCGTCAA GCTCGAGAAG CAGGAACAGG TCGCGCACAA CCCGCTGGGC
GAGGTGTTCC GGCGCTCGAC CCCGAGCGTG CTGCGCGGCA TCGGCCTGCG CATGGCCGAG
AACGGCGGCT CGTACATCTA TCAGACCCTG GCCATCACCT ACGTCAGCAA GCTCGGCGTG
CAGACCTCGG TCGGGCCGCT GGCCGTGGCC GTCGGCGCGA TCCTCGGCCT GGTCACCATC
CCGGTGTCCG GAGCCCTGTC GGACAAGTAC GGCCGGATGC GGATCTACCG GATCGGCGCC
CTGGTGCAGC TGGCCCTGGC CCTGGTCGCC TGGCCGTTGC TGTCCACCGG CAACGCGGTG
GTGACCGTCG TCGTCATCGC CATCTCCTAC GGCGTGGGCG TCAACATCAT GCTGGGCGCC
CAGTGCGCCG CCCTGCCCGA ACTGTTCGGA TCACGGCACC GCTACATCGG CGTGGCGGTG
GCCCGCGAGT TCAGCGCGAT CATCGCCGGC GGCATCGCGC CGTTCGTCGG GGCCCTGCTG
CTCGGCTGGT TCGCCAACTC GTGGATCCCG CTGGCCGTGT ACGTGATCGT GCTGACCATG
ATCACCCTGG TCACCACCTT CTTCACCCCG GAGACCCGGG GCCGGGACCT GACCCTGCTC
GGAGACGCGC TCGCCGACAG CAGCCAGGAG ATCGCCGCCC GTCCCGCGCC GCAGACCTCC
TCCTTCCGCG ACCACCACGA CACCGAGCGC CCGGTCGCGG TCTGA
 
Protein sequence
MSEQPSSAVR GSDDPEYAAN LKRATLAASV GSALEYYDFA LYSLASALIF GKIFFPGLGD 
AAGTVASLAT LAIGFLARPI GGLFFGTLGD KLGRKWVLMI TIALMGGSSF LIGVLPTADQ
IGVWAPLLLV FLRICQGFGA GAEQAGATVL MAEYSPVRRR GFFSALPFVG IMLGTLLASV
VFVGLGQVPQ EALLSWVWRI PFLASILLIG VAVLIRLRLQ ESPTFVKLEK QEQVAHNPLG
EVFRRSTPSV LRGIGLRMAE NGGSYIYQTL AITYVSKLGV QTSVGPLAVA VGAILGLVTI
PVSGALSDKY GRMRIYRIGA LVQLALALVA WPLLSTGNAV VTVVVIAISY GVGVNIMLGA
QCAALPELFG SRHRYIGVAV AREFSAIIAG GIAPFVGALL LGWFANSWIP LAVYVIVLTM
ITLVTTFFTP ETRGRDLTLL GDALADSSQE IAARPAPQTS SFRDHHDTER PVAV