Gene Namu_2999 details


Gene Information

Locus tag: Namu_2999
Symbol:
ID: 8448612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3290713
End bp: 3292161
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 72%
IMG OID: 645042083
Product: major facilitator superfamily MFS_1
Protein accession: YP_003202325
Protein GI: 258653169
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
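
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon is not translated


length = gene_length_bp(3290713, 3292161)
print(length)                     # 1449
print(protein_length_aa(length))  # 482
```

Both values agree with the Gene Length and Protein Length entries of this record.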


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0222904
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000622856
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGACAGGTC GGCAGAACGG GCCCCCGCCT GCGGGCCGGT CCGGGCCGGG CTTCTCCCGG 
GCGTTGGCCG TCCTGGTGGC CGGCGCGTTC TTCATGGAGA ACCTGGACGC GACGATCATC
GCGCCGGCCG CGCCGTCGAT CGCCGCCGAC TTCGGCGTCA CCCCGGTGCA GATCAACGTC
GCGATGACGG CCTACCTGCT GACCGTGGCG ATCCTGATCC CGGCCAGCGG GTGGCTGGCC
GACCGGTTCG GCGCCCGGGC CGTCTTCTGC CTGGCCATCA CCATCTTCAC GATCGCCTCG
GTGGGCTGCG CCGCCGCGCC CACCCTGGGC GTGCTGACCG TGGCCCGGGT GTTCCAGGGC
ATCGGCGGGG CGATGATGGT GCCGGTCGGC CGGCTGGTCG TGCTGCGCAG CACCGACAAG
TCCGAGCTGA TCAAGGCGAT CGCCTACCTG ACCTGGCCGG CCCTGGTCGC GCCGGTGCTG
GCGCCACCGC TGGGCGGGCT GCTGTCGGAG TTCGCATCCT GGCACTGGAT CTTCCTGATC
AACGTGCCGC TCGGGCTGGT CGGGCTGTTC CTGGCCCTGC GGCTGGTGCC GGACGTGCGG
GCCGACCAAC CGCGCCGGTT GGACTGGCGG GGCTTCCTGC TCACCGCCGC CGGGGTGGCC
GCGCTGGTCA TCGGCTTGGA AGGCATCGGC ACCGGACAGA TCGGTACCGG GCCGAACATC
GCATTCGTCG CCATCGCGCT GGGCACCGCG CTCCTGGTGC TGGCCGCCGA CGTCGGCTAC
CTGCTGCGCG CCCGGCACCC GTTGCTGGAC CTGCGCACCC TGAAGATCCA CAGCTTGCGC
GCGGCGGTGG CCGGCGGCAC CGTCTTCCGG CTGGTGATCA GTGCGATCCC GTTCCTGCTG
CCACTGTTCT TCCAGGTCGG CTTCGGCTGG TCGGCCGCGC AGGCCGGCGG CATTGTCATC
GCCCTGTTCG CCGGCAACGT GGGCATCAAA CCGCTGACCA CGCCGCTGAT GCGGGCATTG
GGCATCCGCA CGGTGCTGCT GATCGCGCTG GCCCTGTCCA TCGCCTGCCT GCTGGCGATG
GGCCTGCTGC AGGCGACGAC GCCGCTGACC GTCATCGTCG TCGTGCTCGC GGTCAGCGGT
GTCTTCCGGT CGGTCGGTTT CTCGGCCTAC AACAGCGTGG CCTTCGCCGA CGTGCCGGCG
GACCGGATGA CCCACGCCAA CACGCTGCAC GCCACGCTGC AGGAACTCGG CGCGGGGCTG
GGCATCGCGG TGGGCGCGCT GCTGGTCCGG CTCGGCGACC CGGTCGGCCA GGCACTGGGC
CTGGCCGACG ATCCGGGCAC CCCGTACCGG GTGGCGTTCG TGCTGCTGGG GGCGATCTTG
CTGGTCCCGT TCGTCGAAGC GGTGCTGATG CCCGCCTCCG CCGGGGGCGC GGTGACCGGT
CGCCGTTGA
 
Protein sequence
MTGRQNGPPP AGRSGPGFSR ALAVLVAGAF FMENLDATII APAAPSIAAD FGVTPVQINV 
AMTAYLLTVA ILIPASGWLA DRFGARAVFC LAITIFTIAS VGCAAAPTLG VLTVARVFQG
IGGAMMVPVG RLVVLRSTDK SELIKAIAYL TWPALVAPVL APPLGGLLSE FASWHWIFLI
NVPLGLVGLF LALRLVPDVR ADQPRRLDWR GFLLTAAGVA ALVIGLEGIG TGQIGTGPNI
AFVAIALGTA LLVLAADVGY LLRARHPLLD LRTLKIHSLR AAVAGGTVFR LVISAIPFLL
PLFFQVGFGW SAAQAGGIVI ALFAGNVGIK PLTTPLMRAL GIRTVLLIAL ALSIACLLAM
GLLQATTPLT VIVVVLAVSG VFRSVGFSAY NSVAFADVPA DRMTHANTLH ATLQELGAGL
GIAVGALLVR LGDPVGQALG LADDPGTPYR VAFVLLGAIL LVPFVEAVLM PASAGGAVTG
RR
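
The gene sequence begins with GTG, yet the protein begins with M. This is what translation table 11 would be expected to give, since GTG can act as a start codon in bacteria and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the usual NCBI codon ordering (bases in the order T, C, A, G), and treats only ATG, GTG, and TTG as starts, which is a subset of the start codons allowed by table 11. The names `CODON_TABLE`, `START_CODONS`, and `translate` are illustrative.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G
# (the elongation assignments are shared by NCBI tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final 9 bases of the gene sequence above.
print(translate("GTGACAGGTC GGCAGAACGG GCCCCCGCCT"))  # MTGRQNGPPP
print(translate("CGCCGTTGA"))                         # RR
```

The two outputs match the first ten and last two residues of the protein sequence listed above.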