Gene Namu_3786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3786 
Symbol 
ID8449405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4152344 
End bp4154026 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content70% 
IMG OID645042837 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003203073 
Protein GI258653917 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.398253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.51855 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCT CCGCGTCCAC CGGCGCCGCT GCGGCCACCG CTCGTTCCGG GGCCGGTCTC 
GTCCTGGCCG TGCTGGCGGC CAGCCAGTTC CTGATGACGC TGGACAGTTC GGTGATGAAC
GTGTCCATGC CCACCGTCGC CGCGGACCTG GGCACCACGA TCACCGGCAT CCAGACCGCG
ATCACCATGT ACACGCTGGT GATGGCCACC CTGATGATCA CCGGCGGCAA GCTGGGCACG
ATCATGGGCC GCCGCCGGGC CATGGGCATC GGCCTGGTCA TCTACGCGGC CGGCTCGTTC
ACCACCGGCA TCGCCCAGAA CCTGACCCAG CTGCTGATCG GCTGGTCGCT GCTGGAGGGC
ATCGGCGCGG CCCTGATCAT GCCGGCCATT GTGGCCCTGG TCGCCTCGAA CTTCCCGGCC
GACAAGCGTT CGGCCGCCTA CGGTCTGGTC GCCGCGGCGG GTGCGGCCGC GGTCGCCGTC
GGCCCGCTGA TCGGCGGCGC GGTCACCACC TACGCCTCGT GGCGGTACGT CTTCTTCGGC
GAGGTCGTGA TCGTCCTGCT GATCCTGGCC GTGCTGCGCA AGCTCAACGA CGTGGCCCCG
CAGCCCAGCC GGATCGACCT CTTCGGGTCG TTGCTGTCGG TGGTCGGCCT GGGTCTGTTC
GTCTTCGGCG TGCTGCGCTC GAGCGAATGG GGCTGGGTGA TCGCCAAGCC GGGCGCACCG
GCCCCGTTGA ACGTGTCGCT GACCATCTGG TGCATGGCCG GCGGGCTGCT GGTCGTCTAC
GGCTTCCTGC GCTGGGAGAG CCGGCTGGAG GCCGCGGCCA AGGACGACCC GACGGCGCTG
CAGCCGCTGC TGCGGCCCAG CATGCTGGCC AACCGGCAGC TGACCGGTGG CCTGGGCATG
TTCTTCTCCC AGTTCATGAT CCAGGCCGGG GCGTTCTTCG CGGTCCCGCT GTTCCTGTCC
GTGGTGCTGG GCCTGACCGC CATGCAGACC GGGGTGCGGC TGGTGCCGCT CTCGATCGCC
CTGCTGGTCA CGGCGATCGG GGTACCCAAG GTGTGGCCCA AGGCCAACCC GCGGCGGGTG
GTCCGGCTCG GGCTGCTGCT GATGATCATC GGGATCGGCT TCCTGGTCGC CGGCATGGAT
CCGGACGCCG ACGCCAGCGT GATGACGATC CCGATGATCC TGATGGGCCT AGGCCTGGGC
GCGCTGGCCT CCCAGCTCGG CGCGGTCACC GTGTCGGCGG TGTCCGAGAG CGAGACCGCC
GAGGTCGGTG GCCTGCAGAA CACCGCGACC AACCTGGGCG CCTCGATGGG CACCGCGCTG
ATCGGCTCAG TGCTCATCGC CACGCTGAGC AGTGCGGCGT TGACCGGGGT GGCCAGCAGC
CCGGAGATCA ACGACGCGCT CAAGTCGCAG ATCTCCACCG AACTGGTCGG CGGGGTGCCG
TTCGTCTCCG ACGAGCAGGT GGCGACCGCC CTGTCCGCGG CCGGGGTGAG CCAGTCCGAG
ATCGATGCCA TCACCGAGAT CAACGCGCAG GCCCGGCTGG AGGCGCTGCA GGTGGCCTTC
GCGCTGGTCG GCTTCATCGC CATCGGGGCG CTGTTCTTCT CCACCAAGCT GCCCACGGTC
GCCCCCGGCT CGACGGTCGC GGCCGCCGGT CCCGACCCCA CCGGTCCGCC CAAGCGCGAC
TGA
 
Protein sequence
MTTSASTGAA AATARSGAGL VLAVLAASQF LMTLDSSVMN VSMPTVAADL GTTITGIQTA 
ITMYTLVMAT LMITGGKLGT IMGRRRAMGI GLVIYAAGSF TTGIAQNLTQ LLIGWSLLEG
IGAALIMPAI VALVASNFPA DKRSAAYGLV AAAGAAAVAV GPLIGGAVTT YASWRYVFFG
EVVIVLLILA VLRKLNDVAP QPSRIDLFGS LLSVVGLGLF VFGVLRSSEW GWVIAKPGAP
APLNVSLTIW CMAGGLLVVY GFLRWESRLE AAAKDDPTAL QPLLRPSMLA NRQLTGGLGM
FFSQFMIQAG AFFAVPLFLS VVLGLTAMQT GVRLVPLSIA LLVTAIGVPK VWPKANPRRV
VRLGLLLMII GIGFLVAGMD PDADASVMTI PMILMGLGLG ALASQLGAVT VSAVSESETA
EVGGLQNTAT NLGASMGTAL IGSVLIATLS SAALTGVASS PEINDALKSQ ISTELVGGVP
FVSDEQVATA LSAAGVSQSE IDAITEINAQ ARLEALQVAF ALVGFIAIGA LFFSTKLPTV
APGSTVAAAG PDPTGPPKRD