Gene Namu_4005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4005 
Symbol 
ID8449624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4424410 
End bp4425738 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content71% 
IMG OID645043050 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003203286 
Protein GI258654130 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0701044 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCAGG ACCTGACCAA CGCCCCGACG CCCACGTCGG CCACGGCGGG CGAGCCCGCC 
GCACCGGCGC GGATGGGCAC CTTCGCGGCC CTGCGGGTCC GCAACTACCG GCTGTTCTTC
GCCGGCCAGA TCGTGTCCAA CACCGGCACC TGGATGCAGC GAATCGCCCA GGACTGGCTG
GTGCTGGAGC TGACCCACTC GCCGCTGGCG GTCGGCATCA CCACCGCGCT GCAGTTCCTG
CCGATGCTGG TGTTCGGACT GTGGGGCGGC CTGATCGCCG ATCGCTACCC CAAGCGCCGG
CTGCTGCTGC TCACCCAGTC GTCGATGGGC GTGCTCGCCG TCCTGCTGGC CGGGCTGACC
CTGGCCGGAG TCGTGCAGGT CTGGCAGGTG TACCTGATCG CCCTGGGCCT GGGGCTGGCC
ACCGTGGTGG ACAACCCGAC CCGGCAGACG TTCGTCAACG AGATGGTGCC CACCCACCTG
GTCCGCAACG CGGTCGGGCT CAACTCGGGC AACTTCCAAC TCGGCCGGAT GCTCGGCCCG
GCAGTGGCCG GCGTGCTGAT CGCGGCGGTC GGCACCGGTT GGGCGTTCGC GTTCAACGCG
GCCAGCTTCG CCGCCGTGCT CACCGCGCTG CTGTTGATGC GACCGGCCGA GCTGCAGCAC
CTGCCGCACG CCGACCGGGC TCCTGGCCAG CTGCGTGAGG GGCTGCGGTA CGTGCGCAGC
AACCCGATTC TGCTCTGGCC GATCGTGCTG GTCTTCTTCA TCGGCACGTT CGGCTACAAC
TTCGCGATCA TCCTGTCCGC CTACACCCAG AACATCTTCC AGTCCGGTGC CGACCTGTAC
GGGCTGCTCA ACACCGCGAT GGCCGCCGGC TCGGTGGTCG GCGCGCTCTT CGCGGCCCGG
CGCACCTCGG CCAACCTGGC CGTGCTGTTC CTGGCCGCCG GCAGCTTCGG GCTCGGCCTG
ATCGTGCTCG GCCTGACCCC CTGGTTCTGG CCGTTCCTGC TGCTGCTGGT CGTCGTCGGG
TTCGTCTCGG TCACCTTCAA CACCTTGGGC AACGCCACCG TGCAGCTGTC CAGCGAGCCG
GAGCTGCGCG GCCGGGTGAT GAGCCTGTAC ATGCTGGTCT TCATGGGCGG CACGCCGATC
GGCGCGCCGA TCGTCGGGGC CATCACCCAG CAGTGGGGGG CGCCGGCCGC CCTGATCATC
TCCGGGCTGA TCTGCCTGCT GGCCGCGGCC GGGGCGGCCG CGTTCGCCGC CCACTCGGCC
GGAGTGTCGG TGCGCACCGA CCTGGCCGCC CGGGTCCGCC GCCTGGTCGC CCATCCGCAC
CGCGCCTGA
 
Protein sequence
MSQDLTNAPT PTSATAGEPA APARMGTFAA LRVRNYRLFF AGQIVSNTGT WMQRIAQDWL 
VLELTHSPLA VGITTALQFL PMLVFGLWGG LIADRYPKRR LLLLTQSSMG VLAVLLAGLT
LAGVVQVWQV YLIALGLGLA TVVDNPTRQT FVNEMVPTHL VRNAVGLNSG NFQLGRMLGP
AVAGVLIAAV GTGWAFAFNA ASFAAVLTAL LLMRPAELQH LPHADRAPGQ LREGLRYVRS
NPILLWPIVL VFFIGTFGYN FAIILSAYTQ NIFQSGADLY GLLNTAMAAG SVVGALFAAR
RTSANLAVLF LAAGSFGLGL IVLGLTPWFW PFLLLLVVVG FVSVTFNTLG NATVQLSSEP
ELRGRVMSLY MLVFMGGTPI GAPIVGAITQ QWGAPAALII SGLICLLAAA GAAAFAAHSA
GVSVRTDLAA RVRRLVAHPH RA