Gene Namu_5351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5351 
Symbol 
ID8450984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5988190 
End bp5990226 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content72% 
IMG OID645044382 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003204604 
Protein GI258655448 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACCG AACGGACGGA CAACTCCTCC GGCCCCGATC TGGACACCGA CCCGGGCCTC 
GATCCCGAGA CCGGCCGCCG CCCCACCGTG GCCATCGTGG CCGTGCTGGC GTTGTGCGGC
ACCGCGGTGG CCCTGCAGCA GACGATGGTC GTCCCGTTGC TGCCCGAGTT CCCCACGATC
CTGGGCGTCA GCGCCGACGA CGCGTCCTGG CTGGTCACCG CCACCCTGCT GACCAGCGCG
GTGGCCACCC CGGTGATGTC CCGGCTGGCC GACATGGTCG GCAAGCGGCT GATGATGCTC
GTCTGCATGG TCGCGATGAC GGCCGGATCC GTGCTCGCCG CGCTGTCCTC GGCCTTCCCG
CTGGTCATCG CCGGGCGAGC GCTGCAGGGC TTTGCGGCCG CCCTCATCCC GATCGGCATC
AGCATCATGC GCGACGAGCT GCCCCGGGAG CGGGTGAGTT CGGCGGTCGC CCTGATGAGC
GCGACCCTGG GCATCGGCGG GGCGCTGGGC CTGCCGCTGG CCGGCATCGT CTCCGAGAGC
CTGGGCTGGC ACGCCAACTT CTGGCTGTCG GCGATCGTCG GGGTGATCCT GCTCGGCGCG
ATCCTGCTGG TGATCCCCGA GTCCCGGGTG CGTACCGGCG GCTCGTTCGA CTACCTGGGT
GCGGTGCTGC TCTCGATCGC CCTGACCGGC CTGCTGCTGG TGATCTCCAA GGGCAGCGCG
TGGGGCTGGC GCAGCGAACC GGTGATCGTG CTGTTCCTCA TCGCGGTCGG GGCGCTGAGC
GCGTGGGTGC CGTTCGAGCT GCGCGTCGGG CAACCGCTGG TCGACCTGCG CACCTCGGCC
CGGCGGCCGG TGCTGCTGAC CAACATCGCC TCGGTGCTGC TGGGTTTCGC GATGTTCGCC
AACCTGCTGC TGACCACCCA GGAACTGCAG ATCCCCACCG TCACCGGGTA CGGATTCGGC
CTGCCGATCA TCACCGCCGG CCTGCTGATG GTCCCGTCCG GGCTGGCCAT GGTCATCTTC
GCCCCGGTGT CCGGCGGGAT GATCAACCGG TTCGGCGGCC GGATCACCCT GCTCACCGGT
GGCCTGGTGA TGGCCCTGGC CTACATCGCC CGGGTCTTCC TGTCCGGGAA CCTGACCGCG
GTCGTCATCG GCTCGACCCT GGTCAGCATC GGCACGGCCA TCGCCTACGC GGCGATGCCG
ACCCTGATCA TGGCGTCCGT GCCGATCACC GAGACGGCCA GCGCCAACGG GCTGAACACC
CTGCTGCGGG CCATCGGCAC CTCGACCTCC AGCGCCACCG TCGCGGCCAT CCTGGGCACC
GTCACCATCA CCGTCGGCAC GCTCACCGCG CCGTCCGCGC AGGCCTTCCA GGACGTGTTC
TGGATCGCCG CGACCGCGGC CCTGCTGGGC TGCGTGGTGG CCTGGTTCAT CCCCCGGCCG
TCCGCGGCCG CGGCGAGCGG GCCGGCCGGG GAGCCGACCC CGATGCGGGT CGGGCCGGCG
GTGAGCGCGG GCGACAGCAA GGACGTGGTG CTGCGCGGAC GGATCCGCCG GCCGGACGGC
GCCGTGCCGT ATCCCGCGGT GGTCACCGTG GTCACCACCG ACGGTGACCC GGTGGACTGG
GGACGGGCCG ACCACGACGG CCGGTACTCC ATCGCCCTGC CCGGCCCGGG CCGGTACCTG
GTGCTGGCCA ACGCCCAGGG CTGGTCACCC AAGGCCCAGG TGATGACGTT CCGGCACCGA
GCGGAGATGA CCGGGCTGAG CGAGATCACG CTGACCGATC AGCTGACCCT GTCCGGTCAG
GTGACCTGCG GGGCGGCGCC GGTGCCGCAC GCGCTGGTCT CGCTGTCCGA GGCCGCCGGC
GCGTCGGTGT GTTCGCTGAC CGCCGACGAG CACGGCCACT GGTGCCTGCC GCTGCCGCCG
CCCGGGCGGT ACGTGGTGGC CGTGCTGGCC CGGGACTCGG GTGCGGCCGG GGCCCGCAAG
GTCGTGCTGG ACGCCCGGTC CGCCGTGGTC GACGTGAGCA TCCCGGCGGC CGGCTGA
 
Protein sequence
MVTERTDNSS GPDLDTDPGL DPETGRRPTV AIVAVLALCG TAVALQQTMV VPLLPEFPTI 
LGVSADDASW LVTATLLTSA VATPVMSRLA DMVGKRLMML VCMVAMTAGS VLAALSSAFP
LVIAGRALQG FAAALIPIGI SIMRDELPRE RVSSAVALMS ATLGIGGALG LPLAGIVSES
LGWHANFWLS AIVGVILLGA ILLVIPESRV RTGGSFDYLG AVLLSIALTG LLLVISKGSA
WGWRSEPVIV LFLIAVGALS AWVPFELRVG QPLVDLRTSA RRPVLLTNIA SVLLGFAMFA
NLLLTTQELQ IPTVTGYGFG LPIITAGLLM VPSGLAMVIF APVSGGMINR FGGRITLLTG
GLVMALAYIA RVFLSGNLTA VVIGSTLVSI GTAIAYAAMP TLIMASVPIT ETASANGLNT
LLRAIGTSTS SATVAAILGT VTITVGTLTA PSAQAFQDVF WIAATAALLG CVVAWFIPRP
SAAAASGPAG EPTPMRVGPA VSAGDSKDVV LRGRIRRPDG AVPYPAVVTV VTTDGDPVDW
GRADHDGRYS IALPGPGRYL VLANAQGWSP KAQVMTFRHR AEMTGLSEIT LTDQLTLSGQ
VTCGAAPVPH ALVSLSEAAG ASVCSLTADE HGHWCLPLPP PGRYVVAVLA RDSGAAGARK
VVLDARSAVV DVSIPAAG