Gene Namu_2329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2329 
Symbol 
ID8447940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2572118 
End bp2573659 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content73% 
IMG OID645041450 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003201694 
Protein GI258652538 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00827746 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000157522 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGGGGGAC GCCGTTGGGC TATCGGGGCT GCCGGCACGG CCGTCCTGCT GGCCGCCCTG 
GACGCGTACG TCGTCGTCGG GTTGCTGGTC GACATGGTCG TGGACCTGGG CATCCCGGTG
AACCGGCTGG AACGGGCCAC CCCGATCGTC ACCGGTTTCC TGCTCGGGTA CGTCGCGGCC
ATGCCGCTGC TGGGTCAGGC CTCCGACCGG TACGGCCGGC GCCGGGTGCT GCAGCTATGC
CTGCTGGGCT TTGCCGCCGG CTCGGCCCTC ACCGCCGCGG CGGGTTCGGT GCCGTTGCTG
GTGGCCGGTC GCGCCGTGCA GGGCATCGCC GGCGGGGCCC TGTTGCCGGT GACCATGGCC
CTGGTCGCCG ATCTGTGGCC GGAGCGGCGC CGGGCCGGCG TCCTGGGCGC GGTCGGGGCC
GCCCAGGAGA TCGGCAGCGT CCTGGGCACC TTGTACGGCG TCGGGGTGGC CGCCCTGTTC
GCCTCGTGGC CGTTGTTCGC CGCGCTCCAG CCGGAGAGCT GGCGCTGGGT GTTCTGGGTC
AACCTGCCGC TGGCCGCGAT CGCCATGCTC GTGGTGCAGC TGACCGTTCC CCGCTCCGCG
GCGCGGCCGG GCGATCGCCC CGGCGTCGAC CTGATCGGGG GCGCGCTGCT GGCCCTGGCC
CTTGGCCTGC TGGTGGTCGC GCTCTACAAC CCGGACCCGT CCCGATCCGT CCTGCCGTCC
TGGGGATGGC CCGCGCTGGC CGGGGTGGCG GCGTTGGTCG TCGCCTTCGT CGCGTACGAG
CGGCGGGCCC GAGTGCGGCT GCTGGATCCG GCGGGCGTCC GGATGGGGGC GTTGCTCACC
GGTCTGGGGG TCAGCGCGAT CTCCGGGGCC GCCCTGATGG TCACCCTGGT CGACGTCGAG
CTGTTCGCTC AGACGCTATT GCGCATGACC TCGGCCGAGT CGGCCCAGCT GCTGGTGCGC
TTCCTGGTGG CGTTGCCCAT CGGGGCCCTG GTCGGCGGGC TGCTCGCCGC CCGGTGCGGT
GAGAAGTGGG TCAGCGCGGC CGGTTTGGCT CTGGCCGCCG GCGGATTCGT CCTGATGAGC
CGCTGGACGC CCCAGGTGCG CGAATCATCC CATCTGTTCG GAATGCCCGC GCTGGACAGT
GATCTGGCGG TGGCCGGGTT CGGCCTCGGG CTGGTCATCG CGCCGCTGTC CGCGGTGACG
CTGCGGGTGG TGCCGGCCCC GTCCCACGGC GTCGCCTCGG CCGCGGTGGT GGTCGCGCGG
ATGACCGGCA TGCTGATCGG GTTGTCCGCG CTCACCGCGT TCGGGCTCTG GCGGTTCCGG
GACCTGACCC GAGACCTGGT GCCCCCGTTG CCGATCGGGA TCACCGACGA GCAGTTCAAC
GACCGCCTGG CCGCTTTCAG TCGGGCCCTG GAGCAGGCCC TGACCACCGA GTACCAGGAG
ATTTTCCTGG TCACCGCCGG GCTCTGCGGG CTCGGGGTGG GGCTTTCGCT GCTGCTGCCG
CGGCGCGATC GGGCCGCCGT CCGGTCAGGC GATCCGGCGT AG
 
Protein sequence
MGGRRWAIGA AGTAVLLAAL DAYVVVGLLV DMVVDLGIPV NRLERATPIV TGFLLGYVAA 
MPLLGQASDR YGRRRVLQLC LLGFAAGSAL TAAAGSVPLL VAGRAVQGIA GGALLPVTMA
LVADLWPERR RAGVLGAVGA AQEIGSVLGT LYGVGVAALF ASWPLFAALQ PESWRWVFWV
NLPLAAIAML VVQLTVPRSA ARPGDRPGVD LIGGALLALA LGLLVVALYN PDPSRSVLPS
WGWPALAGVA ALVVAFVAYE RRARVRLLDP AGVRMGALLT GLGVSAISGA ALMVTLVDVE
LFAQTLLRMT SAESAQLLVR FLVALPIGAL VGGLLAARCG EKWVSAAGLA LAAGGFVLMS
RWTPQVRESS HLFGMPALDS DLAVAGFGLG LVIAPLSAVT LRVVPAPSHG VASAAVVVAR
MTGMLIGLSA LTAFGLWRFR DLTRDLVPPL PIGITDEQFN DRLAAFSRAL EQALTTEYQE
IFLVTAGLCG LGVGLSLLLP RRDRAAVRSG DPA