Gene Namu_2508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2508 
Symbol 
ID8448119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2763891 
End bp2765057 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content71% 
IMG OID645041620 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003201864 
Protein GI258652708 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00018254 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00122667 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGCCAC CGGCGCGATC GAGCGTGCTG GTGCCGTCGG CGGCGCTGCT GTGGGGTCTG 
CAGGCGGCGT TCCTCACGCC CGTGCTGGCA TTGCTGCTGG TCTCGCTGTA CGACGCGACC
ACCGTGCAGG TCGGCTGGGT CATCGCCGTC TACAACGCCA GCGGGTTCCT TGCTGCGCTG
GTCATCCCGG CCCGCGCCGA CCGAGCGCGC CGGTACCTGC CGTCCCTGGT GGTCTGCGCG
GCCCTCACTG CGGCACTGGC CGCCGCGCTG GCGTTGAGCA CGTCCTTGCC GATCGCCGCA
GTGGCGCTGG CCGTGCTGGG CGGGCCGGCG AGCTCGGGGT TTTCGCTGCT GTTCGCGCAC
CTGCGGCATT CCGGAGCGAC ACCGAATCAG GTGGTCAACA CCCGCGCCGT GGTCTCGTTC
GCCTGGGTCG CCGGGCCGCC CATCGCCACC TTCCTGGTCG GGGCGTTCGG CGACCGCTCA
CTGTTGGCCG CACTCGGCGT CATCGGCGTG CTCAGTGTGG CCGTCACCGC GCTGATGATG
CGAGGAGCGG CGTCGGACCG CCTCCGACCA CCGGTCGTCG AACAGCAGAC GACGATGAGA
CCGTCGCGGT CGACCGTCGC GGTGGTGATC GGCGCGTTCG TGGCCGTGCA AGCCGGGAAC
GCGGCCGCCG TTGCGGTGAT GACGCTGTAC GTGACCGAAT CGCTGGGGCT CGGCGTCGTG
TGGGCCGGCG CCGCGCTGGC CGTCGCGGCG GGGCTGGAGA TTCCTGCCCT GCTGATCATG
GGCCGGCTGA GCCGCCGTTT CACCAGCCTG GGACTGATCA TCGCCGGTTG CCTGGCCGGC
ATCGCCTACT GCGCGGCCAT GGCCGCGCTG TCCGGCCCCA TCGCCCTGCT GGCCGTCCAG
GTGCTCAGTG CCTGGTTGGT CGCGGCCGTC GCCGGGATCG GCATGACCCT GTTCCAGGAC
ATGATCCCGC AGCCGGGCCT GGCCGTCGGC ATCTACGCGA ACACCCGCCG CATCGGGGCG
ATCGCCTCCG GGGCGATCAT CGCCTTCGGT TCCACCAGCG CCCTGGGCTA CCGCGGCGTC
TTCGTCGCGT CGGGACTGGT CACCGCACTG GCTCTGCTCA TGCTCCTAGT GGTACGGATC
AGACCCTCCC GCTCCGGCCA CCATTGA
 
Protein sequence
MMPPARSSVL VPSAALLWGL QAAFLTPVLA LLLVSLYDAT TVQVGWVIAV YNASGFLAAL 
VIPARADRAR RYLPSLVVCA ALTAALAAAL ALSTSLPIAA VALAVLGGPA SSGFSLLFAH
LRHSGATPNQ VVNTRAVVSF AWVAGPPIAT FLVGAFGDRS LLAALGVIGV LSVAVTALMM
RGAASDRLRP PVVEQQTTMR PSRSTVAVVI GAFVAVQAGN AAAVAVMTLY VTESLGLGVV
WAGAALAVAA GLEIPALLIM GRLSRRFTSL GLIIAGCLAG IAYCAAMAAL SGPIALLAVQ
VLSAWLVAAV AGIGMTLFQD MIPQPGLAVG IYANTRRIGA IASGAIIAFG STSALGYRGV
FVASGLVTAL ALLMLLVVRI RPSRSGHH