Gene Namu_4803 details

Gene Information

Locus tag: Namu_4803
Symbol:
ID: 8450433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5341246
End bp: 5342580
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 72%
IMG OID: 645043842
Product: major facilitator superfamily MFS_1
Protein accession: YP_003204067
Protein GI: 258654911
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
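
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5341246
end_bp = 5342580
gene_length_bp = 1335
protein_length_aa = 444

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = span // 3
residues = codons - 1

print(span, codons, residues)
```

Under these assumptions the span equals the listed gene length, and the codon count minus one equals the listed protein length.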


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.543727
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCTC CCACCGCACG ACCGGGCCGC GGCCCGCTGA TCAAGGCGTA CGCGGCCAGC 
CTCACCGGCA CCGCCCTGGA GTACTACGAC TTCGCGGTCT ATTCGGCCGC CGCCGCCCTG
GTCTTCCCGC AACTGTTCTT CCCCGGCGAG GACCCGCTCA CCGGCACCCT GCTGTCCTTC
TCCACCTACG CGGTCGGGTT CCTGGCCCGC CCGGTCGGCG GGATCGTCTT CGGCCGGCTG
GGTGACCGGG TCGGCCGCAA GAACGTCCTG GTCTGGACGT TGATGCTGAT CGGGGCGGCC
ACGCTGCTGA TCGGCCTGCT GCCCGGCTAC GCCTCGATCG GGGTGGCCGC GCCGATCATC
CTGGTCATGC TGCGCTTCGC GCAGGGCGTC GGGGTCGGCG GCGAGTGGGG CGGCGCGGTG
CTGCTGTCCA GCGAGTACGG CGATCCGGCC AAGCGCGGCT TCTGGGCCTC GGCCGCCCAG
ATCGGCCCGC CCGCCGGCAA CCTGCTGGCC AACGGTGTGC TGGCCGTCCT CGCCGCCGCG
CTGACCGAGG ACGCGTTCCT GAGCTGGGGC TGGCGGGTGG CGTTCCTGAT CTCCGCGGTC
CTGGTCGCCT TCGGCCTGTG GATCCGGCTC AAGCTGGAGG ACACCCCGGT CTTCCAGGCC
ATCAAGGAGA GCGGCGAGCG CCCCAAGGCG CCGATCAAGG AGGTCTTCGC CACCCAGAAG
CGGGCGCTGA CCGCCGCCGC GCTGGCCCGG GTCGGCCCGG ACGTGCTGTA CGCGCTGTTC
ACCGTGTTCG TTGCGACCTA CGCCACCCAG GTCCTGGGCA TGACCCGCAG CCAGGTGCTC
ACCGCCGTGC TCATCGGCTC GGCCGCCCAG CTGGGGTTGA TCCCGCTGGC CGGGGCGCTG
TCGGACCGGA TCAACCGGCG GCTGCTCTAC GCCATCGCCG CGATCGGCTC GGCCATCTGG
GTGCCGGTGT TCTTCCTGAT CCTGGGCCAG CCGTCGATGC CGCTGCTGAT CCTGGGGGTC
GTCATCGGCC TGGCGTTCCA CGCCCTGATG TACGGGCCGC AGGCGGCGTA CATCGTCGAG
CAGTTCGACA TCCACCTGCG CTACGCCGGC AGCTCGCTGG CCTACACGCT GGCCGGGGTC
ATCGGCGGCG CCATCGCCCC GTTGGTGTTC ACCGCGCTGC TCGGCGCGTT CGGCTCCTGG
GTGCCGATCG CGCTGTACCT TGCGGGCTGC GTCGCGGTCA CCCTGGTCGG ACTTCGCCTG
GGCCGGGACC CGCAACCGCA GGAGGAGGAG CACGTGCTGT CCGCCGCCCA CCGTCCGGCC
GCCACCACCT CCTGA
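
The GC content field can in principle be recomputed from the gene sequence once the 10-base grouping and line breaks are removed. The following is a minimal sketch, not the database's own method; the helper names `clean_sequence` and `gc_fraction` are hypothetical. It is demonstrated on the last line of the gene sequence only. Running it on the full sequence should give a value close to the listed 72%, although the exact rounding used by the source is not known.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Last line of the gene sequence, including its block spacing.
last_line = "GCCACCACCT CCTGA"
print(len(clean_sequence(last_line)), round(gc_fraction(last_line), 3))
```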
 
Protein sequence
MPAPTARPGR GPLIKAYAAS LTGTALEYYD FAVYSAAAAL VFPQLFFPGE DPLTGTLLSF 
STYAVGFLAR PVGGIVFGRL GDRVGRKNVL VWTLMLIGAA TLLIGLLPGY ASIGVAAPII
LVMLRFAQGV GVGGEWGGAV LLSSEYGDPA KRGFWASAAQ IGPPAGNLLA NGVLAVLAAA
LTEDAFLSWG WRVAFLISAV LVAFGLWIRL KLEDTPVFQA IKESGERPKA PIKEVFATQK
RALTAAALAR VGPDVLYALF TVFVATYATQ VLGMTRSQVL TAVLIGSAAQ LGLIPLAGAL
SDRINRRLLY AIAAIGSAIW VPVFFLILGQ PSMPLLILGV VIGLAFHALM YGPQAAYIVE
QFDIHLRYAG SSLAYTLAGV IGGAIAPLVF TALLGAFGSW VPIALYLAGC VAVTLVGLRL
GRDPQPQEEE HVLSAAHRPA ATTS
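
The protein sequence should correspond to the gene sequence translated with translation table 11, whose amino-acid assignments are the same as those of the standard genetic code (the tables differ in which start codons are permitted). The sketch below is a hedged illustration using only the Python standard library; the function name `translate` and the way the codon table is built are choices made here, not something taken from the record. It is shown on the first 30 and last 15 bases of the gene sequence.

```python
from itertools import product

# Codon table in TCAG order; these amino-acid assignments apply to
# the standard code and to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence; '*' marks a stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 15 bases of the gene sequence.
head = translate("ATGCCCGCTC CCACCGCACG ACCGGGCCGC")
tail = translate("GCCACCACCT CCTGA")
print(head, tail)  # MPAPTARPGR ATTS*
```

The two fragments match the first ten residues and the last four residues of the protein sequence, followed by the stop codon.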