Gene Namu_3831 details

Gene Information

Locus tag: Namu_3831
Symbol:
ID: 8449450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4206114
End bp: 4207310
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 78%
IMG OID: 645042881
Product: major facilitator superfamily MFS_1
Protein accession: YP_003203117
Protein GI: 258653961
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
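
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4206114, 4207310)
print(length)                     # 1197
print(protein_length_aa(length))  # 398
```

Both results agree with the Gene Length and Protein Length fields of this record.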


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.135932
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0769479
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGTCCG ACCCTGCCCG GCGCCCGGTC GATCGGCGCC GACCCGGAGC CGGGCTGTCG 
GCGGTGCTGG CCCTGCTGCT GGCCACCGGC TGGGCGGCCA ACCATTTTGC CGCGCTGCTG
CCGGTGCTGC GCACGTCGCA GAACCTGTCC GCCGCGCTGG TGGCCGGCCT GTACGGGCTG
TACGCCGTCG GGCTGTTGCC GGGTCTGCTG CTCGGCGGCT CGGCCTCGGA CCGGTTCGGG
CGCCGTGCCG TGGCCGTGCC CGGGGCGCTG CTGGCCGCGA TCGGCACGCT GATCCTGTTG
TTCTGGCACG ACCCGACCGG GCTGGTCGTG GGCCGGCTGG TGGTCGGCGC CGGGGCCGGC
GCGACGTTCA GCGCCGGCAC CGCCTGGGCC GCCGACCTCG GTGGGGCGGC CGGGGTCACC
CGGGCCGGGG TGTTCCTGAC GCTGGGCTTC GCCACCGGCC CGGTGGTTTC CGGGGTGCTG
GCCGAGTTCG CGCCGGCGCC GCTGGTCGTC CCGTTCGTGC TCAGCGCGGT GCTCTCGGTG
GCCGCGGTGG CGGCCGCCGC CCTGGTCCCG GGCCGGCCGC CGCACCCGCC GCCGCACGCG
GTGTCGACGG CCGGCCGGAC CCCTTGGGTG CCCGACCGGC GGCGCTCCGC CGGCACCGCC
CTGGCCTGGG CCTTGCCGGT CGCGCCGTGG GTGTTCGCCG GGGCCACGGT CGGGGTGGTC
ACCCTGCCGT CCCGATTGCC GGCCGGATCC GGGGGACCGT TGCTGGCCGG GATCGCCGCC
GGAGTGGTAC TGGGCACCGG GGTGGTCGTC CAGACGATCG CCCGCCGGCG CAACGTCGGT
CCCGGCGCGG GCGTGCTCGG TGCGGTCGCC GCCGCGGCCG GATTGGTCCT CGCGGCCATC
GGTGGCGCCC AGCCCGGCCT GGTGCTGGTC GCGGTGGCGT TCCTGCTGCT GGGCACCGGG
TACGGGCTGT GCCTGCGGGC CGGCCTGCTG GACCTGGAGC GCTGGGCGCC GCCGGCCGCC
CGCGGCAGCC TGACCGGCGT GTTCTACCTG GCCACCTACA GCGGCTTCGC CGTCCCGGTG
GTGCTGGCCG CGCTCGATCC GGTGGCCGGC CCCACCGTCC CGCTGCTCGT GCTGGGCGCG
TTGGCGGCCC TGGTCGCGGT GCTGCGATGG CTGCGAATCG TCGGGGAGCG GGCCTGA
 
Protein sequence
MPSDPARRPV DRRRPGAGLS AVLALLLATG WAANHFAALL PVLRTSQNLS AALVAGLYGL 
YAVGLLPGLL LGGSASDRFG RRAVAVPGAL LAAIGTLILL FWHDPTGLVV GRLVVGAGAG
ATFSAGTAWA ADLGGAAGVT RAGVFLTLGF ATGPVVSGVL AEFAPAPLVV PFVLSAVLSV
AAVAAAALVP GRPPHPPPHA VSTAGRTPWV PDRRRSAGTA LAWALPVAPW VFAGATVGVV
TLPSRLPAGS GGPLLAGIAA GVVLGTGVVV QTIARRRNVG PGAGVLGAVA AAAGLVLAAI
GGAQPGLVLV AVAFLLLGTG YGLCLRAGLL DLERWAPPAA RGSLTGVFYL ATYSGFAVPV
VLAALDPVAG PTVPLLVLGA LAALVAVLRW LRIVGERA
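
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. It is an illustration only, not the pipeline that produced this record: the function name `translate_cds` is hypothetical, and the code assumes the input begins at the start codon and is in frame.

```python
# Codon table built from the usual compact encoding (base order T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, reading a valid first codon as M."""
    dna = "".join(dna.split()).upper()   # drop the spacing used in the record
    usable = len(dna) - len(dna) % 3     # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE.get(codon, "X")  # X for ambiguous codons
        if i == 0 and codon in START_CODONS:
            residue = "M"
        if residue == "*":
            break                        # stop codon ends the protein
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCCGTCCG ACCCTGCCCG GCGCCCGGTC"))  # MPSDPARRPV
```

The output matches the first ten residues of the protein sequence listed above.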