Gene Namu_0802 details

Gene Information

Locus tag: Namu_0802
Symbol:
ID: 8446394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 883704
End bp: 885194
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 67%
IMG OID: 645039939
Product: sugar transporter
Protein accession: YP_003200202
Protein GI: 258651046
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00879] MFS transporter, sugar porter (SP) family
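
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 883704
END_BP = 885194
GENE_LENGTH_BP = 1491
PROTEIN_LENGTH_AA = 496


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should agree with the table if the assumptions hold.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 885194 - 883704 + 1 gives 1491 bp, and 1491 / 3 - 1 gives 496 aa, matching the listed values.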


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGAGC ACGGTGGCAG CTTCGACAGC AGCGCATCGA TCTACGACGA TTCCGACGAA 
GGCAAGGGCG TCGTCCGGAT CGCCTCGGTG GCCGCGCTCG GCGGGTTCCT GTTCGGGTAC
GACAGCGCGG TGATCAACGG CGCCAACTCG GCCATCCAGG AATACTTCAA CGCCGGGGCG
CTGGAGCTGG GCTTCACGGT GGCGGCCGCG CTGCTGGGCG CCGCGGCCGG TGCGCTGTTG
GCCGGGCGGC TGGCCGACCA CATCGGCCGG CTGTCGGTGA TGCGCCTGGC CGCGGTGCTG
TTCGCGATCA GCGCCATCGG CTGCGCGTTG GTACCCAGCC TGTGGATGCT GATCCTGTTC
CGGTTGATCG GCGGCATCGG CGTCGGCGTC GCCTCGGTGA TCGCGCCGGC CTACATCGCC
GAGATCGCGC CGGCCAAGAT CCGCGGCCGG CTGGGTTCGC TGCAGCAACT GGCCATCGTC
ACCGGCATCT TCATCTCGCT GCTGGTGGAC TTCCTGCTCG CCAACGCCGC CGGCGGCTCG
AACGCGGACT TCTGGTTCGG CTGGGAAGCC TGGCGCTGGA TGTTCTTCAT GATGATCATC
CCCGCCCTGC TCTACGGCGG GCTGGCGTTG ACCATCCCGG AGTCGCCGCG CTACCTGATC
GCCAAGCACC GCATTGCCGA GGCCAAGGAG GTCCTCACCG GCCTGCTCGG CCCGCGCAAC
ATCGACGCCA AGATCGAGAA GATCCGGGCC AGCATGGAGC GCGAGACCGA ACCGTCCTGG
AAGGACCTGA AGTCCACCAC CACCGGCCGC ATCGCCGGCA TCGTCTGGAT CGGCCTGCTG
CTGTCGGTGT TCCAGCAGTT CGTCGGCATC AACGTGATCT TCTACTACTC CAACATCCTC
TGGGAGGCCG TCGGCTTCAC CGAGGATCAG TCGTTCATCA TCACCGTCAT CTCGGCCACC
ATCAACATCC TGACGACGCT GATCGCGATC GCCACCATCG ACAAGGTCGG CCGAAAACCG
CTGCTGCTCA TCGGGTCGGT GGGCATGACG GTCACCCTGG CGACCATGGC CATCATCTTC
GGCACCGCCG GCGAGTGCAC CCAGGTGATC GCCGACCAGT GCACCGAGGC CAACGTGGCC
GACGGCACGC CGAACCTGTC CGTGGCCATC CTGGGCGCGG CCTCGCCGAT CGTCGCGCTC
ATCGCGGCGA ACCTGTTCGT GGTCGCGTTC GGCATGTCCT GGGGCCCGGT GGTCTGGGTG
CTGCTGGGCG AGATGTTCCC GAACCGGATG CGGGCCGCCG CCCTGTCGCT GGCCGCGGGC
GGTCAGTGGG TGGCGAACTG GATCGTCACC GTCACCTTCC CGCCGCTGGC CGACATCTCG
CTGGCGCTGG CCTACAGCCT CTACGCCGCG TTCGCCTTCC TGTCGTTCAT CTTCGTCAGC
AAGTGGGTGC AGGAGACCAA GGGCAAGCAG TTGGAGGACA TGCACGCCTG A
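
The gene sequence above is displayed in space-separated blocks of 10 bases. A minimal sketch of how such text could be cleaned and its GC fraction computed is shown below; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as sample input. Pasting the full sequence in its place would allow a comparison with the GC content listed in the gene table.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "ATGGGCGAGC ACGGTGGCAG CTTCGACAGC AGCGCATCGA TCTACGACGA TTCCGACGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```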
 
Protein sequence
MGEHGGSFDS SASIYDDSDE GKGVVRIASV AALGGFLFGY DSAVINGANS AIQEYFNAGA 
LELGFTVAAA LLGAAAGALL AGRLADHIGR LSVMRLAAVL FAISAIGCAL VPSLWMLILF
RLIGGIGVGV ASVIAPAYIA EIAPAKIRGR LGSLQQLAIV TGIFISLLVD FLLANAAGGS
NADFWFGWEA WRWMFFMMII PALLYGGLAL TIPESPRYLI AKHRIAEAKE VLTGLLGPRN
IDAKIEKIRA SMERETEPSW KDLKSTTTGR IAGIVWIGLL LSVFQQFVGI NVIFYYSNIL
WEAVGFTEDQ SFIITVISAT INILTTLIAI ATIDKVGRKP LLLIGSVGMT VTLATMAIIF
GTAGECTQVI ADQCTEANVA DGTPNLSVAI LGAASPIVAL IAANLFVVAF GMSWGPVVWV
LLGEMFPNRM RAAALSLAAG GQWVANWIVT VTFPPLADIS LALAYSLYAA FAFLSFIFVS
KWVQETKGKQ LEDMHA
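
The protein sequence can be spot-checked against the gene sequence by translating codons. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (to the best of our knowledge the two differ only in which codons may act as start codons). The `translate` helper is illustrative, handles only unambiguous A/C/G/T codons, and is applied here to the first 30 and last 12 bases of the gene rather than the full sequence.

```python
from itertools import product

# Codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_start = "ATGGGCGAGC ACGGTGGCAG CTTCGACAGC"  # first 30 bases of the gene
gene_end = "ATGCACGCCTGA"                        # last 12 bases of the gene
print(translate(gene_start))
print(translate(gene_end))
```

The two printed fragments can be compared with the first ten and last three residues of the protein sequence above.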