Gene Namu_2998 details


Gene Information

Locus tag: Namu_2998
Symbol:
ID: 8448611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3289464
End bp: 3290702
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 69%
IMG OID: 645042082
Product: major facilitator superfamily MFS_1
Protein accession: YP_003202324
Protein GI: 258653168
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
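
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information table.
START_BP = 3289464
END_BP = 3290702

gene_length = span_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1239 bp) and Protein Length (412 aa).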


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0156539
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000637715
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGATTACC CCTCCGGTGC GGGCACCGTC CCCGCCGTCA CCGATACCCC CGCACCCACC 
GACCCGGCGC CGGCCCCGGC CAGGACCGGC CAGCCGATCG CCGTCTGGGT GCTGGCCTTC
GCGGCCATGG TCTCGTTCAT GGGCATCGGG CTGGTCGATC CGATCCTCAA GTCGATCGCG
GCCAACCTGG ACGCCACCCC CAGTGAGGTC TCGCTGCTGT TCACCAGCTA CCTGCTGGTC
ACCGCGATCG CGATGCTGAT CACCTCGTTC GTCTCCAGCC GCTTCGGTGG CCGGACCACG
CTGATGGCCG GGCTCGTCAT CATCATCGTG TTCACCACCC TGGCCGGAAC GTCCGATTCG
GTCGCCGCGC TCGTCGGCTG GCGGGCCGGC TGGGGTCTGG GCAATGCCCT GTTCATCGCC
ACCGCACTGG CCGCGATCAT CGCCGTCGCC CGGGGCGGCG CCGAGAAGGC CGTCACGCTC
TACGAAGCCG CGCTGGGCGT CGGCATCTCG GTCGGGCCGC TGGTCGGCGC GCTGCTGGGC
ACCGTCAACT GGCGGGCCCC GTTCTTCGGC GTGGCCGTGT TGATGGGCAT CGCCCTGCTG
GCCATCTCGC TGTTCCTGAA GGACAAGGTC ACGGTCACCC ACCGAATCCG GCCGGCCGAC
CCATTGCGCG CGCTGGGGCA CGGTGGCCTG CTGGTATTGG GCATTGCCGC CCTGCTCTAC
AACGGCGGCT TCTTCGCGGT GCTGGCCTTC ACCCCGTTCA CGTTGCCCTA CAGCGCTTTC
GGGATCGGCT TCCTGTTCTT CGGGTGGGGC GTCCTGCTGG GCCTGTGCGC CGTCTGGGGC
GCACCGTGGA TGCACCGCCG GTTCGGGCTG ACCAACGCGT TCATCATCAC CCTGGGCGTG
TTCACCGCGA TCCTGGTCGC ATTGGCCTTG ACCGTCGACA ACCACGTCGC CGTCACCGTG
TTGGTGATCG CGTGCGGCGC CCCGCTGGGC GTGCTGAACA CGCTGTTCAC CGAGTCGGCG
ATGAACGTCT CCCCCGTCCC GCGCCCGGTC GCCTCGGCCG GTTACAACTT CGTCCGGTTC
CTGGGGGCGG CCGCCTCGCC GTGGATCTGC GGCAAGCTCG GCGAGGAGGT CGGCCTGTCG
GCCCCGTTCT GGTTCGGTGG CGCCTGCGTC ATCGGCGGAC TGCTGATGAT CGCGGTCTTC
GGCCGCCGGC ACCTGGCCGC GATCAACGCC CGGCACTGA
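
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed.
first_line = "ATGGATTACC CCTCCGGTGC GGGCACCGTC CCCGCCGTCA CCGATACCCC CGCACCCACC"
first_bases = clean_sequence(first_line)
print(len(first_bases), round(gc_percent(first_bases), 1))
```

Passing the full sequence through the same two functions would give values to compare against the Gene Length and GC content fields.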
 
Protein sequence
MDYPSGAGTV PAVTDTPAPT DPAPAPARTG QPIAVWVLAF AAMVSFMGIG LVDPILKSIA 
ANLDATPSEV SLLFTSYLLV TAIAMLITSF VSSRFGGRTT LMAGLVIIIV FTTLAGTSDS
VAALVGWRAG WGLGNALFIA TALAAIIAVA RGGAEKAVTL YEAALGVGIS VGPLVGALLG
TVNWRAPFFG VAVLMGIALL AISLFLKDKV TVTHRIRPAD PLRALGHGGL LVLGIAALLY
NGGFFAVLAF TPFTLPYSAF GIGFLFFGWG VLLGLCAVWG APWMHRRFGL TNAFIITLGV
FTAILVALAL TVDNHVAVTV LVIACGAPLG VLNTLFTESA MNVSPVPRPV ASAGYNFVRF
LGAAASPWIC GKLGEEVGLS APFWFGGACV IGGLLMIAVF GRRHLAAINA RH
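
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own procedure: it builds the codon table from the compact amino-acid string used for NCBI translation table 11 (which assigns the same amino acids as the standard code), and it assumes the input starts in frame. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codons in TCAG order, paired with the table 11 amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and last nine bases of the gene sequence.
print(translate("ATGGATTACC CCTCCGGTGC"))
print(translate("CGGCACTGA"))
```

The first six codons translate to MDYPSG and the last three to RH followed by a stop, which is consistent with the start and end of the protein sequence listed above.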