Gene Ndas_4303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4303 
Symbol 
ID9248178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5119792 
End bp5121093 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content75% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003682198 
Protein GI297563224 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.530625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.111618 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAACCG CCGTGGGCCG TCGGCGCCTC GCGCTGTGCG TCCTGTTCCT CCTGCCCGGG 
CTGGGGATCT CGTCCTGGGT CACCCGTACG CCCGCCATCC GCGACGCGCT GGGCGCCTCC
ACCGCCGAGA TGGGGTTCGT CCTGTTCGGC CTCTCGATCG GTTCCATGAT CGGCGTCCTC
GGGTCGGGGG CGGTCGTCGC CCGCCTGGGC GCACGGCCGG TCATCGTGGC CGGGACCGCC
GCGATGCTGG GGAGCCTGCC CGTCATCGGT CTTGGCGCGG GCCTCTCGTC CGCCCTCGTC
GTCGCGTTCG GCCTGTTCCT CTTCGGGCTG GGCATGGGCG CTGGGGAGAT CGCGATGAAC
ATCGAGGGAG CCGACGTCGA GCGGGTCATG GCCGAGCCGC TGCTGCCGCG CATGCACGGC
TTCTTCAGCC TGGGGACCGT GATCGGCGCC CTCGTCGGCA TGGCGCTCAC CGCAGTCGGG
TTCCCCGTGG CGTGGCACCT GGCGGCGATG GGCGTCCTGA CCCTGGCGGT GGCGGCGACG
CTCTTCGGCT CCCTGCCCCC CGGCACCGGC AGGGCGCTCC CGCGCGCCTC CGGCCAGGGG
AGCGCGGGCG GCGGACGCGC GCTGTGGAAG GACGCGCGCC TGGTGCTGAT CGGCCTCATC
GTGCTCGCGA TGGCCCTGGC CGAGGGCACC GCCAACGACT GGCTCCCGCT GATCATGGTC
GACGGCCACG GCTTCGACCC GGCCCTGGGG TCGATGGTCT ACGCCGTCTT CGCCGCGTCG
ATGACGGTCG GGCGCTTCGC CGGGGGCTAC TTCCTGGCGC GCTTCGGCAG GGCCCGCGTG
CTCGGGGCGA GCGCGCTGGC CGGGGTGGCG GGCATGGGGC TGGTGGCCGG TGCGGACAGC
CCGGCCCTGG CGGCGGCGGC CGTGGTCCTG TGGGGACTGG GCGCCTCGCT GGGCTTCCCC
GTGGCACTGT CGGCGGCCGG GGACTCCGGG CCCGACTCCG CGGCCCGGGT CTCCCTGGTG
GCGACGCTCG GCTACGTCGC GTTCCTGGTC GGGCCGCCCG TGCTCGGCCT GCTGGGGGAG
GCGTACGGGT TGCGTACGGC CCTGGTCGTG CCCCTGCTCC TCGTGGCGTT CGCCGGGTTC
CTCAGCCCGG CGGCCCGTCC GCGCCGGGCG GCGGGCGCGC GGGCCGAAGA GGACAGTGCT
GGAACCGGAA GGGGCGGTAC GGAAGCCGAG GGGAGTGGTG CGGGGACCGA TCGGGGCGGC
GCGGAGGCCG GGCAGGGTGG TGCGGAGGCA GGACGGGCCT GA
 
Protein sequence
MGTAVGRRRL ALCVLFLLPG LGISSWVTRT PAIRDALGAS TAEMGFVLFG LSIGSMIGVL 
GSGAVVARLG ARPVIVAGTA AMLGSLPVIG LGAGLSSALV VAFGLFLFGL GMGAGEIAMN
IEGADVERVM AEPLLPRMHG FFSLGTVIGA LVGMALTAVG FPVAWHLAAM GVLTLAVAAT
LFGSLPPGTG RALPRASGQG SAGGGRALWK DARLVLIGLI VLAMALAEGT ANDWLPLIMV
DGHGFDPALG SMVYAVFAAS MTVGRFAGGY FLARFGRARV LGASALAGVA GMGLVAGADS
PALAAAAVVL WGLGASLGFP VALSAAGDSG PDSAARVSLV ATLGYVAFLV GPPVLGLLGE
AYGLRTALVV PLLLVAFAGF LSPAARPRRA AGARAEEDSA GTGRGGTEAE GSGAGTDRGG
AEAGQGGAEA GRA