Gene Ndas_1673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1673 
Symbol 
ID9245523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2044753 
End bp2045961 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content76% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679608 
Protein GI297560634 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTCGGG GCATGTCCAC GACCTCGTCT CCCCAGGCCT CTCCACCCGT GCCCTCCCCC 
GTCCGGGGAT GGTCCGCCGT CGTCGCGGTG GCCCTCGGCA CCTTCACCGT CGTCACCACC
GAAATGCTGC CCGTCGGGCT GCTGACCCCC ATGGGCGCCT CCCTGGAGGT CACGGAGGGC
ACCGCCGGGC TGACCCTGAC CGTCACCGGC CTGGTGGCGG CGGCCACCGC GCCCTTCGTC
CCGGCCCTCA TCGGCCGCGC CGACCGCAAG GGCGTCCTCG TGGCTCTCAT GCTCCTGCTC
GCCGCCGCCA ACCTGCTGTC GGCGTGGGCG CCGAACTTCG CGGTCATGGT CACGGCACGC
GTGCTCGTCG GCCTGGGCAT GGGCGGTGTC TGGGCGCTGG CGGCGGGGCT CGCGCCCCGG
CTGGTGCCCG AGCGGTCGGT GGGGTCGGCG ACCGCCGCCG TCTTCAGCGG GATCGCCGTC
GCCTCGGTGT TCGGCGTGCC GGTGGGCGCC TACATCGGGG CCCTGACCGA CTGGCGCGCC
GCCTTCGCGG CCTTCGCGGT GCTGGCCCTG TCCGTCGCGG TCGCCATGGT CGTGCTGCTG
CCGCGCCTGC CCTCCGAGCG GGCGGCCCGC CTGAGCGGCG TGACCGGCCT GCTGCGCAAC
CCCAGGGTGG TCACCGGCCT GCTCCTCGCG GTGCTCCTGG TGACCGGCCA CTTCGCCGCC
TACACCTACG TCCGCCCCGT CCTGGAGACG ACGGCGGGGG TCAGCGCGGG TCTGATCGGC
ACGATGCTGC TGGTCTACGG CCTGGCCGGT GTCGTCGGCA ACTTCGCCTC CGGCCCGCGC
GCCGTCCGGG CCCCGCGCGC CACCCTCGTG GTGCTCAGCG CTGCCCTGGG CGGATCGGTG
TTCCTCCTCC CCTGGATCGG CGTCACGGTC CTGGGGGCGG GGGTGCTCAT GGTCGTGTGG
GGTCTGGCCT ACGGCGGGGT GTCCGTCAGC GCCCAGACCT GGATGATGCG CGCGGCCCCC
GAGGAGCGCG AGGCCGTCTC GTCGCTGTTC GTCGGCGTGT TCAACGGCAG CATCGCGCTG
GGCGCGCTGG TGGGCGGCCC GGTCCTGGAC GGCGCGGGCG GCACCGCGCT CCTGTGGCTG
GCCGCCGCGC TGGCCCTGGG CGCGCTCACC ACGGCCCTCC TGGGCCGCGC GCCCGCCCGG
ACGGCCTGA
 
Protein sequence
MTRGMSTTSS PQASPPVPSP VRGWSAVVAV ALGTFTVVTT EMLPVGLLTP MGASLEVTEG 
TAGLTLTVTG LVAAATAPFV PALIGRADRK GVLVALMLLL AAANLLSAWA PNFAVMVTAR
VLVGLGMGGV WALAAGLAPR LVPERSVGSA TAAVFSGIAV ASVFGVPVGA YIGALTDWRA
AFAAFAVLAL SVAVAMVVLL PRLPSERAAR LSGVTGLLRN PRVVTGLLLA VLLVTGHFAA
YTYVRPVLET TAGVSAGLIG TMLLVYGLAG VVGNFASGPR AVRAPRATLV VLSAALGGSV
FLLPWIGVTV LGAGVLMVVW GLAYGGVSVS AQTWMMRAAP EEREAVSSLF VGVFNGSIAL
GALVGGPVLD GAGGTALLWL AAALALGALT TALLGRAPAR TA