Gene Ndas_1671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1671 
Symbol 
ID9245521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2042825 
End bp2044069 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679606 
Protein GI297560632 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAATC CGTACGCGCA GATCTTCGCG GTGCGTGGAG CCAAGGGCTT CACCGTCGCC 
GGGCTCATCG GGCGCATGCC CGTGGCCATG ACCAACATCG GCATCATCAC GATGCTCTCC
ACCACCCACG GCAGCTACGC CCTGGCGGGC GCGGTGGCCG CCGCCTTCAC CCTGTCCATG
GCGCTGATCA CCCCGCAGGT CTCGCGTCTG GCCGACCGCC ACGGACAGCG CCGCGTCCTG
CCCCCCGCGG CCGCCGTCAG CGTCGCGTCC CTGCTCCTGA TGCTGCTGTG CGTGCGGTTC
GACGCGCCGT ACTGGACGCT CTTCGCGTTC GCGGTCCCCG CCGGGACCAT GCCGAACATG
TCCGCGATGT CCCGCGCGCG CTGGACCGAG CTGCTGCGCG GCTCGCCCCG GCTGCACACC
GCCTACTCCT TCGAGTCCGT GGCCGACGAA CTCACCTTCA TCACCGGCCC GGCGCTGTCG
GTGGTGCTGA GCACCATGGC CTTCGCCCAG GCCGGTCCGC TGGCCGCGGC CGCCTTCCTG
GCCCTGGGCG TCACCCTGTT CGTGGCCCAG CGCGGTACGG AGCCGCCGCT CCAGGCTCCC
GAGGCGTCCG GGACCAAGGG CGCGGGCGCG CTCAGCGGCG CCCTCCTGGT CCTGGTGCTG
ACCCTGCTCG CGGGAGGCGT CATCGTCGGC TCGGTGGACG TGGTCGCGGT GGCCTTCGCC
GAGTCGCTCG GCGTGACCAG CGCCACCGGC GTCGTGCTGT CGGCCTACGC GCTCGGGTCG
GCGATCTCCG GGCTGACCTT CGGGGTGCTC GACCTGCCCT GGCGGCTCCA CATGATGCTG
ATCGTGGCCG TGGCCGGGAC GTTCGCGACC ACCCTCCCCT TCCTGGCGGT CGGTAGCATC
TGGACCCTGT CCGTGGCCGT GTTCTTCGCG GGGATCTTCT TCGCCCCCAC GATGATCCTG
GTGATGACGC TGATCGAGCG GACCGTACCG CCGTCCAAGC TGACGGAGGG CATGACCTGG
GCCCTGACCG GCCTGACCAT CGGTACCGCG ATCGGCACCT TCTCCTCGGG GCTGGCGGTG
GAGGAGAGCG GCACCACGGG CGGCTTCCTC GTCGCGGTCG CGGCCGGCGC CCTCGCCCTG
GTCCTGACCC TGGTGTTCGC CCCCCTCCTG GCCCGGGCCC AGGCGAGGGC CGAGCGGGCG
CAGGAGGAAC AGGCCGCGGA GGAGGCGGCC GGTACCGCCG GGTGA
 
Protein sequence
MPNPYAQIFA VRGAKGFTVA GLIGRMPVAM TNIGIITMLS TTHGSYALAG AVAAAFTLSM 
ALITPQVSRL ADRHGQRRVL PPAAAVSVAS LLLMLLCVRF DAPYWTLFAF AVPAGTMPNM
SAMSRARWTE LLRGSPRLHT AYSFESVADE LTFITGPALS VVLSTMAFAQ AGPLAAAAFL
ALGVTLFVAQ RGTEPPLQAP EASGTKGAGA LSGALLVLVL TLLAGGVIVG SVDVVAVAFA
ESLGVTSATG VVLSAYALGS AISGLTFGVL DLPWRLHMML IVAVAGTFAT TLPFLAVGSI
WTLSVAVFFA GIFFAPTMIL VMTLIERTVP PSKLTEGMTW ALTGLTIGTA IGTFSSGLAV
EESGTTGGFL VAVAAGALAL VLTLVFAPLL ARAQARAERA QEEQAAEEAA GTAG