Gene Ndas_1835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1835 
Symbol 
ID9245685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2242208 
End bp2243713 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content76% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679769 
Protein GI297560795 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0636096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.396011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCCGAA CCCGGACGGG GGAGACGACC GCGCGCCTGC TGCTCGCCCT GGTGTGCGCC 
GCGCAGCTCA TGGTGGTGCT CGACGTGTCG GTGGTCAACG TCGCCCTGCC CTCGATCCGC
TCCTCCCTGG GGTTCTCCGC GACGGGGCTG CCCTGGGTCG CCCACGCCTA CACCCTGGCC
TTCGGCGGCC TCCTGCTGCT GGGCGGGCGG CTGGCCGACC TGTACGGACA CCGGCGGGTG
TTCGCCGCCG GGCTCGCGCT GTTCTGCGCG GCCAGCCTGC TGGGCGGGGC GGCCCCCTCC
CCCGTCCTGC TGGTCACCGC CCGCGCGCTC CAGGGAGCGG GCGCGGCGGT ACTGGCCCCC
GCGACCCTGA CCATCCTGAC CGCCTCCTTC CCCGAGGGGC GGGCCCGCGT GCGTGCGCTG
GCCGCCTGGA CCGCGGTGAG CGTGGCGGGC GGCGCGGTCG GCAACCTCCT CGGCGGCGCG
CTGACCGAGG CGCTGTCCTG GCGGTCGGTC CTGCTGGTCA ACGTGCCCGT CGGCATCGCG
GCGCTGGCCA TGACCCCCTA CCTCCTGGGC CGGGAACGCC ACGACCGCGA CCCGGCCAAC
ACGGGTCAGC GTGACCGGGA GCGCCACGAC CGCGCCCCTC ACAGCACGGG TTCTCCTGGC
CGGGACCCGC GTGGCCGGAG CCCGGGAGAG GGCCGCGGCG GACGGATCGA CCTGCCCGGG
GCGGTGACCG CGACTGGCGG GACGGTCGCG CTCACCTACG GCCTCACCCG CACGGCGGAG
CACGGCTGGG GGGACCCGGC CGCGGTGGCG GTGCTGGCCG CGGGCGTCCT CGCCCTGGCG
CTGTTCGCCG CGGTGGAGTC CCGTGCGCCC GCTCCCCTCC TGCCGCCGGG GCTCCTGCGC
CGCCGCGCGG TCTGGGCGGG CAACGCCATG GTGTTCCTGG CCGGAGTCTG CTTCCAGGTG
CCCATGTGGT ACTTCCTCAC CTTCTACATG CAGGACGAGC TGGGCTTCGG GCCGCTGCTG
ACCGGCCTGG GCTTCCTGCC GCACACCCTG GTCACCATGG CCGTGGGCTG GCTGGTCACC
CCGTGGCTGA TGGGGTTCGT GCGGGCGCGA ACGCTGATCG GGGTCGGCTG CCTCACCGCC
GCGGCGGGCT TCGCCTGGCA GGCCGCGGCG GTGACCGAGC AGACCTACGC CGCGGCGGTG
CTGGGGCCCG CGGTCCTCAT GTCCGTGGGC GGCGGCCTGT TCACCACTCC GCTGACCGCC
GTCGTGACCT CGGGCGCCGC CCCCGGGGAC GCGGGGGCCG TCTCGGGCCT GATGAACGCG
GCCAAGCAGA CGGGCGGCGC CCTGGGCCTG GCCGCGCTGA TGACGACGGC CGTCTCGGGG
CACGCACCCG AAGGGGAGGC GTACGGCTTG GTCTTCGGCC TGCTCGCGGC CGTGCAGCTC
GTGGCGGCCG CCCTGACACC GGTACTGCCG CGCGAAACGC GACGGGATCC GTCCGAGGGC
GCGTGA
 
Protein sequence
MPRTRTGETT ARLLLALVCA AQLMVVLDVS VVNVALPSIR SSLGFSATGL PWVAHAYTLA 
FGGLLLLGGR LADLYGHRRV FAAGLALFCA ASLLGGAAPS PVLLVTARAL QGAGAAVLAP
ATLTILTASF PEGRARVRAL AAWTAVSVAG GAVGNLLGGA LTEALSWRSV LLVNVPVGIA
ALAMTPYLLG RERHDRDPAN TGQRDRERHD RAPHSTGSPG RDPRGRSPGE GRGGRIDLPG
AVTATGGTVA LTYGLTRTAE HGWGDPAAVA VLAAGVLALA LFAAVESRAP APLLPPGLLR
RRAVWAGNAM VFLAGVCFQV PMWYFLTFYM QDELGFGPLL TGLGFLPHTL VTMAVGWLVT
PWLMGFVRAR TLIGVGCLTA AAGFAWQAAA VTEQTYAAAV LGPAVLMSVG GGLFTTPLTA
VVTSGAAPGD AGAVSGLMNA AKQTGGALGL AALMTTAVSG HAPEGEAYGL VFGLLAAVQL
VAAALTPVLP RETRRDPSEG A