Gene Ndas_1789 details

Gene Information

Locus tag: Ndas_1789
Symbol:
ID: 9245639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2186931
End bp: 2188469
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003679723
Protein GI: 297560749
COG category:
COG ID:
TIGRFAM ID:
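
The coordinate and length fields are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2186931
end_bp = 2188469

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1539 bp) and Protein Length (512 aa).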


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.407024
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCCATG CCCCTGGCGG AGCGACAGCG CCCGGTGACG CCGCTCCTCC CCTACGGCCG 
AACGTGATCG TCGCCGTCCT GGCCTTCGGC GGGATCGTCG TCTCGCTCAT GCAGACCCTG
GTCATCCCGC TCGTGCCCGT CCTGCCCGAC CTGCTGGGCG CCACGCCCGG GGACACCGCG
TGGGCGATCA CCGCCACGCT GCTCGCGGCC GCGGTCGCCA CCCCGACGGT CGGCCGCCTG
GGCGACATGT ACGGCAAGCG CCGCATGCTG CTGTTCAGCC TCGCCGTCCT CGTGGCCGGC
TCGGTGCTGT GCGCCCTGGC CCACAGCCTG GTGCCGATGG TCGTCGGCCG CGCCCTCCAG
GGCCTGGCGG CCGGGGTCAT CCCCCTGGGC ATCAGCATCA TGCGCGACGT GCTGCCCCCC
GAACGGCTCG GCGGGGCGAC CGCGCTGATG AGCGCCTCGC TCGGGGTGGG CGGCGCGCTC
GGGCTGCCCG CCGCCGCGCT CGTGGTGCAG GAGGCGGACT GGCACGTGCT GTTCTGGGTC
GCCGCCGGGC TCGCGACCGG GGCCGCCGTG CTGGTGCGCG CCCTGGTCCC CGCGTCCGGG
GTGCGCGCGG GCGGCAGGTT CGACCTGCCC GGGTCGGCGG GCCTGTCCGT GGCGCTGCTC
CTGCTGCTGC TCGCGGTCTC CAAGGGCTCG GACTGGGGCT GGGGCAGCGG TGTCCCCGCC
GCCATGCTCG CGGTCGCGGT CGCGGTGCTC CTGGTGTGGG GCTGGTGGGA GCTGCGTACG
CCGCACCCGC TGGTCGATCT GCGCGTCAGC GCGCGGCGCC AGGTCCTGCT CACCAACACC
GCGTCCCTGG TGTTCGGCTT CTCCATGTTC GCCATGTCCC TGGTCGTCCC GCAGCTGCTC
CAGATGCCCG AGGTCACCGG CTACGGCTTC GGACAGACGA TCCTGGTCGC GGGTCTGGTG
ATGGCGCCCA ACGGGCTGGT GATGATGGCG ATGTCCCCGG TCTCGGCGCG TATCTCCCGG
GCGAGGGGTC CCAAGACGAC CCTGATGGTC GGCGCACTGC TGGTCGCGCT CGGCTACGGC
CTGAGCCTGG TGTCCATGTC CGCCCTCTGG CAGCTCGTGA TCGCCTCCAC CGTCATCGGC
GCGGGCGTGG GCCTGGCCTA CGGGGCCATG CCCGCCCTGG TCATGGCGGC CGTGCCCCGG
ACCGAGACGG CGGCGGCCAA CAGCCTCAAC ACCCTGATGC GCTCCATCGG CACCTCCGTG
GCCAGCGCCG TCGCGGGCGT GGTCATCGCC AACGTGACCA TGACCGCGGG CGCGGAGGTC
CTCCCCGCCC GGGAGGCCTT CACCCTGCTC CTGGGCCTGG GCGCCCTCGC CGCGCTCGCC
GCCTTCGCCG TCGCCGCCTT CCTGCCCGGG CGGGGCAGGC CGGCCGACCT CGACCGGCCC
GAACCCCCGC CCGCGGCCCC CGACCAGCGC GCGGACCCCT CGGAGGACGA AGGGACCGGA
CGCGACCAGG CGCGGGAGCA CCCGAGCGGG CTCCGCTGA
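
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how GC content could be computed from such a listing; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed in this record.
first_line = "ATGACCCATG CCCCTGGCGG AGCGACAGCG CCCGGTGACG CCGCTCCTCC CCTACGGCCG"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))  # 60 0.75
```

The first 60 bases give a GC fraction of 0.75, in line with the 75% listed for the gene; the full sequence would have to be pasted in to check the overall figure directly.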
 
Protein sequence
MTHAPGGATA PGDAAPPLRP NVIVAVLAFG GIVVSLMQTL VIPLVPVLPD LLGATPGDTA 
WAITATLLAA AVATPTVGRL GDMYGKRRML LFSLAVLVAG SVLCALAHSL VPMVVGRALQ
GLAAGVIPLG ISIMRDVLPP ERLGGATALM SASLGVGGAL GLPAAALVVQ EADWHVLFWV
AAGLATGAAV LVRALVPASG VRAGGRFDLP GSAGLSVALL LLLLAVSKGS DWGWGSGVPA
AMLAVAVAVL LVWGWWELRT PHPLVDLRVS ARRQVLLTNT ASLVFGFSMF AMSLVVPQLL
QMPEVTGYGF GQTILVAGLV MAPNGLVMMA MSPVSARISR ARGPKTTLMV GALLVALGYG
LSLVSMSALW QLVIASTVIG AGVGLAYGAM PALVMAAVPR TETAAANSLN TLMRSIGTSV
ASAVAGVVIA NVTMTAGAEV LPAREAFTLL LGLGALAALA AFAVAAFLPG RGRPADLDRP
EPPPAAPDQR ADPSEDEGTG RDQAREHPSG LR
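
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below is an illustration, not the database's own method: it builds the codon table by hand rather than using a library, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last displayed lines of the gene sequence; both begin in frame
# because every full line holds 60 bases (20 codons).
first_line = "ATGACCCATG CCCCTGGCGG AGCGACAGCG CCCGGTGACG CCGCTCCTCC CCTACGGCCG"
last_line = "CGCGACCAGG CGCGGGAGCA CCCGAGCGGG CTCCGCTGA"

print(translate(first_line))  # MTHAPGGATAPGDAAPPLRP
print(translate(last_line))   # RDQAREHPSGLR
```

Both outputs match the start and end of the listed protein sequence, and the final TGA codon accounts for the one-codon difference between the gene length and three times the protein length.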