Gene Ndas_0780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0780 
Symbol 
ID9244625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp959358 
End bp960620 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content75% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003678730 
Protein GI297559756 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.347678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCTT TGGACACGCT GCGACGCATG CACCACATCG CCGGGTGGCC GCTGCTCCTG 
AGCACCTTCC TCGCCCGCCT GCCCATCTCC ATGTCCCTGA TCGGCCTGCT CACCCTGGTC
ACCACCACGA CCGGGAGCGT GGCCGCCGCC GGTGCGGTGA CCGGCGCCTT CGCGCTCGGG
GAGACCGTGG GCGGGCCCGT CATCGCCCGC TTCGCCGACC GGCGCGGGCA GCGCGTACCG
GTGCTGGTCA CCTCGCTCGT GGACGCCGTG CTCATCACCG TGCTCGTGAC CGCCGTCCTG
GCGGAGGCCT CCGCCCCGGT CCTGGCCGTG CTCGCCGCCG CGGCGGGTCT GTGCATGCCC
CAGATCGGCC CCATGGCGCG TACGCGCTGG GTGGTGCTCA TCCGTCGGGG GCCGCACCGG
GGCGAGGAGC GCGAGCGGTC GGTGTCCGCG GCGATGTCGG TCGAGGGCGT GCTGGACGAG
GCCGCGTTCG TCCTGGGCCC CGCCCTGGTG GGCCTGCTCA CCGTGACGCT CTCCCCGGCG
GCGAGCGTGC TGGGCGCGGC CGTGCTCATC GGCGTGTTCG GGACCGTGTT CGCCCTGCAC
CCCACCGCGC CGCCCGGAAC CCCGCCGGTG CGCGGCGCCG GGGGCCGCAT CGCCACGCCC
GCGCTGCTGG TGCTGGCCGT CCCCATGTTC TGCCAGGGCC TGTTCTTCGG CGGCATGTCC
ACCGGCGTGA CCGCCTTCTC CGCCGCCTCC GGGCACGGGG ACCTGTCCGG GCTGATGTAC
GCGGTGATGG GCACCAGCAG CGCCGTCGCG GGCCTGCTGA TGGCCTCCGT CCCCGTGGGC
TTCCCGCTCA CGGCGCGGGC CCGGATCGCG GCGGGCGCGC TGTTCCTGCT CACCCTGCCG
CTGTACGCGG CCCACGGCGC GGCCGCCCTG GCGGTGGCCA TCTTCGTCCT GGGCGCCGCC
ATCGGCCCGC ACATCGTGAG CCTGTTCGGG CTGATCGAGA GGGCCGCCCC GGCCAGCCGC
CTGTCGCAGT CGATGGCCGT CATCCTGAGC TGCCTGATCC TGGGCCAGGC GCTGGGGTCG
TCGGTCGCGG GCGTCCTCGC CGACGCGCAC GGCCACCAGG GGGCGTTCGT GCTGGCCACG
CTGGGCGGCC TGGTCTCCTT CGCGGTGACC GTCCTGGTGA TGCGCGCCCG CTGGTACGTG
CGCGGCGAAC CGTCCTCTCC GGCGACGGTG ACACGTTCCG CTCCGGGCGT CGGAGAGGGG
TGA
 
Protein sequence
MSPLDTLRRM HHIAGWPLLL STFLARLPIS MSLIGLLTLV TTTTGSVAAA GAVTGAFALG 
ETVGGPVIAR FADRRGQRVP VLVTSLVDAV LITVLVTAVL AEASAPVLAV LAAAAGLCMP
QIGPMARTRW VVLIRRGPHR GEERERSVSA AMSVEGVLDE AAFVLGPALV GLLTVTLSPA
ASVLGAAVLI GVFGTVFALH PTAPPGTPPV RGAGGRIATP ALLVLAVPMF CQGLFFGGMS
TGVTAFSAAS GHGDLSGLMY AVMGTSSAVA GLLMASVPVG FPLTARARIA AGALFLLTLP
LYAAHGAAAL AVAIFVLGAA IGPHIVSLFG LIERAAPASR LSQSMAVILS CLILGQALGS
SVAGVLADAH GHQGAFVLAT LGGLVSFAVT VLVMRARWYV RGEPSSPATV TRSAPGVGEG