Gene Ndas_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1014 
Symbol 
ID9244860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1240458 
End bp1241792 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003678963 
Protein GI297559989 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.506653 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACC TCGACCCAGA GGCGACCGAG GAGCGCCCGG CACGCGGCGG GCCGGTGGTG 
AGCGGTCGCG AGGCGGGCCG GATCGCTTTC GCGGCCTTCG TCGGCACCGC ACTGGAGTGG
TACGACTACT TCCTCTTCGG TACGGCCGCC GCGCTCGTGT TCAACCGGCT GTACTTCACG
ACGCTGGACG CCACGGCCGC CACCCTGGCC GCGTTCGCCA CCTTCGGCGT CGGCTTCGTC
GCGCGTCCGC TGGGCGCGGT GCTGTTCGGC TGGATGGGCG ACCGGATCGG ACGGCGACCA
GCGCTGCTGA TCACCGTGGT CTCCATCGGT GTGGCCACCG GCCTCATCGG CCTGCTGCCC
GACTTCGCCT CGATCGGCAT CGCCGCTCCG CTGCTGCTGG CCCTGCTCCG GCTCGTGCAG
GGCGTCGCGG TCGGCGGCGA GTGGGGCGGC GCGGTGACCA TCGCCGTCGA GCACGCCCCG
CCGGAGAAGC GCGGGCGCTA CGCGGCCCTG CCGCAGATCG GGTCGCCCGT GGGCACCCTG
CTCTCCTCGG GCGCGTTCTC GCTGGTCCTG CTCCTGCCCG CCGAGCAGTT CGACTCCTGG
GGCTGGCGCC TGCCGTTCCT CGCGGCCTTC CCGCTGCTGC TGGTCGCGGT CTACATCCGC
CAGAAGGTGG AGGAGTCGCC GGTCTTCGAG GAGATGGAGA AGCAGGAGGC GCGCTCCAAG
GTCCCCGCGG TCGACGTGTT CCGCCACGCG TGGGGCAGGC TCCTCATCGC GATCGCGTCC
GCCATGCTGG GCGTCGGCGG CTTCTACGTC ATGACGACGT TCGCCATCAG CTACGGCACC
GACACCCTCG GACTGTCGCG CAGCCTCATG GTCAACGCCA CCCTGGTCGC CGCGGTGGTG
CAGATCGGCG TGATCGTCTA CTTCGGGCGC CTGGCCGAGA AGCTCGGCCC CGGGCGCGTC
ACCATGTGGG GCGGTATCGC CACGGCGGTG ATCGCCTTCC CGGTGTTCTG GCTGATCGAC
ACGACCTCGC CGGTGCTGGT GGTGCTGGCG GTGGCCGGGG GCGTGGGCTT CCTGTCCATC
GCCTACGCGG TCTCCGGCGC GCTGCTCACC GAGCTGTTCC CGGCGAACCT GCGCTACAGC
GGCGTCGCGC TGGCCTACAA CCTGGCCGGC GCCCTGAGCG GCTTCCTGCC GTTCATCGCG
ACCGCCCTGC TGGAGAGCGC GGACGGACGC TCCTGGGTCG CCTCGGTGCT GTTCCTGGGG
ATCGCCCTGG TCACGGCGGT CGGCGGCTTC TACGGCGAGC GGCTGCGCGT CCGCGACGAC
GTGGTGGTGC GCTGA
 
Protein sequence
MSDLDPEATE ERPARGGPVV SGREAGRIAF AAFVGTALEW YDYFLFGTAA ALVFNRLYFT 
TLDATAATLA AFATFGVGFV ARPLGAVLFG WMGDRIGRRP ALLITVVSIG VATGLIGLLP
DFASIGIAAP LLLALLRLVQ GVAVGGEWGG AVTIAVEHAP PEKRGRYAAL PQIGSPVGTL
LSSGAFSLVL LLPAEQFDSW GWRLPFLAAF PLLLVAVYIR QKVEESPVFE EMEKQEARSK
VPAVDVFRHA WGRLLIAIAS AMLGVGGFYV MTTFAISYGT DTLGLSRSLM VNATLVAAVV
QIGVIVYFGR LAEKLGPGRV TMWGGIATAV IAFPVFWLID TTSPVLVVLA VAGGVGFLSI
AYAVSGALLT ELFPANLRYS GVALAYNLAG ALSGFLPFIA TALLESADGR SWVASVLFLG
IALVTAVGGF YGERLRVRDD VVVR