Gene Ndas_1490 details


Gene Information

Locus tag: Ndas_1490
Symbol:
ID: 9245340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1825723
End bp: 1827021
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003679426
Protein GI: 297560452
COG category:
COG ID:
TIGRFAM ID:
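
The gene and protein lengths listed above follow from the start and end coordinates. A minimal Python sketch of that arithmetic is given here, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1825723, 1827021)
print(length)                     # 1299
print(protein_length_aa(length))  # 432
```

Both values agree with the Gene Length and Protein Length fields of this record.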


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.81003
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0711824
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGAGC GGACCGCCCC CACGTCCTCG CACCCCGCGC GTGCGGCCGT GGCGGCCTTC 
GTCGGCACCA CCATCGAGTG GTTCGACTTC TACGTCTACG CCACGGCCGC CAGCCTGGTG
TTCGGCACCC TGTTCTTCCC GCCCGGCACC GACCCGGTGA TCGGCTTGAT GGCCTCCTTC
GCCACCTTCT CGGTGGGCTT CTTCGCCCGA CCGCTGGGCG GCCTGGTCTT CGGCCACTTC
GGCGACCGCC TGGGCCGCAA GTCCGCGCTG GTCGTCACCC TGTTGATGAT GGGCACGGCC
ACGTTCTGCG TGGGTCTGCT GCCCACCTAC GAGCAGGCGG GGTTCCTCGC CCCGGCGCTG
CTGGTCCTGC TGCGCTTCGT CCAGGGCATC GCCGTGGGCG GGGAGTGGGG CGGCGCCGTC
CTGATGGCGG TCGAGCACGC CCCCGAGGAC CGCAAGACCT TCTACGGCTC CTTCGCCCAG
CTCGGCAACC CGGCCGGTGC GCTGCTGGCC ACGGGCTCGT TCGGGCTCAT CGCCGCCTGG
GACGCCGACC TGCTCCACAC CTGGGGCTGG CGCCTGCCCT TCCTCGCCTC GGTCCTGCTG
GTCCTGGTGG GCCTGTTCAT CCGCCTGAAG GTGGAGGAGT CGCCGGTCTT CGAGGCCATG
CGCGAGGACA CCGACCAGCC GCGGGAGCTG CCGCTGCGCG AGGCCGTGCG CGGTTCCTGG
CGCCCGCTGC TGCTGGGCAT CGGCGTCCTG CCGGTGGCCG TCGGCGGCTA CTACGTCGTC
ACCACCTTCC TCCAGGCCTA CGGCGTCACC GAGGTCGGCA TCAGCGAGCA GGTCATCCTC
AGCGGCCTGA GCCTGGCCGC GTTCGTCGAA CTCGTCGCGA CCCTGGGCGT GTCCTGGCTG
GGCGACCGCT TCGGCACCGT CCGCGTCGTC ACCATCGGAC TGGTCGGCGT CATCGTGCTG
GCGCTTCCCC AGTTCCTGGT GCTGGAGACC GGCAGTACCG TGCTGATCTT CGTGGTGCTG
TGCGTGATGC GCCTGGCCAT GGCCGCGGTC TACGGGCCCA TCGCCCGCGT GCTCGCCCAG
ATGTTCCCGC CGCGCACCCG CTACACCAGC ATCTCCATCG CCTACCAGGT CGCGGGGGCG
ATCTTCGGCG GCCTGTCGCC GATCGTGTGC ACCGCCCTGC TCGCCGCCAC CGGCAGCATC
CTGCCGGTGG CGGGCCTGCT CATGGCCATG GCCGTGGTGA GCATCCTGTG CCTGGCCCGG
GCGCCGCGCC ACCGCGACAG CGACCTCGCC ACCGCCTGA
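
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction such as the one reported in the GC content field. The helper names are hypothetical, and the example input is only the first three blocks of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from block-formatted sequence text."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence, used only as a short example.
example = "ATGTCAGAGC GGACCGCCCC\nCACGTCCTCG"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `clean_sequence` should give a length that matches the Gene Length field, which is a quick check that no block was lost in copying.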
 
Protein sequence
MSERTAPTSS HPARAAVAAF VGTTIEWFDF YVYATAASLV FGTLFFPPGT DPVIGLMASF 
ATFSVGFFAR PLGGLVFGHF GDRLGRKSAL VVTLLMMGTA TFCVGLLPTY EQAGFLAPAL
LVLLRFVQGI AVGGEWGGAV LMAVEHAPED RKTFYGSFAQ LGNPAGALLA TGSFGLIAAW
DADLLHTWGW RLPFLASVLL VLVGLFIRLK VEESPVFEAM REDTDQPREL PLREAVRGSW
RPLLLGIGVL PVAVGGYYVV TTFLQAYGVT EVGISEQVIL SGLSLAAFVE LVATLGVSWL
GDRFGTVRVV TIGLVGVIVL ALPQFLVLET GSTVLIFVVL CVMRLAMAAV YGPIARVLAQ
MFPPRTRYTS ISIAYQVAGA IFGGLSPIVC TALLAATGSI LPVAGLLMAM AVVSILCLAR
APRHRDSDLA TA