Gene Ndas_3215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3215 
Symbol 
ID9247072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3845538 
End bp3847022 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content67% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003681129 
Protein GI297562155 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGAA CGTCGACCGC CGGGCCCGCG ACCAGGGCCC TCATCCCGGT GCTCGCCTTC 
ACGGGCATCG TGGTTTCAGT GATGCAGACC ATGCTCATTC CTCTGATCAA GGATCTGCCG
CAACTCCTGG GCACCGAGCC TCACAACGCG ACCTGGGTCA TCACCTCGAC GCTTCTCTCA
GGCGCCGTCG CCACGCCAGT CATGGGACGC CTGGGTGACC TCCACGGCAA GCGCCGCATG
CTGATCCTCA GCCTCGCGGT GATGGTCGTC GGCGCACTCG TCAGCGCCGT CACCGACGAT
CCGCTTGTGA TGATCACGGG CCGGGCCCTC CAGGGCTTCG CGATGAGCGC GATACCCCTC
AGCATCAGCT TGGCGCGTGA CATAGTTCCC CGCGAAGAAC TCGGCTCTGC GATGGCCCTG
ATCAGCTCCT CCATGGGCGT CGGGGGGAGT CTCGCCCTGC CCGCCTCGGG CTTGGTCGCC
CAGCACGCCG ACTGGCACGC CCTCTTCTAC GGCGCCGCAG GTCTTGGCCT TTTGTGCATC
GCGCTGATCC TCATCGTCGT CCCCGAGTCA CCGGTTTCCA CACACGGCAC CTTCGACCTC
CTGGGCGCGG TCGGCCTGTC CGCCGCCCTC ACCCTCTTCC TGCTGCCGGT CACCAAGGGA
AGCCACTGGG GCTGGACCTC CAGCACCACC CTCGGACTGT TCACCGCGGC GGTCGTCGTG
CTCATCTTGT GGGGCGTGCT GGAACTGCGC CTCGACGCAC CGCTGGTGGA CCTGCACACG
ATGGCCCGTC CCGCGGTGCT TTTCACCAAC CTCGCCTCGA TCATGGTCGG TGCCTCATAC
CTGGTCGTCT CGATGGTCCT TCCCCAACTG CTCCAGTTGC CGAAGGCCAC CGGATACGGC
CTCGGCCAGT CAATGGTGAC CGCGGGCCTG TACCTGGCAC CGCTCGGCCT GACCATGATG
CTCACGGCAC CTATCTACGC GCGGCTGTCC GCGAGGCATG GCCCCAAGAG CACCTTGATC
CTCGGCATGT CGATCGTTGC GATCGGCTAC GGAGTCGGCC TCAGCCTCAT GAACGCACCC
TGGCAAAGCC TCATCATCAC GGCGGTCCTG GGTGTGGGCA TCGGTCTCGC CTACTCCTCC
CTACCCGCCC TGATCGTCGG CGTGGTCCCC GCCACGCAGA CGGGCTCGGC CAACGGCCTC
AACACGCTGA TGCGCTCGAT CGGCTCCTCG CTCTCCAGCG CCGTCATCGG CGGGATCCTC
TCCACCACCG CACACCAGTT CAACGGCGTT CCCGTCCCCA GCATGTGCGG CTTCCGCATC
TCCTTTCTGA TAGCGACAAG CGCAATGGCG ATCGGCCTGT TCACAGCCCT CTTCCTGCCC
GGGCCCGCCC GGTCGGCCGG GGCACCACAC CGACGGCGGG CAAACCCCCG CCCGGTCGCA
CACGCGCGGG AAGTGACAGG TGGCCCTGCA TCGACAGGAG AGTGA
 
Protein sequence
MTRTSTAGPA TRALIPVLAF TGIVVSVMQT MLIPLIKDLP QLLGTEPHNA TWVITSTLLS 
GAVATPVMGR LGDLHGKRRM LILSLAVMVV GALVSAVTDD PLVMITGRAL QGFAMSAIPL
SISLARDIVP REELGSAMAL ISSSMGVGGS LALPASGLVA QHADWHALFY GAAGLGLLCI
ALILIVVPES PVSTHGTFDL LGAVGLSAAL TLFLLPVTKG SHWGWTSSTT LGLFTAAVVV
LILWGVLELR LDAPLVDLHT MARPAVLFTN LASIMVGASY LVVSMVLPQL LQLPKATGYG
LGQSMVTAGL YLAPLGLTMM LTAPIYARLS ARHGPKSTLI LGMSIVAIGY GVGLSLMNAP
WQSLIITAVL GVGIGLAYSS LPALIVGVVP ATQTGSANGL NTLMRSIGSS LSSAVIGGIL
STTAHQFNGV PVPSMCGFRI SFLIATSAMA IGLFTALFLP GPARSAGAPH RRRANPRPVA
HAREVTGGPA STGE