Gene Ndas_3276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3276 
Symbol 
ID9247138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3911315 
End bp3912697 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content70% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003681188 
Protein GI297562214 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC AGGCACCCCC CGCGCCAGCC GGACAGGGGG AGCGCAGCAC CCGGGACCTC 
ACCAAGGCCT CCGTGTCCGG GTGGCTGGGG ACCGCCATGG AGTTCATGGA CTTCCAGCTC
TACTCCCTGG CCGCCGCCCT GGTGTTCAAC CAGATCTTCT TCCCCGACCT CAACCCCGCG
GTCGGCCTGA TCGCGGCGAT GGGCACCTAC GGTGTCGGGT ACGTCGCCCG GCTCGTCGGG
GCGGTCTACT TCGGCCGCAT GGGCGACCGC CTCGGCCCCA AGAAGGTCCT GTTCATCACC
GTCGCCCTGA TGGGCGTCTC CACCACCCTG ATCGGCGCCC TGCCCACCTA CCAGCAGGTG
GGCCTGCTCG CCCCGATCCT GCTCGTGGGC CTGCGGCTGA TCCAGGGCTT CGGCGCGGGC
GCCGAGATCG CGGGCGCCAC CGTGATGCTG GCGGAGTACG CCCCGGCCAG GCGTCGGGGG
TTCATCGCCT CCCTGGTGTG CCTGGGCACC AACTCCGGCA CCCTGGGCGC CTCCGCGATC
TGGGCGGTCC TGGTGTTCGC GCTCTCCGAG GAGCAGCTGC TGTCCTGGGG CTGGCGCCTC
CCCTTCCTGG CGAGCTTCCT CCTGCTGCTC CTGGCGCTGT GGATCCGCCT CTCCGTCAAG
GAGAGCCCGG TCTTCGAGCA GCGCGAGGAC ATCGTCGACG GTGTGGCGAT GTCCCGGAGC
GAACTGGCCG CGGCCGCGGT GAAGGAGGAC AGGAGCGGAC TGGAGACCGC CCTGCACCAG
CGCAAGGGCC GCGCGTTCCT GCTCGCCCTC GGCCTGCGCT TCGGCCAGGC GGGCAACTCC
GGCATCGTCC AGACCTTCCT CGTCGGCTAC CTCAGCGCCA ACCTGATGCT CAACGACGCG
GTCGGCACGT CCGCGATCGT CTACGGCTCC CTGCTCGGCT TCGTCACCGT CCCCCTGGTC
GGCGTGCTCG GTGACCGCTT CGGACGCCGT CCCGTCTACC TCTTCCTGAC CGTGGCGAGC
ATGCTGTTCG CCGTGCCCAT GATGCTGATG ATCGAGACCG GCGACACCGT GCTCGTGACC
GTCGCGATGG TCGTCGGGCT CAACCTGTCG GTCCTGGGCC TGTTCTCCGT GGAGAGCGTC
ACCATGGCCG AGCTGTTCGG GGCGCGCACC CGGTTCACCC AGCTGGCCCT GGCCAAGGAG
ATCGGCGGCG TCCTGGCCAC CGCGATCGGC CCGGTGCTGG CCGCCACGCT CACCGCCGCG
ACCGGCTCCT GGTGGCCGCT GTCGGCGATG ATCATCGCCT ACTCCCTGAT CACCCTGGCC
TCCGCCTACC TGTCCCCCGA GGTGCGCGGA CGCGACCTGG TCCGACTGGA GGACGCCGTA
TGA
 
Protein sequence
MTEQAPPAPA GQGERSTRDL TKASVSGWLG TAMEFMDFQL YSLAAALVFN QIFFPDLNPA 
VGLIAAMGTY GVGYVARLVG AVYFGRMGDR LGPKKVLFIT VALMGVSTTL IGALPTYQQV
GLLAPILLVG LRLIQGFGAG AEIAGATVML AEYAPARRRG FIASLVCLGT NSGTLGASAI
WAVLVFALSE EQLLSWGWRL PFLASFLLLL LALWIRLSVK ESPVFEQRED IVDGVAMSRS
ELAAAAVKED RSGLETALHQ RKGRAFLLAL GLRFGQAGNS GIVQTFLVGY LSANLMLNDA
VGTSAIVYGS LLGFVTVPLV GVLGDRFGRR PVYLFLTVAS MLFAVPMMLM IETGDTVLVT
VAMVVGLNLS VLGLFSVESV TMAELFGART RFTQLALAKE IGGVLATAIG PVLAATLTAA
TGSWWPLSAM IIAYSLITLA SAYLSPEVRG RDLVRLEDAV