Gene Ndas_3371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3371 
Symbol 
ID9247236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4027486 
End bp4029027 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content71% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003681282 
Protein GI297562308 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATAG AAGCCACGGA TCTGCAGGGC GCGCGGAGTG CTCCCGCCCC GTTGACCAGG 
GGACAGGCGG TGGCCACACT CGTCGCCGTC GCCCTGTCCA GCGTGATGCT GCCGCTCGCC
GTCACCGCTC CGGCCGTGGC GCTCACCCAG CTCGCGGCCG ACCTGAACGC CAGCGTCGGC
GAGGCCCAGT GGGTCCAGAA CGCCTACAAC GTCACCTTCG CGGCGTTCAT GCTCGCCGCC
GGCGGACTCG CCGACCGCTT CGGCAGGCGC CGGGTCCTCG TCATCGGCCT CGTCGTCTTC
ACCGCGATGG CCACGGTGAT CGGCCTCTCC TCCAACATCC TCGTCATCGA CGTCGCCCGC
GCGGTCCAGG GCATCGGCGC CGCGGGCATC ATGACCAGCG GATCGGCGAT CCTCGCCGAC
TCCTTCCGGG GAGCGGCCCG GGCCCGGGCC TTCGGCCTCC TGGGGACCTC CTTCGGCTTC
GGACTGGCCA TGGGGCCCTT CGTCGCCGGG CTGATGGTCA ACTTCCTGGA CTGGCGCATG
GTGTTCCTGA TGAACCTCGC GTTCGCCGCC GTCGTCCTGC TCCTGGTCCG CTCGATCCGT
GAGTCGAGCG ACCCCGGTTC CACCTCGGTG GACTGGGGCG GCGTCATGAC CTTCAGCACC
AGCCTGTTCC TGCTCTCCCT CGCCTTCGTC CAGGGCGCCG AGGCGGGTTG GCTCAGCCTG
AGCGCGATCG GGTCGGCGGT CGGGTTCCTC GTCTTCCTCG CCGCCTTCGT CGTGGTGGAG
TCACGGGTCC GGCGCCCGAT GTTCGACCTC TCGCTGTTCA AGCGGCCCAC GTTCGTCGTG
GTCGTCTGCC AGCCCTTCAC CATCACCTTC GGCTTCGTGG TCCTGCTCGT CTACCTGCCG
CCCTTCTTCC AGGGCGTCGG CGGGTTCGGC GCCGCCGAGG CCGGGGCCCT GCTCCTGCCC
CTGACCCTGC CGGTGCTCGC CCTGCCGATG CTGGCCGGGC AGCTCGCCGC CAGGCTCCCC
CTGCGGGTCA TGCTCGCCAC CAGCTCCCTG CTCATCGCGG GCGGCTCCCT GTGGTTGATG
ACCCTCCAGC CGGGCCAGCA CTGGACGGCG CTCGCCGCTC CCCTGGCCCT GTTCGGCACG
GGGGTGGGAA GCGCCTTCGG CGTCATGGAC AACGCGGCGC TCAGCTCCGT GGAGGTCGAG
CGGGCCGGGA TGGCCTCGGG CATCTTCAAC ACCATGCGCA TCACCGGGGA GAGCGTGGCG
ATCGCCGGAG CCGGGTCCGT GCTGGCGACC CTGTCCCTGA ACACGCTCGA CCTCCCCTTC
GCCGACCCCG AGCAGGAGCG GACCCTGGCG GGTGAGGCCA CCCAGGGCCG ACTGGAAACG
GCGCTGGGCC AGTTCGCCGA GGCGGACCGC TCGACCGCGC TGGACGCCGT CTCGGCCAGC
CTCACCTCCG CGATGCACAC CACGTTCCTG GGGCTGGCGC TCCTCGCCCT GGCCGGCGCG
GTGGTCACGT TCCTCGTCGT CAGGGAACGC GAGCTCCGCT AG
 
Protein sequence
MAIEATDLQG ARSAPAPLTR GQAVATLVAV ALSSVMLPLA VTAPAVALTQ LAADLNASVG 
EAQWVQNAYN VTFAAFMLAA GGLADRFGRR RVLVIGLVVF TAMATVIGLS SNILVIDVAR
AVQGIGAAGI MTSGSAILAD SFRGAARARA FGLLGTSFGF GLAMGPFVAG LMVNFLDWRM
VFLMNLAFAA VVLLLVRSIR ESSDPGSTSV DWGGVMTFST SLFLLSLAFV QGAEAGWLSL
SAIGSAVGFL VFLAAFVVVE SRVRRPMFDL SLFKRPTFVV VVCQPFTITF GFVVLLVYLP
PFFQGVGGFG AAEAGALLLP LTLPVLALPM LAGQLAARLP LRVMLATSSL LIAGGSLWLM
TLQPGQHWTA LAAPLALFGT GVGSAFGVMD NAALSSVEVE RAGMASGIFN TMRITGESVA
IAGAGSVLAT LSLNTLDLPF ADPEQERTLA GEATQGRLET ALGQFAEADR STALDAVSAS
LTSAMHTTFL GLALLALAGA VVTFLVVRER ELR