Gene Ndas_2798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2798 
Symbol 
ID9246649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3341392 
End bp3342705 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content73% 
IMG OID 
Productmetabolite/H+ symporter, major facilitator superfamily (MFS) 
Protein accessionYP_003680716 
Protein GI297561742 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.606232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0553764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACAGA AAGCCTCCTT CGGGCGGGTG GTCACCGCCA GCCTCATCGG CACGACGATC 
GAGTGGTACG ACTTCTTCCT CTACGGGTCG GCCGCCGCGC TCGTGTTCAA CCACGTCTTC
TTCCCCGAGT CCGACCCGCT GGTGGGCACC ATGCTGGCGT TCACCACCTA CGCGGTGGGG
TTCGTCGCGC GTCCGCTGGG CGGCCTGGTC TTCGGTCACT TCGGCGACAG GATCGGCCGC
AAACAGCTGC TGGTCATCAG CCTGCTGCTG ATGGGCGGAT CGACGTTCGC GATCGGCCTG
CTGCCGACCT ACGCGGTGAT CGGTGTGGCC GCTCCGCTGC TGCTGACGCT GCTGCGGGTG
GTCCAGGGCT TCGCCCTGGG CGGCGAGTGG GGCGGCGCGG TGCTGCTCGT CGCCGAGCAC
GGCGAGCCGC GGCACCGGGG GTTCTGGGCG TCCTGGCCGC AGGCGGGCGC TCCGGGCGGC
AACCTGCTGG CCACCGCCGT GCTGGCGGTG CTCGCCGTGG TGATGAGCGA CGCGGCCTTC
CTCTCCTGGG GCTGGCGCGT GCCGTTCCTG CTCTCGGGCG TGCTGGTGCT CATCGGCCTG
TGGGTGCGCC TGGCGGTCAG CGAGTCGCGG GTCTTCCGCG ACGCGCACGA GCGGGCGGCG
GCCTCGGCCC GGCCCGAGCG CGCCCCGATC CTGGGCGTGC TGCGCGACCA CTGGCGCGAG
GCCCTGGTGG CCATGGGCGT GCGCATGGCC GAGAACGTGT CGTACTACGT CGTCACCGCG
TTCATCCTGG TCTACGCGAC GCAGGAGGCG GGGATGCCCA ACGGCCAGGT GCTCAACGCG
GTGCTCGTCG CCTCGGCGGT GCACCTGGTG ACCATCCCCG CCTGGGGAGC CCTCTCGGAC
CGGATCGGCC GCAGGCCGGT GACCGCGCTG GGGGCCGCGG GGGCGGGGCT GTGGGTGTTC
GCCTTCTTCC CGCTCGTGGA CGCGGGGACC TTCTGGTCGG TGACCCTGGC CGTGACGGTC
GGCCTGGTCC TGCACGGGGC CATGTACGGC CCGCAGGCGG CCTTCTTCTC CGAGCTGTTC
AGCACCCGCG TGCGCTACTC GGGAGCGTCG GTGGGCTACC AGCTGGCGTC GATCGTGGCG
GGCGGGCTGG CCCCGCTGGT CGCGACGGCC CTGCTGGCCT CGTTCGGCAG CAGCGTGCCG
GTCTCGCTGT ACGTGGCCGC CATGGCCGCG GTCACCCTGG TCGCCGTCGC GGCCGCCCGC
GAGACCAGGG GCCGCGACCT GCGGGAGGTG GCAGTGTCCG AGAGAAGTGG GTGA
 
Protein sequence
MRQKASFGRV VTASLIGTTI EWYDFFLYGS AAALVFNHVF FPESDPLVGT MLAFTTYAVG 
FVARPLGGLV FGHFGDRIGR KQLLVISLLL MGGSTFAIGL LPTYAVIGVA APLLLTLLRV
VQGFALGGEW GGAVLLVAEH GEPRHRGFWA SWPQAGAPGG NLLATAVLAV LAVVMSDAAF
LSWGWRVPFL LSGVLVLIGL WVRLAVSESR VFRDAHERAA ASARPERAPI LGVLRDHWRE
ALVAMGVRMA ENVSYYVVTA FILVYATQEA GMPNGQVLNA VLVASAVHLV TIPAWGALSD
RIGRRPVTAL GAAGAGLWVF AFFPLVDAGT FWSVTLAVTV GLVLHGAMYG PQAAFFSELF
STRVRYSGAS VGYQLASIVA GGLAPLVATA LLASFGSSVP VSLYVAAMAA VTLVAVAAAR
ETRGRDLREV AVSERSG