Gene Ndas_2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2166 
Symbol 
ID9246016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2586109 
End bp2587494 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content71% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003680094 
Protein GI297561120 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.203269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0459407 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCTCAC AGACTGGCCC GCGCGCACCG CGCACCGGTA AGAGCTGGAT CTCCGCCTGG 
GACCCCGAGG ACGAAGGGTT CTGGAACGGC GGCGGACGAC GCGTGGCCCG CCGCAACCTG
TGGGCCTCCA TCGCCTCCGA GCACATCGGC TTCTCGGTGT GGAGCATCTG GTCGGTACTC
GTGCTCTTCA TGATCCCCGA GCACGGCTTC TCCACCACCC CCGAGCAGAA GTTCCTGCTC
CTGTCGGTGG TCACCCTGGT CGGCGCGATC CTGCGCGTGC CCTACACCCT GGCCGTGCCC
GCCCTCGGCG GACGCAACTG GACGGTCATC TCCACCCTGA CCCTGGCCGT GCCCACCGTC
GCCGCCTTCT TCCTGGTCCG CGACCCCGAC ACCCCCTTCT GGCTGCTGCT GGTCCTGGCC
GCCACCGCGG GCGTGGGCGG CGGCAACTTC TCCTCCTCCA TGGCCAACAT CAACTCCTAC
TTCCCCGAAC GGGAGAAGGG GTGGGCGCTG GGCCTGAACG CGGGCGGCGG CAACATCGGC
GTGGCCACCG TCCAACTCGT GGGCCTGGCC GTCATCGCCC TGTTCACCAC CTCCGCCGGA
CACCTGGTCC CGCTGTTCTA CGCACCGCTG ATCCTGCTGG CCGCCTGGTG GGCGTACCGG
GCCATGAACA ACCTGGTCCA CGTGCGCAAC GACGTCTCCG CGCAGCTGTC GGCCGTCCGC
GACCGCCACT TCTGGATCAT GTCGCTGCTG TACGTGGGCA CCTTCGGCTC CTTCATCGGC
ACCGGGTTCG CCTTCGGCCT GCTGCTGCAG TCCCAGTTCG GGCTGGCGCC CGTGCAGTCC
GCCGCGATCG CCGTGCTCGG CCCGGTCATC GGCTCCCTGA TCCGCCCCGT GGGCGGCAGG
ATGGCCGACT CCCTGGGCGG GGCCCGCGTC ACCCTGTGGG TCTTCCTGGC CATGGCCGCC
TGCGCCGCCG TCCTGGTGCT CTCCGTCCAG GCCGCCCACC TGGCCCTGTT CATCGGCGCG
TTCGCGGTGA TGTTCGTCCT CACCGGCCTG GGCAACGGCT CCACCTACAA GATGATCCCC
TCCCTGTACG CCGCACGCGC CGAGGACGCC ATCGCCGCCG GGGAACCCCG GGAACAGGCC
CTGGCCCGCA CCAAGCGCGT GGCCTCCTCC GTGCTCGGCC TCATCGGCGC GGTCGGCGCC
CTGGGCGGGG TGGGAGTCAA CATCGCCTTC CGCGAGTCCT TCGCCGCCAC CGGCTCCACC
GCCCCGGCCT TCGTCGTCTT CGGCGCCTTC TACCTGGTGT GCGCCGCCGT CACCTGGGCG
GTCTACCTGC GCCGCCCCGC CGCCGCCCCG GTCGGCGCCG CCAGCGTGGA GAGCGCCGAC
CGATGA
 
Protein sequence
MASQTGPRAP RTGKSWISAW DPEDEGFWNG GGRRVARRNL WASIASEHIG FSVWSIWSVL 
VLFMIPEHGF STTPEQKFLL LSVVTLVGAI LRVPYTLAVP ALGGRNWTVI STLTLAVPTV
AAFFLVRDPD TPFWLLLVLA ATAGVGGGNF SSSMANINSY FPEREKGWAL GLNAGGGNIG
VATVQLVGLA VIALFTTSAG HLVPLFYAPL ILLAAWWAYR AMNNLVHVRN DVSAQLSAVR
DRHFWIMSLL YVGTFGSFIG TGFAFGLLLQ SQFGLAPVQS AAIAVLGPVI GSLIRPVGGR
MADSLGGARV TLWVFLAMAA CAAVLVLSVQ AAHLALFIGA FAVMFVLTGL GNGSTYKMIP
SLYAARAEDA IAAGEPREQA LARTKRVASS VLGLIGAVGA LGGVGVNIAF RESFAATGST
APAFVVFGAF YLVCAAVTWA VYLRRPAAAP VGAASVESAD R