Gene Ndas_2910 details

Gene Information

Locus tag: Ndas_2910
Symbol:
ID: 9246762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3478491
End bp: 3479810
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003680826
Protein GI: 297561852
COG category:
COG ID:
TIGRFAM ID:
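
The coordinate and length fields above are consistent with one another. The following Python sketch shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, although they agree with the listed 1320 bp and 439 aa). The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3478491
end_bp = 3479810


def gene_length(start, end):
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1320 439
```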


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.346093
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.416861
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACCG AACCCTCCGC CGCGCCGCCG CGCACCGAAC GCCTCGGCGG GCTCTTCCAC 
CGCTTCTGGG CGGCCGGAAC CCTCACCAAC CTCGGCGACG GCATGCTCGC CACCGCGCTG
CCGCTCATCG CCGCCACCCT GACCCACGAC CCGCTCGCCG TCTCCGGCCT GGTGGTGGCG
CGCTTCCTGC CCTGGCTGCT CGTCGCGCCG TTCGCGGGCG TGCTGCTGGA CCGGGTCGAC
CGCCTCCGCG CCATGACGGT CTCCAGCACG GTCGCCGCGA TCGCCGTCAC CGCCCTGGTC
GTGCTGATCG CCACCGGCGG CGCCACCCTG TGGGCCCTGT ACGCCACGAT CTTCGTGGTG
GTCTGCTGCG AGACGGTCAC CGACCCCGTC ACCCGGATCA CCGTGGCGCG GATCGTGCCC
GGCCGCCTGC TGGACCGCGC CAACAGCAGG CTGGAGGGCG GCCGCCTGGT CGCCCAGGAC
TGTGTGGCCA CCCCCGTCGC CGGTGTGCTC TTCGCGGTCG CCGCCGCCCT GCCCGTGGCC
GGGACGGCCC TGTCCTACGC CCTGTGCGCC GTGCTCGTGG TGACCGTCGC CCTCATGCTG
CGGCGGGTTC CCGTCCCGGC CGGGTCCACC GGGGCCGCCG AGTCCAACGG GGGCGCGGAG
AGCGGTGAGG GCGCGGGGTC CGGCCAGGGC GTCCTGCGCT CCCTGCGGGA GGGTTTCGCC
TACGTCTTCG GCCGGCCGCT GACCCGGGGC CTGGCGTTCA ACAACGCCGG GTCCATGATC
GGTATCCAGA TGGCCACCTC GGTGCTGGTC CTCTACGCCC AGGAGGTGTT GGGCGTTCCC
GAGGCCCTGT ACGGGCTCTT CATGGCGTCG ATCGCGGTCG GTGGTGTTCT GGGCTCCGTG
CTCGCCGCCC GGTTCGTCGC GGGCCTCGGC CGCAGGACGG TCATGATGGG CGGCTACCTG
GGGATGGGTG CCTGCCTGTT GGCGGTGGGC CTGGTGCCCG ACGCCTGGTC CGCCGCCCTC
GCCTGGGGGC TGATGGGCCT GTTCCTGACG GTGACCAACG TGTCCGGCTC CCTGTTCTTC
CAGGCCGTGG TCCCCTCCCA CGTGCGCGGC CGGGCGGCCG CCGCCTTCCG CATGGCGGGC
TGGGGGCTGT CCCCGGTCGG CGCGCTGCTG GGGGGTCTGC TGGGCCGGAT CGACCTCGCC
CTGCCCTTCC TGGTCGGCGG GCTGGTCATG GTGGTCACGC CCCTGGTCTT CCGCAGGTCC
GTCGCCGAGT GCGCGCGGCT GTCCGACGAG GCCGCCGCGT CCGCCGCCGG CGACCGCTGA
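
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence printed in the 10-base blocks used here. The following is a minimal Python sketch; the helper name gc_content is hypothetical, and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_content(seq):
    """Fraction of G and C bases in a sequence; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# Usage: paste the full gene sequence as one string; only the first line is shown here.
first_line = "ATGGCCACCG AACCCTCCGC CGCGCCGCCG CGCACCGAAC GCCTCGGCGG GCTCTTCCAC"
print(f"{gc_content(first_line):.0%}")
```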
 
Protein sequence
MATEPSAAPP RTERLGGLFH RFWAAGTLTN LGDGMLATAL PLIAATLTHD PLAVSGLVVA 
RFLPWLLVAP FAGVLLDRVD RLRAMTVSST VAAIAVTALV VLIATGGATL WALYATIFVV
VCCETVTDPV TRITVARIVP GRLLDRANSR LEGGRLVAQD CVATPVAGVL FAVAAALPVA
GTALSYALCA VLVVTVALML RRVPVPAGST GAAESNGGAE SGEGAGSGQG VLRSLREGFA
YVFGRPLTRG LAFNNAGSMI GIQMATSVLV LYAQEVLGVP EALYGLFMAS IAVGGVLGSV
LAARFVAGLG RRTVMMGGYL GMGACLLAVG LVPDAWSAAL AWGLMGLFLT VTNVSGSLFF
QAVVPSHVRG RAAAAFRMAG WGLSPVGALL GGLLGRIDLA LPFLVGGLVM VVTPLVFRRS
VAECARLSDE AAASAAGDR
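
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal Python illustration using only the standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; it does not model alternative start codons, which does not matter here because the gene begins with ATG. The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGGCCACCG AACCCTCCGC CGCGCCGCCG"))  # MATEPSAAPP
```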