Gene Ndas_4495 details

Gene Information

Locus tag: Ndas_4495
Symbol:
ID: 9248375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5331248
End bp: 5332477
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003682389
Protein GI: 297563415
COG category:
COG ID:
TIGRFAM ID:
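
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, that the gene is unspliced, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5331248, 5332477)
print(length, protein_length_aa(length))
```

Under these assumptions the coordinates reproduce the listed 1230 bp and 409 aa.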


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.675529
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACACCTG TGTCCGGTCG GGCCTCCGGC GCGCCCGCAC CCGCCGTGGT CGGTGCGCGC 
CGGGGCCTGG CGGTCCTGTG CGTCACCGTC ACCACCGGCT ACGGGGTGCT GTTCTACGCC
TTCCCCGTCC TGGCGCCGAG CATCACCGCC GACACCGGAT GGTCCCTGAC CGCGGTGACC
GCGCTGTTCT CCGCCTCCCA GGTCATGGCG GGACTGGCGG GCATCCCGGT GGGACGCTGG
GTGCAGGCCC GGGGCCCGCG CCCGGCGATG ACGGCGGCTG CCCTGGCCGC GGCTCCCGCC
GTGGCGGCCC TCGCCCTGGC CCCGAACCTG TGGGGCTTCG CCGCCGCCTG GCTGGTGGCC
GGAGCGGCGA TGGCCGGACT GTTCTACCCT CCGGCCTTCG CCGCCCTGAC CCAGTGGTAC
GGAAGGGCGA AGGTCCGGGC CCTGACCGCG CTGACCCTGG CCGCCGGTCT GGCCAGCACC
GTCTTCGCTC CCCTGACCGC GTTCCTGGAA GGAGTCTGGG GGTGGCGGAC CGCCTACCTG
GTACTCGCGG CCGTGCTCCT GGTCGTGGTG GTGCCCCTGC ACGCCTTCGC CCTGCCACAG
GGCTGGGTCG CCGACGGCGC CGGGCAGCAG AGGGGCCGAG GGCAGGGCGC GCGTGCCGTG
GTGCGCGGTC GGGTGTTCTG GGCTCTGACG ACGGCTCTGG CCCTGGGGTC CTTCACCGTC
TACGCGGTCG TGGTCAACAT CGTCCCCCTG CTGGATGAAC AGGGTTTCGG CACGGCGGAA
GCGGCCTGGG CCCTGGGGGC GGGCGGTGTG GGGCAGGTGC TCGGCCGTCT GGTCTACGCG
CCCCTGGAAC GGTGGACCGA CCCGGTGCCG CGCGCCGTGG CCGTGCTGGG CGCGTGTTCG
GTGACCACCC TGCTTCTGGC CCTGGTGCCG GGACCCCTGG GGCCGGTCCT GGCCATCGCG
GTGCTGGCGG GCATGGCACG CGGCATCCTC ACCCTCCTCC AGGCCACCGC CGTGTCCGAC
CGGTGGGGGA CGGAGCACTA CGCCACCCTC AACGGCGTCA TGCACACCCC GCTCATGCTG
GCCGTCGCGG TCGCGCCCTG GGCGGGCGCA GCCCTGGCCG GTCCCCTGGG CGGCTATCCG
GCGGCGTTCG CGGCGCTGGG AGCCCTGGCG GCGCTCGGCG CGCTGACCGC CCTGGCCACC
CGCGCCGAAC GGGTTCCCAC CCCTTCCTGA
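
The GC content reported for this gene can be recomputed from the sequence text. This is a minimal sketch that assumes the sequence is pasted as shown here, in blocks of ten bases separated by spaces and line breaks; `gc_percent` is an illustrative name, not part of the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# The first two ten-base blocks of the gene sequence, as a short example.
print(gc_percent("GTGACACCTG TGTCCGGTCG"))
```

Passing the full gene sequence gives a value that can be compared with the GC content field.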
 
Protein sequence
MTPVSGRASG APAPAVVGAR RGLAVLCVTV TTGYGVLFYA FPVLAPSITA DTGWSLTAVT 
ALFSASQVMA GLAGIPVGRW VQARGPRPAM TAAALAAAPA VAALALAPNL WGFAAAWLVA
GAAMAGLFYP PAFAALTQWY GRAKVRALTA LTLAAGLAST VFAPLTAFLE GVWGWRTAYL
VLAAVLLVVV VPLHAFALPQ GWVADGAGQQ RGRGQGARAV VRGRVFWALT TALALGSFTV
YAVVVNIVPL LDEQGFGTAE AAWALGAGGV GQVLGRLVYA PLERWTDPVP RAVAVLGACS
VTTLLLALVP GPLGPVLAIA VLAGMARGIL TLLQATAVSD RWGTEHYATL NGVMHTPLML
AVAVAPWAGA ALAGPLGGYP AAFAALGALA ALGALTALAT RAERVPTPS
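
The relationship between the gene sequence and the protein sequence can be sketched as a translation under table 11, the bacterial, archaeal and plant plastid code. Table 11 uses the standard codon assignments but allows alternative start codons, which is why the leading GTG in this gene is read as M. The sketch below is illustrative only: `translate_cds` is a hypothetical helper, and the set of start codons is limited to ATG, GTG and TTG as a simplifying assumption.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 shares these.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an accepted start codon as M and
    stopping at the first stop codon (the stop is not returned)."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First five codons of the gene sequence followed by its TGA stop codon.
print(translate_cds("GTGACACCTG TGTCCTGA"))  # prints MTPVS
```

The output matches the first five residues of the protein sequence above.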