Gene Ndas_4657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4657 
Symbol 
ID9248539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5531610 
End bp5532932 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content70% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003682549 
Protein GI297563575 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.511231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTCC ACCAGCAGTC GGCGTCCGGG CCCGCACCGG CGCCGGAGGC GGGCTTCCAG 
TCGCGGATCA AGGTCATCCG CGCGGCGGTC ATCGGCACCG TCGTCGAGTA CTACGACTTC
GGCATCTACG GCTACATGGC CACGTTCGTG GCGATGCTCT TCTTCGTCTC GGAGGACCCG
ACGGCGGCCC TGCTGGGCAC GTTCGCCGCG TTCGCCGTCG CGTTCTTCAT GCGCGTGCCC
GGCGGCATCC TCTTCGGCCA CATCGGGGAC CGCTACGGGC GCAAGCGGGC CCTGTCCTGG
ACCATCCTGC TGATGGTCCT GGCCACCGCG GCCATCGGCG TGCTGCCCAA CTACTACACG
CTCGGCGTCT GGGCGACCGT CCTGCTGGTC CTGTGCCGCT GCGTCCAGGG CTTCGCCGCG
GGCGGCGAAC TCGGCGGGGC CAACGCCTTC GTGGCCGAGT CCGCCCCGGC CCGCTGGCGG
GCCACCCAGA CCTCCCTGGT CAACTCGGGC ACCTACTTCG GCTCGCTGTT CGCCTCGCTG
GTGGCGCTCA CCTTCACCAC GGTCTTCACC GAGCAGCAGA TGCTGGACTG GGCGTGGCGC
CTGCCGTTCC TGCTCAGCCT GCCCATCGGC GTCATCGGCC TCTACATCCG CAGCCACCTG
GACGACACCC CGCAGTTCAA GCAGCTGGAG GACAAGGGCG AGACCGAGCG CATGCCGATC
CGGACCCTGC TGGTCACCAA CTGGCGGTCC GTGCTGAAGA TCATCGGCCT GGGCGCGGTG
ATCACCGGCG GCTACTACAT CGTCTCGGTG TACGCGGCCA CCTACCTCCA GACCACGGCC
GGGCACTCCG CCCAGCTGGC CTTCGCCTCG ACCTCGGTCG CCATGGTCGT CGGCGCCGCC
ACGCTGCCGC TCTCGGGCTA CCTGGCCGAC ACCATCGGCC GCAAGAAGGT CATCCTCATC
GGCAGCGTCG GGGCCGCGCT CCTGGGCTTC CCGATGTTCA TGATGATGTC CGCCGGCCCG
GCCTGGGCCG CGATCGTGGG CCAGACGGCG CTGTTCGTGT GCGTCTCGGT CGTCAACGGC
GCCTCGTTCG TCACCTACGC CGAGATGCTG GGCGCCTCCA CCCGCTACAG CGGGATCGCG
CTCGGCAACA ACGTCACCAA CACGCTCCTG GGCGGCACCG CGCCCTTCGT CGCGACCTGG
CTGATCAGCG CCACCGGCCA GCCGCTGGCT CCCGCGGGCT ACTTCGTCCT CACCGCCCTG
GTGACGCTGG TGGCCGTCCT GTTCGTCACC GAGACCCGCG GCACCGAACT GCCGATCGAC
TGA
 
Protein sequence
MSLHQQSASG PAPAPEAGFQ SRIKVIRAAV IGTVVEYYDF GIYGYMATFV AMLFFVSEDP 
TAALLGTFAA FAVAFFMRVP GGILFGHIGD RYGRKRALSW TILLMVLATA AIGVLPNYYT
LGVWATVLLV LCRCVQGFAA GGELGGANAF VAESAPARWR ATQTSLVNSG TYFGSLFASL
VALTFTTVFT EQQMLDWAWR LPFLLSLPIG VIGLYIRSHL DDTPQFKQLE DKGETERMPI
RTLLVTNWRS VLKIIGLGAV ITGGYYIVSV YAATYLQTTA GHSAQLAFAS TSVAMVVGAA
TLPLSGYLAD TIGRKKVILI GSVGAALLGF PMFMMMSAGP AWAAIVGQTA LFVCVSVVNG
ASFVTYAEML GASTRYSGIA LGNNVTNTLL GGTAPFVATW LISATGQPLA PAGYFVLTAL
VTLVAVLFVT ETRGTELPID