Gene Ndas_1744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1744 
Symbol 
ID9245594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2121907 
End bp2123217 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679678 
Protein GI297560704 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.795428 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAGACG AAGGACGAAA GGAAGCCGGG GCGCCGCCCG TGGACGTTCC GCTACGGCGG 
AACCGCAGGT TCCAGCTCCT GTGGGTGGGG TCGGCGTTCT CGTTCTTCGG GCTGGAGGTC
TCCGAACTCG TCTACCCGCT GGTCGTCCTG GCCCTGACCG GTTCCCCCGC CTGGGCGGGC
GCCTTCGGCG GCGTGCAGAT GGTCGCCACG CTCCTCGCGG CCCTGCCCGC GGGGGAGCTC
TGCGACAGGT ACGACCGGCG CGCGCTCCTG CTCCTGGCGG AGGGCACCCG GGCGGCCGCC
ACGCTGAGCG TGGTGGCGGC ACTCCTGTTC GCGACACTCA CCCTGCCCCA CCTGCTGGTC
GTCGCGGCAC TCCTCGGCCT GGTCACACCG CTGGGCGGCT CGGCGCGCAT GCTCCTGGTC
CGCGCCGTGG TGCCCAGGGA ACAGCTCACC TCGGCCCTCA CCCAGGAGGA GGTGCGCAGC
AACGGCGCGG CCATGGCGGG TCCTCCGCTG GGCGGATTCC TGTACGCGGT GAGCATGGCG
ACGCCCTTCG TGTTCACCAC GGTCACCTTC GTCCTCTCCG TCGTGTGCGT ACTGTTCGTC
CGCCCGGTCC CACCCCGGCC CGGGGGCTCC GAGGAGGACG CCGACGGATC GGCCTGGACC
CGGATGCTGT CGGGCCTCAG GACGATGGCC GCCGCCACCG AGCTGCGCCG GGTCCTGCTC
TTCACCGTCC TGATGAACGC GGTGAGCGCG CCCTTCCTCC TGATCTCGGT GGTGGTCCTG
GAGGAACAGG GCGCGTCCTC CACGGTGATC GGTTTCGCCA TGATGGGCCT GGCGGCGGGC
GGCCTGGTCG GCGCCTTCCT GGTCAAGCCC CTGCACCGGC TGCTGCCCCC CGGCGGGGTC
ATGCTCGCGG TGGGCGGCAG CACCGTCCTC CTCATCGCGC TCTTCGCCGT CCCGTGGGGC
CCCTGGTGGC TGGCCGCCCT CCTGTTCCTG CTCACCGTCG GCGTCCCGGC GATGCGGATC
CTGGTCGACC TGCTGATCTT CCGTCAGGTG TCCGACGAGA TCAGGGGAAG GGTGATCGCG
GCGGCGATGA CCCTCTACGG GGTGGGAGGC GCCGTCGGCA TGGCCGGGGC CGGTCTGCTC
CTGGAGTTCC TGCGGCCCGG CCACGCGGTC CTCACCCTGG CCGCCGTGCT CGCCGTGTGC
GTGCTCCTCG CCTTCGCCCA CCGGGGCTTC CGGACCATGG CGTGGCCCGT GGAGTCCGCT
GACGGCGACG GTTCCCCGGA GAGGTCCACC AGCGAGGCCG ACCCCAACTG A
 
Protein sequence
MEDEGRKEAG APPVDVPLRR NRRFQLLWVG SAFSFFGLEV SELVYPLVVL ALTGSPAWAG 
AFGGVQMVAT LLAALPAGEL CDRYDRRALL LLAEGTRAAA TLSVVAALLF ATLTLPHLLV
VAALLGLVTP LGGSARMLLV RAVVPREQLT SALTQEEVRS NGAAMAGPPL GGFLYAVSMA
TPFVFTTVTF VLSVVCVLFV RPVPPRPGGS EEDADGSAWT RMLSGLRTMA AATELRRVLL
FTVLMNAVSA PFLLISVVVL EEQGASSTVI GFAMMGLAAG GLVGAFLVKP LHRLLPPGGV
MLAVGGSTVL LIALFAVPWG PWWLAALLFL LTVGVPAMRI LVDLLIFRQV SDEIRGRVIA
AAMTLYGVGG AVGMAGAGLL LEFLRPGHAV LTLAAVLAVC VLLAFAHRGF RTMAWPVESA
DGDGSPERST SEADPN