Gene Ndas_2663 details

Gene Information

Locus tag: Ndas_2663
Symbol:
ID: 9246514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3171798
End bp: 3173138
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003680586
Protein GI: 297561612
COG category:
COG ID:
TIGRFAM ID:
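
The start, end, and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
START_BP = 3171798
END_BP = 3173138

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1341 446
```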


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.11892
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACAGC CCGACCCGGC CCCGGCCGGC GCGAGAGCGC CGAAGGTCAA CGCGGCCGAC 
GCCAGGCGCA TCGCCTTCGC GGCCTTCGTC GGAACAGCCC TGGAGTGGTA CGACTACTTC
CTGTTCGGCA CGGCCGCCGC TCTGGTCTTC AACCGCCTGT TCTTCACCGA ACTCGACCCG
GGCGCGGGCC TCATGGCGGC GCTGGCGACC TTCGGCGTGG GCTTCGCCGC CAGGCCCATC
GGATCCCTGA TCTTCGGCAC CATCGGGGAC CGCTACGGAC GGCGCCCCGC ACTGCTGATG
ACCATCGTCA TGATCGGCTG CGCGACCGGG CTCATCGGCG TGATCCCCGA CTACATGGCC
ATCGGGATCG CCGCGCCGAT CCTGCTCGCG GTCCTGCGGC TCCTCCAGGG CCTGGCCGTG
GGCGGCGAAT GGGGCGGCGC CATCACCATC GCCATCGAGC ACGCCCCCGA GCGCCAGCGC
GCCCGCTACG CCGCCCTGGT CCAGATCGGC TCCCCGGTCG GCACGCTCAT CTCCTCGGCC
GCCTTCGCCG CGGTGCTGAC GCTGCCCGCG GCCGACTTCG ACGCCTGGGG GTGGCGCCTG
CCGTTCCTGG CCGCGTTCCC GCTGCTGGGC ATCGCCCTCT ACATCCGCTT CAAGGTGGAG
GAGTCCCCCG TCTTCCAGGA GCTGGTGCAG ATGGAGGACC GCGCCAAGGT GCCCGCCCTG
GCGCTGTTCC GCGAGGCCGG GGGCCGCCTG CTCGTGGCGG TGGCCGCGGC GCTGCTGGGG
GTCGGCGGCT TCTACGTGAT GACGACCTTC GTGGTCTCCT ACGCCTCCAC GGTGCTGGAG
GTCGACCGCC AGGCGGTCGT GAACGCCACG CTCGTCGCCG CCGTCTTCCA GATCGCCACG
ACCCTGGTCG CCGGGCGCGC GGCCGAGCGC TTCGGCCCCG GCCGGATGAC GGTGATCGGC
GCCCTGGCCA CCGCCGCGGC CGCGTTCCCC CTGTTCCGGC TCATCGACAC CGCCGACCCG
TGGGCGATCA CCGCCGCGGT GACCATCGGG ATCTGCCTCA TCACCCTGGC CTACGCCGTC
ACCGGCACCC TGCTGGCCGA CCTCTTCCCG CCCCGGCTGC GCTACAGCGG GGTGTCCCTG
GGCTACAACC TCGCCGGAAC CCTCAGCGGG TTCCTCCCGC TCATCGCCAC CGCGCTCCTG
GCCGTGGACG ACGGCGCGTC CTGGCCCGCG GTGCTGGTCC TCATCGGCAT CTGCGCCGTC
ACCGCGGTCG GCGGCCTGGC CGGTGAGCGG ATCAGGGCCG CGGACACCCG CGCCGCCGCC
GACACCCGGG CCGCGGCGTG A
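
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned up and how a GC fraction like the one in the record could be recomputed. The helper names are hypothetical, and the rounding used by the source database is not stated.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, as printed.
FIRST_LINE = "ATGCCACAGC CCGACCCGGC CCCGGCCGGC GCGAGAGCGC CGAAGGTCAA CGCGGCCGAC"

bases = clean_sequence(FIRST_LINE)
print(len(bases))  # 60
print(f"{gc_fraction(bases):.0%}")
```

Passing the full gene sequence to the same two functions should give a value comparable to the GC content field, provided the database computes it the same way.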
 
Protein sequence
MPQPDPAPAG ARAPKVNAAD ARRIAFAAFV GTALEWYDYF LFGTAAALVF NRLFFTELDP 
GAGLMAALAT FGVGFAARPI GSLIFGTIGD RYGRRPALLM TIVMIGCATG LIGVIPDYMA
IGIAAPILLA VLRLLQGLAV GGEWGGAITI AIEHAPERQR ARYAALVQIG SPVGTLISSA
AFAAVLTLPA ADFDAWGWRL PFLAAFPLLG IALYIRFKVE ESPVFQELVQ MEDRAKVPAL
ALFREAGGRL LVAVAAALLG VGGFYVMTTF VVSYASTVLE VDRQAVVNAT LVAAVFQIAT
TLVAGRAAER FGPGRMTVIG ALATAAAAFP LFRLIDTADP WAITAAVTIG ICLITLAYAV
TGTLLADLFP PRLRYSGVSL GYNLAGTLSG FLPLIATALL AVDDGASWPA VLVLIGICAV
TAVGGLAGER IRAADTRAAA DTRAAA
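
The protein sequence can be reproduced from the gene sequence by codon-wise translation. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Because this gene starts with ATG, a plain codon lookup is enough. The sketch below is an illustration, not the database's own pipeline: it stops at the first stop codon and does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codons in TCAG order, paired with their amino acids ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above, as printed.
print(translate("ATGCCACAGC CCGACCCGGC CCCGGCCGGC GCGAGAGCGC CGAAGGTCAA CGCGGCCGAC"))
# MPQPDPAPAGARAPKVNAAD
print(translate("GACACCCGGG CCGCGGCGTG A"))
# DTRAAA
```

The two outputs match the first 20 and the last 6 residues of the protein sequence listed above.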