Gene Ndas_1169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1169 
Symbol 
ID9245019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1425768 
End bp1427135 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content71% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679116 
Protein GI297560142 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.968326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.245673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCA CCAGTCCCCT GACCTCGTTC CTCGACTACG AGAGGAGCAT CGCGCCACCC 
GGCCACAGCC GGTGGCTCAT CCCGCCCGCG GCCCTGGCGG TACACCTGTC CATCGGCCAG
GTGTACGCGT TCAGCGTCTT CCGCAACCCG CTGGTCGAGC GGTTCGACAC CTCGCTGACC
GCGATCGGTG TCATCTTCAG CATCGCCATC GTCATGCTCG GCCTGTCGGC CGCGTTCGGC
GGCAAGTGGG TGGAGCGCAA CGGCCCGCGC AAGGCGATGT TCGTCTCCGG CCTGTTCTGG
TCCGGCGGGT TCGCGGTCGC CGCGCTCGGC GTGGCGATCG GCCAGCTCTG GCTGGTCTAC
CTCGGCTACG GCTTCCTCGG CGGGCTCGGT CTGGGCATCG GCTACATCTC GCCGGTCTCC
ACCCTCATCA AGTGGTTCCC CGACCGGCCC GGCCTGGCCA CGGGCATCGC GATCATGGGC
TTCGGCGGCG GCGCGCTCAT CGCCTCGCCG CTGTCCTCCG AACTCCTCAA CCGCTACGCC
GACACCCCCG CGGACGCCAT CGCGCCCGCG TTCCTCACCC TGGGCGCGGT CTACTTCGTG
GTCATGATGC TGGGCGCCTT CACCGTGCGC GTCCCCCCGG CCGGCTGGCG GCCCCCGGGG
GCCGCCGCCG AGGAGCCGAA GGGCGCCTCC GCCGGCGTGT CCCCGGCGCT GGAGGGGGTG
TCGGTGGAGA CCGCCGTGCG CACCCCGCAG TTCTGGCTCC TGTGGACCGT GCTCTTCTGC
AACGTCACGG CGGGGATCGG GATCCTTGAG CAGGCCTCGC CCATGGTGCA GGAGTTCTTC
ACCGGGGTCG GCCCGGCCGA GGCGGCGGGT TACGTCGGCT TCCTCTCCCT GTGCAACATG
CTCGGGCGGC TCGTGTGGTC CTCGGCCTCC GACGCGATCG GCCGCAAGCG CGTCTACATC
GGCTACCTCG GCCTGGGCGG GCTCCTCTAC CTGCTGATCG CCGTCGCCGG AACCAGTTCG
ATCGTGCTGT TCGTGGCGCT GACCGGCATC ATCCTGTCCT TCTACGGCGG CGGCTTCTCC
ACGGTGCCCG CCTACCTGAA GGACCTGTTC GGCACCTTCA ACGTGGGCGC CGTCCACGGC
AGGCTCCTCA CCGCCTGGTC GCTGGCCGGG GTCGCCGGGC CGATGATCGT CAACGTCATC
GCCGACGCCC AGCTGGCCGC CGGACGCGAC GGGGCCGAGC TCTACGGGCT CTCCCTCTAC
GTCATGGTCG GCGTCATGGC CGTGGGCTTC CTCGCGAACA TGCTGGTCCG GCCCCTGCCC
GAGCGGGTCC ACCAGCGCGA CCGCGAACGC GTCGCCGGGC GGAGCTGA
 
Protein sequence
MTRTSPLTSF LDYERSIAPP GHSRWLIPPA ALAVHLSIGQ VYAFSVFRNP LVERFDTSLT 
AIGVIFSIAI VMLGLSAAFG GKWVERNGPR KAMFVSGLFW SGGFAVAALG VAIGQLWLVY
LGYGFLGGLG LGIGYISPVS TLIKWFPDRP GLATGIAIMG FGGGALIASP LSSELLNRYA
DTPADAIAPA FLTLGAVYFV VMMLGAFTVR VPPAGWRPPG AAAEEPKGAS AGVSPALEGV
SVETAVRTPQ FWLLWTVLFC NVTAGIGILE QASPMVQEFF TGVGPAEAAG YVGFLSLCNM
LGRLVWSSAS DAIGRKRVYI GYLGLGGLLY LLIAVAGTSS IVLFVALTGI ILSFYGGGFS
TVPAYLKDLF GTFNVGAVHG RLLTAWSLAG VAGPMIVNVI ADAQLAAGRD GAELYGLSLY
VMVGVMAVGF LANMLVRPLP ERVHQRDRER VAGRS