Gene Ndas_2277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2277 
Symbol 
ID9246127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2724489 
End bp2725898 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content69% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003680205 
Protein GI297561231 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.419016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.219783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCA CACCACCGGA GACGGACGCC ACCGGCGCCC CCTCCGCCGA TCCCGGCCAG 
CGCAGGCGCG AACAGCGCGG CTGGTACCTC TATGACTGGG CCAACTCGGT GTTCACCACC
TCGGTGGTGA CCGTGCTCAT CGGCCCGTAC CTGAGCAACC TGGCGTGTGT GTCGGCCGGG
GCCCCTGACG CCGCGTCCTG CCTGGACCCC GCGCTGTCCA TCAGCCCGCT GGGCCTGGAC
TTCCTCTCGC TGCACCCCAA CGCGCTCTAC CCGGCGCTGA CCACGGTGGC GATCCTGCTC
CAGATCCTGT GCCTGCCGAT CGTGGGGTCG ATGGTCGACC ACTCGCGGCA CAAGAAGCGG
TGGCTGCTCT GGCTCGCCGT AGGCGGGTCG GCGTGCACGC TGGGGCTGTA CTTCGCCACC
GACGGCTACC TGGTGGCCTC GGTACTGTTC GTGCTGGCCA ACCTGCTGTA CGGGCTGGCG
GGCGTGGTCT ACAACGCGTT CCTGCCCGAG GTGGCCACGG CCGAGGAACG CGACCGGGTC
TCGGTCACCG GCTGGGGCAT CGGCTACCTC GGCGGGGCCC TGCTGCTCGC CATCCACCTG
GGCCTGGTGG TCGGCGCTCC GTCCCTGGGC CTGGGCACGG ACGACGCCGC GCGCATCGCC
TTCGCCTCCT GCGGCCTGTG GTGGGCGGGG TTCACCGTCC TGGCGGTGCG GCCGCTGCGC
AACCGCTACG GCGCCCTGGC GGCGAGCAGC CGGGGGCGGC CGAAGGTGGG ACGGTCGCTG
CGCCAGTTCG GGCACACGCT CAAGGACATG CGCAAATACC CGAACACGAT TCTCTTTCTC
CTGGCGTTCA TCCTGTTCAA CGACGGTGTG CAGGCGGCCA TCCGCTACGC GGCGCCCTTC
GCCACCCAGG ATCTGGGACT GGACCAGAAC GTCCTGATCG TCACCATCCT CATCATCCAG
TTCGTGGCCT TCGGGGGCGC CTTCCTGACC GGCCGGGTGG CCCGGGTCCT GGGCAGCAAG
AACACCGTGC TGGCCACGCT CGCGGTGTGG AGCTCGCTCG TGGCGGCGGC CTACTTCCTC
CCGGTCGGGA ACGTGCCGCT GTTCGTGGCC ATGGGCGTGG GCATCGGCCT GGTGCTGGGC
GGCACCCAGT CGCTGGCCCG GTCGCTGTAC TCCCAGCTCA TCCCCCGTGG GCGCGAGGCG
GAGTACTTCA GTCTGTACCA GATCTCGGAC AAGGGATCGA GCTTCCTGGG ATCGCTGACC
GTGACCGTGG CCGTCTCCCT CACCGGCGGC TACCGGATGG CGATCCTGTC GCTGATCGTG
TTCTTCGTCA TCGGCGGTCT GCTGCTGTGG CGCACACGCA TGCGCGAGGG GATTCTCGCG
GTGGGCAACG AGGTACCGCG CAACCTGTAG
 
Protein sequence
MATTPPETDA TGAPSADPGQ RRREQRGWYL YDWANSVFTT SVVTVLIGPY LSNLACVSAG 
APDAASCLDP ALSISPLGLD FLSLHPNALY PALTTVAILL QILCLPIVGS MVDHSRHKKR
WLLWLAVGGS ACTLGLYFAT DGYLVASVLF VLANLLYGLA GVVYNAFLPE VATAEERDRV
SVTGWGIGYL GGALLLAIHL GLVVGAPSLG LGTDDAARIA FASCGLWWAG FTVLAVRPLR
NRYGALAASS RGRPKVGRSL RQFGHTLKDM RKYPNTILFL LAFILFNDGV QAAIRYAAPF
ATQDLGLDQN VLIVTILIIQ FVAFGGAFLT GRVARVLGSK NTVLATLAVW SSLVAAAYFL
PVGNVPLFVA MGVGIGLVLG GTQSLARSLY SQLIPRGREA EYFSLYQISD KGSSFLGSLT
VTVAVSLTGG YRMAILSLIV FFVIGGLLLW RTRMREGILA VGNEVPRNL