Gene Ndas_1580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1580 
Symbol 
ID9245430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1932546 
End bp1933760 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content78% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679515 
Protein GI297560541 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0196311 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCACC GACCCCTCCG CGCGCCGGGC CGGGCCCCCT CCGGCACGGA CGCGCGCAGG 
GCGTTGGCCG CCCTGTGCGT CACCGTCACC GCCAGCCAGG GCGTGCTGTT CTACGCCTTC
CCCGTGCTGG CGCCCGCCAT CGCCGAGGAC ACCGGCTGGT CCCTGCCCGC CGTCATCGCC
CTGTTCTCCG GGTCCCAGGT CGTGGCCGGA CTGGGCGGGC CCCTGGTGGC CCGCTGGCTG
CGCGTGCGCG GCCCCCGGCC GGTGATGACC GCGGCGGCCC TGCTGGGCGC GGTCGCCGTC
GCCGGACTGG CCCTGGCCCC GAACCTGTGG TTCTTCGGCG CGGCCTGGCA GGTGGCCGGA
GCCGCCGTGG CCGGGCTCTC CTACCCGCCC GCCTTCGCCG CCCTGACCCG CTGGTACGGG
CAGGGCAGGG TCCGGGCGCT CACCGCCCTC ACCCTGGTCG GCGGGCTGGC CAGCACCGTC
TTCGCCCCGC TGACCGCCGC CCTGGAGGCG CAGGTCGGCT GGCGCGGCGC CTACCTCGCC
CTCGCCCTGG TCCTGGCCCT GGTGGTGCTG CCGCTGCACG CCCTGGCCCT GACCCCGCCC
TGGACTCCCG GCGGCACCGC CGACCGGGCC GGGCACCGCC GGGCGGTGCG CGGGGTGGTC
CGCAGCGGTG CGTTCTGGGC GCTGACCACG GCGCTCGCCC TGGGCACCCT CACCGTGTAC
GCGGTCGTGG TCGGTGTCGT CCCGCTCATG GAGGGGCGCG GGTTCGGCAC CGCCGAGGCC
GCCTGGACGC TCAGCGCCGT GGGCGTGGGC CAGGTGCTGG GCCGCCTCGT CTACGCCCCG
TTGGCGCGGT ACAGCGGGGC GGTGCACCGG ATCGCCGCGG CGCTCCTGGC CTGCGCCGGG
GCGACCGGCC TGATCTCCCT GGTCAGCGGG CCGCTGTGGC TGGTCATGAC CGCGGCGGCG
CTGGTGGGCG CCGCGCGGGG CGTGCTCACC CTGCTCCAGG CCACGGCCGT GGCGGACCGC
TGGGGGGAGG AGCACTACAC CACGCTCAAC GGCATCATGC ACACCCCGCT CATGCTCACG
ATGGCCCTGG CGCCGGGGGC GTGCGCGCTG CTGGCCGGGC CCCTGGGCGG CTACCCGGCG
GTGTTCCTGC TGCTGGCGGC CCTGTCGGTG CTCGGCGCCC TGGTCGCGCT GGCCAGCGGT
CCGGCCCGCC GTTAG
 
Protein sequence
MTHRPLRAPG RAPSGTDARR ALAALCVTVT ASQGVLFYAF PVLAPAIAED TGWSLPAVIA 
LFSGSQVVAG LGGPLVARWL RVRGPRPVMT AAALLGAVAV AGLALAPNLW FFGAAWQVAG
AAVAGLSYPP AFAALTRWYG QGRVRALTAL TLVGGLASTV FAPLTAALEA QVGWRGAYLA
LALVLALVVL PLHALALTPP WTPGGTADRA GHRRAVRGVV RSGAFWALTT ALALGTLTVY
AVVVGVVPLM EGRGFGTAEA AWTLSAVGVG QVLGRLVYAP LARYSGAVHR IAAALLACAG
ATGLISLVSG PLWLVMTAAA LVGAARGVLT LLQATAVADR WGEEHYTTLN GIMHTPLMLT
MALAPGACAL LAGPLGGYPA VFLLLAALSV LGALVALASG PARR