Gene Ndas_2701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2701 
Symbol 
ID9246552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3221341 
End bp3222852 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content71% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003680622 
Protein GI297561648 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.417248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.482514 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTAGAG AATCGAAACC GCGGAGCCAG ACGACCCCTG CGCGGGCCGA GGGTTACCGC 
CAGGCCCGCA GCGCCACCAC GCTGGCCCGA TCAGCCGCAG GGGAGGGGGC GCCGCCTGCG
ACGCTGGGCA CGGTCGGACT CCTGACCGCG TTGTCCGCCC TGCTGCTGTC GGTGATGAGC
TTCGCCGCCG CGGGAATCGC CGTACCGAGT ATCGGCGCCT CGCTGCACGC TTCGGCGTCC
GAGCAGTCGT TGGTGGTGTC GGTCTACTCC CTGGGCTTCG CCGCGCCGAT GGTCGTCGGC
GGGCGTCTGG GCGACCTGTA CGGCAGGCGG CGGCTCTTCC TGTTCGGCAT GGCCGGATTC
ACCGCGTTCT CGCTGATGGC GACGCTCGCG CCGACCATTG CCGTGCTGAT CGTCGCCCGC
GCGCTCACCG GCGTGTCGGC GGCGGCGATG GTTCCCCAGG TGCTCGCGAC GATCACGGCC
TCCACGCATG GACGCGAGCG TGCCCGGGCG GTGGCGTTGT TCGGGGCGAC CGCGGGCGGC
GCGACGGCGG TGGGTCAGGT CCTCGGCGGC GTTTTGCTGT CGGTCCCCCT GCTCGGCTCC
CCCTGGCGCA CGGTCTTCGC GATGAGTGTC CTCATGGGCG CCGTCGCGTT CCTCGCCGCT
CTGCGCTGGA TGCCCAGCAC CGACGCACCG GGCGATCGTT CGCTGGACCT GGTCGGGACC
GCGTTGCTGG GGGTATCGCT GCTCGCGCTG ATGATCCCGT TGTCCCAGGG CGGTGCGCTC
GGTTGGCCCG GGTGGTGCTG GGCACTGCTG GCGGCCAGCC CGGTGGCGTT CGCGGCGTTC
TGGACGCGGC AGCTCCGACT GCACCGCCGC GACCTGGTCC CGCTCGTTCC TCCGCCGCTC
CTGCGTCTGA GGTCGTACCG GCTCGGCCTC ATCATGGCCC TCCTGCTCCA GTCGGCCTTC
GGCGCGTTCA CGTTCCTCTA CGCGCTCTCC ACGCAGACGG GTCTGGGCTG GTCCCCGATG
GGTGCGGCCC TCGTGCTGCT GCCGTTCGCA CTGTGCTTCT TCGCCGTGTC GATCTGGTCG
GGAAAGCTGG CGCCCCGTTT CGGATTCCGC CGTCTGCTGA CGATCGGCGG GTTCGTCCAG
GCGGCGATGC TGGTGGCGAC CGCGGCATCG GTGCTCATGC GGGGCCCGGG TATGAGCGGG
TGGACGCTGG GAGCTCTGCT GGTCGGAGTC GGGGTCGGTC AGGCGCTCAT GTTCGGTCCG
CTGGTCGGGG CGATGATCGC CGACGTCCCG CCCTCCTCGG CAGGAGCGGC CTCCGGGGTC
ATCCAGACCG CGCAGCAGGC CGCCATGGGG CTCGGAGTCG CGGTCGCCGG AGGGGTTCTG
GGTACTGCGA TGGCCGGTTC CACCGCCCCG CCCGGGCAGG ACTACATGAC GGCACTCGCG
ATCTGCATGG TCGTCCAGGC CGCGTTCGCG ATCGCCTTCG CCCTCCTCGC CTTCGCCCTG
CCCAGGCGCT GA
 
Protein sequence
MTRESKPRSQ TTPARAEGYR QARSATTLAR SAAGEGAPPA TLGTVGLLTA LSALLLSVMS 
FAAAGIAVPS IGASLHASAS EQSLVVSVYS LGFAAPMVVG GRLGDLYGRR RLFLFGMAGF
TAFSLMATLA PTIAVLIVAR ALTGVSAAAM VPQVLATITA STHGRERARA VALFGATAGG
ATAVGQVLGG VLLSVPLLGS PWRTVFAMSV LMGAVAFLAA LRWMPSTDAP GDRSLDLVGT
ALLGVSLLAL MIPLSQGGAL GWPGWCWALL AASPVAFAAF WTRQLRLHRR DLVPLVPPPL
LRLRSYRLGL IMALLLQSAF GAFTFLYALS TQTGLGWSPM GAALVLLPFA LCFFAVSIWS
GKLAPRFGFR RLLTIGGFVQ AAMLVATAAS VLMRGPGMSG WTLGALLVGV GVGQALMFGP
LVGAMIADVP PSSAGAASGV IQTAQQAAMG LGVAVAGGVL GTAMAGSTAP PGQDYMTALA
ICMVVQAAFA IAFALLAFAL PRR