Gene Ndas_5360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5360 
Symbol 
ID9249263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp536463 
End bp537755 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003683246 
Protein GI297564273 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.261806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGG AACAGGAACG CCCGTCCCGG TCACCCGGCC GGACCCCGCC GGTCCACAGC 
GCGGAACAGC TCGTCCCCCT CACCCGCAAC CGCGACTTCC AGGTGCTCTG GACCAGCCGG
TTCCTCGCGG GACTGGGCAA GGAGAGCGGC GAGATCGCCT ACCCCCTCCT CGCCCTCCTC
CTCGCGGAAT CGGCGGCGCA GGCGGGCGTC ATCGGAGCGG CCCAGGTCAC CACGGCCATG
GTCACCGCCG TCCTCGGCGG TTCGCTCGCC GACCGGACCA ACCGCCGCAC GGTGCTGCTG
TGCTGCGACC TCGGACGGCT TACGCTGCTC TCCCTCTTCA CCGTCCTCCT GCTCACCGGG
AACGTCACGT TCACCGTCAT CGTGGGCGTC GCGGTCGGCT CCGCAGCGCT GATGGGCGTC
TCCAACCCCG TCGCGATGGC CTCCGTCAAG CAGCTGGTTC CGGCCTCACA GACGGCCGAG
GCCTCCGCCC AGAACCAGAT CCGCCTCTTC AGCACCACCG CCCTCGGCGG ACCCTTCGCC
GGAACCCTGT TCGGCGTGGG CCGGGCCTTC CCCTTCGCCG CCGAGGCCCT CGCCTACCTG
GTGTCGGCGG CCCTGGTGCT GCTCATCCGC CGCCCCATGC AGGCCCACCC GACCGGCGCG
CGCGGACCGT GGACCCTGCG CGAGGCGGTC AGCGGGTTCA CCGTGCTGGC CAGGCACCCG
ATCCTGCGGC CGATGATCTT GTGGATCGTC GGGTTCAACC TCACCTACAC CCAGACGGGC
GCCTTCCTGG CCCTCATCGC CACCGCCCAG AGCCAGGGCG CCAGCCACCT CCAGACCGGG
ATGACCGTCT CCCTGGCCGG GTCCGGCGGC CTGCTCGGCG CGCTCTGCGC CGGGGCGGTC
GTCAGGCGGG TGCGGCCCTC GGCCATCTTC CTGGTCGCGG CCTGGGCCGC CCCGGTGTGC
GCTCTGGGGC TGCTGTTCGC ACCCAACGTG ATGTTCCTCG GGGCGCTGGT GGGCTGCGTG
TTCGCCATCG TGCCCTGCGT GAACGCCGTG TTCCACGGTT ACGTCGCGGT GTCGGTCAGC
GACCGCTACC AGGGCCGCGT CCTGGGCGCC GTCACGTTCA TGGCGCTGGT GTCGCAGCCG
GTGGGCATCC TCGGCATCGG GGTGATCTTC GACCACGCCG GACCCGCCTG GGTGTTCCTG
ACGATGGCGC TGGTCTCGGC GCTCGCCGCC CTGTTCAGCC TCTCCCCGGT CATGCGCGAC
CTGCCCCGGC CCGAGGAGGT GGCCGTGGCC TGA
 
Protein sequence
MTAEQERPSR SPGRTPPVHS AEQLVPLTRN RDFQVLWTSR FLAGLGKESG EIAYPLLALL 
LAESAAQAGV IGAAQVTTAM VTAVLGGSLA DRTNRRTVLL CCDLGRLTLL SLFTVLLLTG
NVTFTVIVGV AVGSAALMGV SNPVAMASVK QLVPASQTAE ASAQNQIRLF STTALGGPFA
GTLFGVGRAF PFAAEALAYL VSAALVLLIR RPMQAHPTGA RGPWTLREAV SGFTVLARHP
ILRPMILWIV GFNLTYTQTG AFLALIATAQ SQGASHLQTG MTVSLAGSGG LLGALCAGAV
VRRVRPSAIF LVAAWAAPVC ALGLLFAPNV MFLGALVGCV FAIVPCVNAV FHGYVAVSVS
DRYQGRVLGA VTFMALVSQP VGILGIGVIF DHAGPAWVFL TMALVSALAA LFSLSPVMRD
LPRPEEVAVA