Gene Ndas_4471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4471 
Symbol 
ID9248350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5309121 
End bp5310584 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content73% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003682366 
Protein GI297563392 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGACA CGCCGGAAGA GTCCGCTGCC GACACCCCGG AGCACCGATG GAGCGGGCGA 
CTCGTCCTCT GGGTCGCCGT TCTCATCCTC GCCAACGTCC TGGCCGACGT GGCCATCGCC
TCCCCCCTGC TGGTCCTGCC CCAACTGCTG GAGCACTTCG ACACCGACCA GGCCGCGTGG
CTGAACGCGA GCGCGATGCT GGCCGGGGCC ATCTGGTCGC CGCTGCTCGC GAAGAGCTCC
GACGTCTTCG GCAAGCGGCG GCTGCTCGTC GTCACGCTGG TGACCGCGTG CGCGGGCGCG
CTGGTCTGCC TCGTCGCCCC CAACGTCTGG ATCTTCCTGG TGGGGCGCTT CCTCCAGGGG
GCAGCCCTGG CCGCGATCTT CATCACGGTC GCCCTCGTAC TCCAGATCTG CGCCCCGCGG
GTGGCCATGC CCGTGATCGG GCTCGTGACG TCGGGATCGG CGATCGTCGG GATCATCGAA
CCGTTCGTGA TGGTGCCGGT CATCGACCTG TTCGGCTACC GCAGCGTGTT CGTCGTGGCG
GCGCTGCTCG CCGCGGCGGC CGCGCTCTGC GTGCGTGCCC TCATCCCGGA GTCGCCGGTC
CGCGGCGCCG GCCGGATCGA CGTGGGCGGG GCGCTCCTGC TGGGCGGCGG CCTCGGCTCC
GTGCTCGCCT ACGTCAGCCT CGGCGGAGAC GCCGGGTGGC TGTCCGCGGG CATGGTCGTG
CTGCTGGCGG CCGGTGTCGC CGCGTTGGCC GGTTGGGCGG TCCTGGCCCT GCGGATCGCC
GAACCCATCA TCGACATCCG GGCCCTCAGC CGACCGGTCC TGCTGACGCT GCTGGCCCTG
GTCCTGGCCG CGGGCTCCTT CCGGAGCATG CTCCAACTGA CGAGCATCGT CGCCCAGGTG
CCTCCCGAGC TGGGACTCGG CTACGGGTTG GGCGACGGAG GCGCGGTGGC GGTCCTGCTC
GCCGCGCCCT CGCTCGGCAT CATGGTCGGC GGCACCCTCG CCGGATGGGT CGCGGGGCGG
TTCGGCGCCG CACGGCCCCT CCTCGCCGCC ATCGCCGTCG GTGCGGCGGC GACCCTCATG
ATGCTCGTCG GCGTGTCGGT CCTGCCGCTG GCGGTCGCCT GCGGGGCCCT GGTCGGCATG
GCCGCCGGCG CGATCTCGAC GTCCGGCTAC AACCTGGCGA CCAGCCTGGC ACCGCCGGAA
CGGCAGGGCA CGGTCTCCGG CCTGGTGTCG GTCATGTTCG CCCTCGGCTC GGTCGTCTTC
AGCTTCGCCG GAGGCGAACT CCTCAAGGCC ACCCAGATCC CCGGGGTCGT GGCCGACGGC
GCCCCGGTGA GCACGGCGAC CGGCGTGTAC CTCTACGTCC TGACGGCCGG GGTGCTCTTC
GTCCTCGCCG CGGTGCCCGC GGGCATGGTG GTGCGCGGCG GGCGCGCGAC GCCCGCGCCG
GTCACGGCCG GGGCGCCGTC GTAG
 
Protein sequence
MKDTPEESAA DTPEHRWSGR LVLWVAVLIL ANVLADVAIA SPLLVLPQLL EHFDTDQAAW 
LNASAMLAGA IWSPLLAKSS DVFGKRRLLV VTLVTACAGA LVCLVAPNVW IFLVGRFLQG
AALAAIFITV ALVLQICAPR VAMPVIGLVT SGSAIVGIIE PFVMVPVIDL FGYRSVFVVA
ALLAAAAALC VRALIPESPV RGAGRIDVGG ALLLGGGLGS VLAYVSLGGD AGWLSAGMVV
LLAAGVAALA GWAVLALRIA EPIIDIRALS RPVLLTLLAL VLAAGSFRSM LQLTSIVAQV
PPELGLGYGL GDGGAVAVLL AAPSLGIMVG GTLAGWVAGR FGAARPLLAA IAVGAAATLM
MLVGVSVLPL AVACGALVGM AAGAISTSGY NLATSLAPPE RQGTVSGLVS VMFALGSVVF
SFAGGELLKA TQIPGVVADG APVSTATGVY LYVLTAGVLF VLAAVPAGMV VRGGRATPAP
VTAGAPS