Gene Ndas_3881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3881 
Symbol 
ID9247752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4652473 
End bp4654014 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content76% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003681784 
Protein GI297562810 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCTC GACGCGCCAC CGCCCGCACC TGGGCCGGAC TGGCCCTCCT ACTGATCCCG 
GCACTGCTGG TGTCCATGGA CATCTCGGTG CTCTTCGTCG CCGCGCCCGC CATCACCGAG
GCGCTGCGAC CCACGTCCGC GCAGTGGCTG TGGATGATGG ACGTCTACGG CTTCGTGCTC
GCCGGACTGC TCGTCACCAT GGGAAGCCTG GGCGACCGGA TCGGCCGCAG GCGCCTGCTG
CTCACCGGCG GGGTGCTCTT CGGCGCCGCG TCCGTGCTGC TGGCGCTGGC GCCCTCGCCG
GAGCTGTTCA TCGCCGGGCG GGCCCTGCTG GGCGTGGCAG GGGCGACCCT GGCGCCCTCC
ACGCTCTCCC TGGTCCGGGA CATGTTCACC GACCCCCGCC AGCGCGGCGC CGCGGTCGGG
GCCTGGACCG TCGCCTTCAC CGGCGGCGCC GTCGCCGGGC CGATCCTCGG CGGACTGCTC
CTGGAGTTCT TCTGGTGGGG CTCGGCCTTC CTCGTCAACC TGCCGTTCAT GGTCGTGCTG
GTGGCCGCCG CACCCCTGCT CGTGCCCGAG TCGCGCGACC CGGAGGCCTC CGGCTTCGAC
CTGCCGGGCG CGGGCCTCTC GCTCGCGGCC GTCCTGGGCC TGGTCTACGG CGCCAAGCGC
CTGGCCGAGC ACGGGGCCGA CCCCCACGCC CTCACCGCCC TGGCGGCCGG AGCGGCGCTC
CTGGCTCTGT TCGTGCTCCG GCAGCGCCGT GCCGCGCACC CGCTGATGGA CCTCTCGCTC
CTCGCCCGCC CCGCTTTCAC CGCCGCGATC ATCGGCAACC TGGCCCTGTC CTTCGCCGTC
GGGGGGATGG GGCTCCTGAC CTTCACCTTC CTCCAGACCG TGCACGGCCT GAGCCCGCTC
CACGCCGCCC TGTGGGCGCT GCCCACGATC CTGGGCACCG TCCTGGGCGC GGTCCTGGCC
GGCTCGCTCG CGCCCCGGGC CAGACCCGGC GTGCTCATGG CGGCGGGGCT GGCCCTCAGC
GCGGCGGGGT TCGCGGTCGT GGGCCTGGTG GACGCCGACA CCCGCCTGGC GGTGTTCCTC
GGCGGCTACA CCCTGCTGAC CCTCGGGGCC GGCGTCGTCG GAACCCTGGC CAACACCCTG
GTCCTGGCCA CGGCCCCCCG GGAGCGCGCC GGGGCCGCCG CGGGGATCTC CGAGACCAGC
ACCGAGTTCG GCACCGCCCT GGGCATCGCG GTCCTGGGCA CCGCCGCAGG CGCCGTCTAC
CGCACCTCCG TGGCGGACGC GCTGCCCTCG GTGGACGGGG CCGCGGCCGA GACCGTCACC
GGAGCCCTGG CCGCCGCCCC CCGAGCACAG GACCCCGGGG CCCTGCTCGA CGCGGCCTTC
GACGCCTACA CGGCCGGGGT CAACACCGCC GCCCTCACCG GCGCGGGCGT GCTGGCCGCG
GTCGCGCTCC TGGTCGCCGT CGCGCTGCGG AGGCTGCCCC CCGCGACCGG CGGGGAGCCC
GGCGCACCGG CCCCGGCCGG GGGAGTCCCG GCCCGTCCGT GA
 
Protein sequence
MNPRRATART WAGLALLLIP ALLVSMDISV LFVAAPAITE ALRPTSAQWL WMMDVYGFVL 
AGLLVTMGSL GDRIGRRRLL LTGGVLFGAA SVLLALAPSP ELFIAGRALL GVAGATLAPS
TLSLVRDMFT DPRQRGAAVG AWTVAFTGGA VAGPILGGLL LEFFWWGSAF LVNLPFMVVL
VAAAPLLVPE SRDPEASGFD LPGAGLSLAA VLGLVYGAKR LAEHGADPHA LTALAAGAAL
LALFVLRQRR AAHPLMDLSL LARPAFTAAI IGNLALSFAV GGMGLLTFTF LQTVHGLSPL
HAALWALPTI LGTVLGAVLA GSLAPRARPG VLMAAGLALS AAGFAVVGLV DADTRLAVFL
GGYTLLTLGA GVVGTLANTL VLATAPRERA GAAAGISETS TEFGTALGIA VLGTAAGAVY
RTSVADALPS VDGAAAETVT GALAAAPRAQ DPGALLDAAF DAYTAGVNTA ALTGAGVLAA
VALLVAVALR RLPPATGGEP GAPAPAGGVP ARP