Gene Ndas_1423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1423 
Symbol 
ID9245273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1744587 
End bp1745873 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content74% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679361 
Protein GI297560387 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG GACCAGCCCC CCGCACCCCC GTGACGAGCG GCCCCGCCGC AGGCCGGACC 
GCCCCGAGCC GTCCACCGCG CGCGGCACTG CTGCTGCTCG CCGCCGGAAT CGCCCTGGCG
GCCCTGAACC TGCGCACCGC CATCACCAGC GTCGGCCCCG TGCTCGACGA GGTCACCGCC
GGGCTGGGCA TGACCGCCGT CGGCGCGGGG ATCCTCACCA CCCTGCCCGT GCTGTGCTTC
GCGCTCTTCG GCGGCCTGAC CCCCGTCCTG GGCCGCCGCC TGGGCGAGCA CCACCTGCTG
GTCTACGCGC TCATCGCCCT CACCGTCGGC CTCGCCGCCC GGGCCGCGGC CCCCGAGCCG
TGGATGTTCC TGGCCCTGAG CGTGGTGGCC CTGTCCGGCG GCGCGGTCGG CAACGTCATC
CTGCCCGCCC TGGTCAAGGA GCACTTCCCC GACCGCGTGG GCCTCATGAC CACCGTGTAC
ACCACCGGCC TGGCGCTGGG CACCACCATC GCCGCCGCGG CCACCGTGCC CCTGGAGCAG
TCCACCGGCG AGTGGCGCGC GGCCCTGGGC GCCTACGCCC TCTTCGGCGT CGTGGCCGCC
GTCCCCTGGC TGCTGGTGCT GCGCCACGAG CCCGCGCGCG GCGACGCCTC CCAGGCACTG
GGCTTCGGCC AGGTGCTGCG CACCGGCCTG GGCTGGCAGT CCGTCCTCTA CTTCGGCACC
CAGTCCTCGG TCGCCTACAT CATGTTCGGC TGGTACGCGC AGATGCTGCG CGACCAGGGC
ATGGACGCCC AGACCGCGGG CCTGGCGCTG TCCTACCTCA CCGTCCTGGG CATCCCCATG
TCCCTGGTAC TGCCCACGCT GCTGACCCGG ACCAGCGACC AGCGCCCCTT CGTGCTGGCC
TTCTCCGCCG CCTACCTCGT GGGCCTGGTC GGGCTGTGGT TCGCCCCGCT GTCGGGGGTG
TGGGCCTGGA CCACACTCGT GGGGATCGGC ATGGGCAGCT TCGTCTTCGC GCTGACCGCC
TTCGCGCTGC GCACCCGCAC CGGGGCGGGG ACGGCGGCGC TCTCGGCCGT CAGCCAGAGC
CTGGGCTACC TCATGGGCGG GGCGGGGCCC TTCCTGTTCG GACTGCTGCG CGAGGTCAGC
GGCGGCTGGC ACGCGCCCCT GCTCCTGCTG GCGGTGCTGG TCGTGGTCAA CCTGGCCACG
GGCCTGTTCC TGGGCCGTCC CCGCTACCTG GAGGACGCCA TCGCCGCCCG CGGCCTCACG
AGGGCAGCCA GTCCAGACGG CCGATGA
 
Protein sequence
MSAGPAPRTP VTSGPAAGRT APSRPPRAAL LLLAAGIALA ALNLRTAITS VGPVLDEVTA 
GLGMTAVGAG ILTTLPVLCF ALFGGLTPVL GRRLGEHHLL VYALIALTVG LAARAAAPEP
WMFLALSVVA LSGGAVGNVI LPALVKEHFP DRVGLMTTVY TTGLALGTTI AAAATVPLEQ
STGEWRAALG AYALFGVVAA VPWLLVLRHE PARGDASQAL GFGQVLRTGL GWQSVLYFGT
QSSVAYIMFG WYAQMLRDQG MDAQTAGLAL SYLTVLGIPM SLVLPTLLTR TSDQRPFVLA
FSAAYLVGLV GLWFAPLSGV WAWTTLVGIG MGSFVFALTA FALRTRTGAG TAALSAVSQS
LGYLMGGAGP FLFGLLREVS GGWHAPLLLL AVLVVVNLAT GLFLGRPRYL EDAIAARGLT
RAASPDGR