Gene Ndas_2440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2440 
Symbol 
ID9246290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2893917 
End bp2895191 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content78% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003680366 
Protein GI297561392 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.293743 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCCACC CCCAGCAGAC CGGCACTGTG CCCACGGCCC CACCGCGCGC GCACGTGTTC 
TCGGACCCCG CCTTCCTCCG CCTGTGGTCG GGGAGCACCG CCTCGGGACT GGCCACGTGG
GCCATGCCGT TCATCCTCGG CCTGGCCGTC CTGGACGGAT CGCTCACCGC CATGGGGCTC
GGCCTGCTCC TGGCCACCCG TACCGGCGGG TTCCTCCTCG CCCTGCCGCT CGGCGGCCTG
CTGGCCGACC GGCTCTCCCG CCGCGCGGTG GTGCTGTGGG CCGGGGTCCT CGCGGCGGCG
GCCACTCCGC TGGTCGCCGC GGGCGTGGCG ACCGGCGCCC TGGCGCTCGC GGCCGTGGCC
GCGGCGGCCG TGGGCGCGGG CCAGGGAGCC TGTAGACCCG CCTTCCAGGC GCTGACCGCC
GAGGTCGTGG ACGAGCCGCG GCGCCAGCGC GCCAACGCGG CGCTCACCAT CTCCGTGCGG
GTCACCACCC TGGTCGCCCC GGGGGCGACG GCGCTGCTCT CCACGGTCCT GGGCGTGCAC
GCCCTCCTCC TGGTCACCGC CGCGCTCTGG GCGGTCGCCG CCCTGGCCCC GCCCCGGGGC
CGGAGCGCGC CCGCCCCGGG AGCGGTGCCC GCGCGCGGCT TCTCCCCGGT CGCCGACTTC
CGCGACGGGC TGCGCGAGGC CCGGCGCCAC ACGTGGTTCC TCGCGGGGCT GGGCGCCCTG
ACGGCGGTGA TCGCCACCGG GTACTCCGCC ACGGGCGTGC TGCTCCCGGT TGTCAGCCGC
GACACCTACG GCACCGAGGC GGTGCTGGCG GGCGCGCTGA CCGCCTACAC CGGCGGCGCG
CTGGCCGGGG CGCTGCTGAT CGGGCGCTGG CGCCCCTCCT CCCAGGGGTG GGTGGCGCTG
GCGGGCCTGG CCCTGTACGG GCTCGCGCCG CTGAGCCTGC TCCTCCCCGT GGGCCCGTGG
ACGGTGTTCG CCGCCTACGC CCTGGCGGGG GTCGGGATCG AACTGTTCAA CGTCCCGTGG
TTCACCGCCG CCCAGCGCGA GGTCGCGCCC GACAAGCTCG CCCGCGTCTC CTCCCTGGAC
TTCCTGTTCT CCTACGGCCT GGCCCCGGTC GGGCTCGCGC TGATCGCCCC GGCCACCCAG
GCCTTCGGCA CGGAGGCGGT CCTGGTCGTG TGCGCCGCCC TGTGCTTCCT GGCCCCGGGG
GCCGCGGCGC TCGCCCCCGG TTCGCGCCAC TTCGCCATGG GAGGCCATGG GGCCCCGGCC
CCGCGAGCCG CCTGA
 
Protein sequence
MAHPQQTGTV PTAPPRAHVF SDPAFLRLWS GSTASGLATW AMPFILGLAV LDGSLTAMGL 
GLLLATRTGG FLLALPLGGL LADRLSRRAV VLWAGVLAAA ATPLVAAGVA TGALALAAVA
AAAVGAGQGA CRPAFQALTA EVVDEPRRQR ANAALTISVR VTTLVAPGAT ALLSTVLGVH
ALLLVTAALW AVAALAPPRG RSAPAPGAVP ARGFSPVADF RDGLREARRH TWFLAGLGAL
TAVIATGYSA TGVLLPVVSR DTYGTEAVLA GALTAYTGGA LAGALLIGRW RPSSQGWVAL
AGLALYGLAP LSLLLPVGPW TVFAAYALAG VGIELFNVPW FTAAQREVAP DKLARVSSLD
FLFSYGLAPV GLALIAPATQ AFGTEAVLVV CAALCFLAPG AAALAPGSRH FAMGGHGAPA
PRAA