Gene Ndas_2213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2213 
Symbol 
ID9246063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2642788 
End bp2644074 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content73% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003680141 
Protein GI297561167 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.175074 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000120086 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCATTGGA CGTATGAAAC CGGTCTTTCT AGCGTGGACG GTATGAGCCT GACCAGTGCC 
TCGCCGCCTG ATGACGACAC CCGTCCCCGT GCGGGGACCG GCGCCTACCG GCGCACGGTC
ATCGCGCTGG TGGCCGCCGC CGTGGCGACC TTCGCCCAGT TGTGGGCGGT CCAGCCGATC
CTGCCCGCGA TCGCGGAGGG TTTCGGCGCC TCGGCCTCCC AGGCCGCGCT CGCGGTCTCG
CTGGCCACGG GCGGCCTGGC CGGGTTCACC CTGGTCTGGA GCGGGGCGGC GGACCGCTTC
GGTCGTGCGC GTGTCATCGG TGTCTCCCTG CTGGCCGCGA CGCTGCTGGG CTGTGTGATC
CCCTTCGTCG CCGATCTGTG GCCGCTGCTG GTGCTGCGCG CGTTGCAGGG TGCGGCTCTG
GGCGGGGTGC CGGCCGCGGC GGTGGCCTAC CTGTCGGAGG AAATCCACCC GGCCGACGCC
TCGCGTGCCA CGGGCCTGTA CATCGCGGGC AACCCGTTGG GCGGGATGGG CGGGCGTCTG
CTGGCCGGGT TCGCCGCGGA TCTGGGCGGC TGGCAGTGGG GGATCGCCGC CAACACCCTG
CTGGCCCTGG TCGCGCTGGT TGTGTTCGCG CTGGTCCTGC CCCGTCGGCC GCGGGCCGTG
CGCACCGTCG CGGTGCGGGG GGAGGCGTCC CCGTCGCGGG GGTCGGGCGG GGGCGGGGTG
GGCGGGCGCC TGCGTGCGGC GGTGACCACG CCCGGGCTGA TCGCCCTCTA CACCCAGGCC
CTGTTGCTGA TGGGCGCCTT CATGACGGTC TACAACCTGT TGGGTTTTCG TCTGATGGCC
GAGCCCTTCG GGCTCTCCCA GGCCGCGGCC TCGCTGCTCT TCCTGTCCTA CACGGCGGGG
ATGCTGGGTT CGGCGGTGGC GGGGGGAGCC AGCGCGCGTT GGGGCGGGTA CGCGGTGCTC
ACCACGGCGA CCGTGTTGAT GGCCGCCGGG TTGGGCGGGA TGTTCGCCAC GGCTTTGCCG
GGTCTGCTGG CGGCCCTGTT GGTGATGACC TTCGGTTTCT TCTGTGCGCA CGCCACCGCC
TCGGCGTGGG TGGGTACCCG CGCGGTGCGG GGGCGGGCCC AGGCGATGGC GGTCTACACG
CTGGCCTACT ACCTGGGGTC GAGCCTGTTC GGCTGGTTGG GCGGTCTGGT CTACGACGCC
GTGGGCTGGG GTGGGGCGGT GGTCTTCGCG TTGGGGTTGT GCTCGGTGGC CGCGGCGGCG
GGTCTGCGTC TGCGCCGTCT GCGGTAG
 
Protein sequence
MHWTYETGLS SVDGMSLTSA SPPDDDTRPR AGTGAYRRTV IALVAAAVAT FAQLWAVQPI 
LPAIAEGFGA SASQAALAVS LATGGLAGFT LVWSGAADRF GRARVIGVSL LAATLLGCVI
PFVADLWPLL VLRALQGAAL GGVPAAAVAY LSEEIHPADA SRATGLYIAG NPLGGMGGRL
LAGFAADLGG WQWGIAANTL LALVALVVFA LVLPRRPRAV RTVAVRGEAS PSRGSGGGGV
GGRLRAAVTT PGLIALYTQA LLLMGAFMTV YNLLGFRLMA EPFGLSQAAA SLLFLSYTAG
MLGSAVAGGA SARWGGYAVL TTATVLMAAG LGGMFATALP GLLAALLVMT FGFFCAHATA
SAWVGTRAVR GRAQAMAVYT LAYYLGSSLF GWLGGLVYDA VGWGGAVVFA LGLCSVAAAA
GLRLRRLR