Gene Ndas_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1042 
Symbol 
ID9244888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1283214 
End bp1284713 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content79% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003678991 
Protein GI297560017 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.147331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGTGG ATCCGGTGGG TTCGGTGGTT GAGGACGCGG GGCTGAGCGC GCTGGGGATC 
CGCGTGCGGA ACCTGGAGGG CCGGACCGTG GTCGGCCCGG TGGACCTGCG GGTGCCGGAC
GGGCGGGTCC TGGCGGTCAT GGGCGGGTCC GGCGGCGGAA AGACGAGTGC CGTGCTGGCC
GCCCTGGACG CCCTGCCGCC CGGGCTGGTG CGCGAGGCCG GGGAGGTGCG CTGGCACGGG
ACGCCGATCC CCGCCGGACG TGCGGCGCGT CGCTGGCGGC TGGCCCACGC CGGAATCCTC
GGCCAGGACC CGGCCTCGGA CCTGCACCCC CTGCGCACGG TGTTCGCGCT GGTGGCGGAG
GGGCTGCCGC GCTCCGGACC GGTGGTGGAG GGAGGTCCGT TCTCCAGTCC AGCGGCGGGG
GTGCCGCCGC GTCCCGGAGG CCGGGACGCG CGGCGGGCCG TGCGCGGCGT GCTCGCGGAT
CTGGGCCTGG ACCCCGACGC CGTGGGACGC CGCCGCCCGC ACGAGCTCTC CGGCGGGCAG
GCCCAGCGCG TGGCGCTGGC CAGGGCGCTC GTGGGCGACC CCCGCGTGCT GGTGCTGGAC
GAACCCACCA GCGGCCTCGA CCCCGCCACG GTGGAACTGG TGGTGCGGGT ACTGGAGCGG
CGGCGAGGGA GGCCCGGCCG GGTCACGGTC GTCGTCACGC ACGACCGCGC CTTCGCGGAC
CGCGTGGCCG ACGACCGCCT CGTGCTGGGG AGCTTCTCCG ACGCGCGGGG AGCGGACGCG
GAGCGCGCCA TCCCCTCCGG CGGCGCCGAG GTGCTCGGGC TCCGGGGCGT GCGCGTGACC
GCTCCCGGCG GACGGGAGCT GATCGCCGCG GCGGACCTGT CGGTGCGACG CGGCGAGTGC
GTCGCGGTCC TGGGCCCCTC CGGAAGCGGC AAGTCCACGC TGCTGCGCGC CGTCGCCGGA
CTGCATCCGC CCGCCTCGGG GAGCATGGCC CTGAACGGCG CTCCGCTGCC GCCGCGCCTG
CGCGACCGGG ACCGACCCCT CCTGCGAGCG GTCCAGTTCG TCGGACAGGA CCCCGTGGGG
GCGCTCAACC CCGCCCACCG CGTGGGCACG GCGCTGGCCC GCCCCGCCCG CGTGCTCCTG
GGGCTCTCGC GCGCGGAGGC CCGCGCCCGC GTCCCCGACC TCCTCGCGCG GGTCGGCCTG
CCGGGAAGCG TCGCCGAGGC CCACCCCGGG CGGCTCTCCG GCGGCCAGCG CCAGCGGGTG
GCCATCGCCC GCGCCCTGGC CGCGCGCCCC GGCCTCCTGC TGGCCGACGA GGTCACCTCG
GCCCTGGACG CGGCCAGCGC CCGCACGGTC CTGGAGCTGC TGGACTCCCT GCGCGAGGAA
CACGGGCTGG CCGTGCTCCT GGTCACCCAC GACCGCGACG TCGCCGCCCG CGCCGACCGG
ATGCTGGCGC TGGACCCCGA GCACCGGAAG CTGGACCCCG AGGGCCTCAC CGTGCCCTAG
 
Protein sequence
MTVDPVGSVV EDAGLSALGI RVRNLEGRTV VGPVDLRVPD GRVLAVMGGS GGGKTSAVLA 
ALDALPPGLV REAGEVRWHG TPIPAGRAAR RWRLAHAGIL GQDPASDLHP LRTVFALVAE
GLPRSGPVVE GGPFSSPAAG VPPRPGGRDA RRAVRGVLAD LGLDPDAVGR RRPHELSGGQ
AQRVALARAL VGDPRVLVLD EPTSGLDPAT VELVVRVLER RRGRPGRVTV VVTHDRAFAD
RVADDRLVLG SFSDARGADA ERAIPSGGAE VLGLRGVRVT APGGRELIAA ADLSVRRGEC
VAVLGPSGSG KSTLLRAVAG LHPPASGSMA LNGAPLPPRL RDRDRPLLRA VQFVGQDPVG
ALNPAHRVGT ALARPARVLL GLSRAEARAR VPDLLARVGL PGSVAEAHPG RLSGGQRQRV
AIARALAARP GLLLADEVTS ALDAASARTV LELLDSLREE HGLAVLLVTH DRDVAARADR
MLALDPEHRK LDPEGLTVP