Gene Ndas_1036 details


Gene Information

Locus tag: Ndas_1036
Symbol:
ID: 9244882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1276322
End bp: 1278061
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: ABC transporter related protein
Protein accession: YP_003678985
Protein GI: 297560011
COG category:
COG ID:
TIGRFAM ID:
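The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1276322
END_BP = 1278061


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # 1740 579
```

Both results agree with the Gene Length (1740 bp) and Protein Length (579 aa) fields of the record.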


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0188433
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0545959
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCCGCG CCTTCCTCAA GGCCCTGGGA CCCGAGCAGG CCGGGCCCAT GCGCGCGAGC 
CTCGCGCTCA CCACCGTCGT CTCGGCCCTC CAGGGCGTGC TGTTCGCCCT CCTGGTGCCC
GTGCTGTCCC ACCTCCTGGG CCCCGACCCG GACCGGGCCT GGCCCTGGGC GGCCGTCCTG
CTCGCCGCCA CCGCCGTCTA CGCTGTCCTG CGCGCGGGCA GCCTGTACCT CAACTTCCGC
GTCGGCGGCG CGCTCTCGCG GGCCCTGCAC CACCGGCTCG GCGACCACGT CGTCCGCCTG
CCCCTGGGAT GGTTCACCGG AGGCCGGGTC GGGGAGCTCA ACCGCCTCGC CACCGACGGC
GTCTCCCGGG CCACCAGCCT GCCCGTGCAC CTGTACCCGC CCCTGGCCGA CGCGGTGGTC
ACCCCCCTCG TCTCGGTCCT GGCCCTGTTC GTGTGGGACT GGCGGATCGC GCTCGCGGCC
GCCGCCTGCC TCCCGCTGCT GTGGATGGTC TTCACCCTGT CCGGGGAGGC GGTCGGGCGC
AACGACGCCG CACGCGACGC CGTCACCGAC GAGGCCGCCG ACCGGGTCCT GGAGTACGCC
CGCGCCCAGC CCGTGCTGCG CGCCTTCGGC AGGACGGACA AGGGCGGCCA GCGCCTCGAC
GCGGCGCTGG AGGCCGAGCA CGGCGCCGCC CGCCGCCTGC TGCTGCGCGC GGTCCCCGCC
CTGCTCGGCT ACTCCTTCGC GGTGCGCCTG GCCTTCGGCC TGCTCCTGGT CGCCACCGTG
TTCCTCGCGC TCGGCGGCAC CCTCGACGCT CCCCTGGCGG TCGCGCTGCT CGTGCTCGTG
GCCAGGTTCG TCCACCCCCT GTCCGGCGCG GCCGACCAGG GGGCCGCCCT GAGGATGGCG
ACGAACGGGC TCGACCGGAT CAACGCCGTC CTGGAGGCGC GCCCCCTCCC CGAACCGGAC
ACGCCGGTGC CGCCCCGGGG CGCGGACGTG GAGTTCGACG ACGTGTCCTT CTCCTACGCC
CCGGACGGGC CCCGCGTCCT GGACGGGGTC TCCTTCCGGG CCGAACCCGG GACGCTCACC
GCCCTGGTCG GGCCCTCGGG TTCGGGCAAG ACCACGGTGG CCCGGCTGCT GGCGCGCTTC
CACGACGTCG ACGCGGGCAG CGTGCGCCTC GGCGGGGTGG ACGTGCGCTC GGTCGGCAGC
GAGGAGCTGT CCCGGCACGT GGCGATGGTC TTCCAGGACG TGTACCTGTT CGACGCGAGC
ATCGCCGAGA ACGTGCTCCT GGCCGACCCC GCGGCCACGC GGGAGGACCT GGACCGGGTG
GCCGCCGCCT CCGGCCTCGA CGCGGTGGTG GCCGAGTTGC CGGACGGCTG GGACACCCGC
GTCGGCGAGG GCGGCGCCTC GCTCTCGGGC GGGCAGAAGC AGCGGGTCTC CATCGCCCGC
GCCCTGCTCA AGGACGCGCC GGTCGTCGTC CTGGACGAGG CCTCGGCCGC CCTGGACGCG
GAGAACGAGG CGCTGCTGAC CGCGACCGCC GTTTCCCTGG CGCGGGAGCG CACGGTGCTC
GTCATCGCCC ACCGGCCCGC CACCGTGGCC GCCGCGGACC GGGTGGTCTT CCTCGACGCC
GGACGCGTCG CGGAGGCCGG AACCCCGGCC GAACTCCTCG CCGCGGGAGG GCGCCACGCC
GAGTTCGCCC GTGCGCGCGA GCGGGCCCGG GGCTGGAGGC TGACCGCCGA ACCCTCCTGA
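The gene sequence is listed in blocks of 10 bases, so it has to be joined before anything can be computed from it. A minimal sketch, assuming GC content is the percentage of G and C bases over the whole sequence (the record does not say how its GC content field was derived). The helper names are hypothetical, and only the first line of the listing is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the blocked listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing; paste all lines to check the full gene.
first_line = "GTGATCCGCG CCTTCCTCAA GGCCCTGGGA CCCGAGCAGG CCGGGCCCAT GCGCGCGAGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on the complete listing gives a value that can be compared with the GC content field above.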
 
Protein sequence
MIRAFLKALG PEQAGPMRAS LALTTVVSAL QGVLFALLVP VLSHLLGPDP DRAWPWAAVL 
LAATAVYAVL RAGSLYLNFR VGGALSRALH HRLGDHVVRL PLGWFTGGRV GELNRLATDG
VSRATSLPVH LYPPLADAVV TPLVSVLALF VWDWRIALAA AACLPLLWMV FTLSGEAVGR
NDAARDAVTD EAADRVLEYA RAQPVLRAFG RTDKGGQRLD AALEAEHGAA RRLLLRAVPA
LLGYSFAVRL AFGLLLVATV FLALGGTLDA PLAVALLVLV ARFVHPLSGA ADQGAALRMA
TNGLDRINAV LEARPLPEPD TPVPPRGADV EFDDVSFSYA PDGPRVLDGV SFRAEPGTLT
ALVGPSGSGK TTVARLLARF HDVDAGSVRL GGVDVRSVGS EELSRHVAMV FQDVYLFDAS
IAENVLLADP AATREDLDRV AAASGLDAVV AELPDGWDTR VGEGGASLSG GQKQRVSIAR
ALLKDAPVVV LDEASAALDA ENEALLTATA VSLARERTVL VIAHRPATVA AADRVVFLDA
GRVAEAGTPA ELLAAGGRHA EFARARERAR GWRLTAEPS
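The gene begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG is an accepted start codon and the initiating codon is read as methionine. The sketch below translates a CDS under that table. The amino acid string and start codon set are transcribed from the NCBI genetic code listing for table 11 and should be checked against it. The function name is illustrative.

```python
BASES = "TCAG"
# Amino acids for table 11, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons that table 11 accepts as translation starts.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a CDS given in the blocked listing format."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # An accepted start codon in first position is read as methionine.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop the trailing stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("GTGATCCGCG CCTTCCTCAA GGCCCTGGGA"))  # MIRAFLKALG
# Last 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate_cds("GAACCCTCCT GA"))  # EPS
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.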