Gene Ndas_3996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3996 
Symbol 
ID9247868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4778532 
End bp4780310 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content73% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003681899 
Protein GI297562925 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.181167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTGG TCAATCTTCA GGACGTGTCC CTGGCCTACG GGCCGCTCGT ACTCCTCGAC 
AAGGTGTCGC TCGGCGTCGA CGAGGGTGAG CGCATCGGCG TCGTCGGCCG CAACGGCGGC
GGCAAGTCCA CGCTCATCTC CGTGCTCTCG GGCATCACCC GGCCCGACTC CGGGCGGGTG
GTGCACAACC GCGGGCTGCG CATCGGCTTC CTGCACCAGC GCGACACCTT CCCCGACTCC
ACAGTCGGGG AGTTCGTCCT GGGCGACCGG GCCGAGCACG AGTGGGCGGG CGACGCCCGC
GTCCGCGACA TCCTGCGCGG ACTCCTCGGC GGCTGGGCCC TCGACACCCC CATGAGCGGC
CTGTCCGGCG GCGAGCGCCG CCGCTCCGGC CTGGCCCGGC TCCTGGTGGG CGACCACGAC
CTCATCGTGC TCGACGAGCC CACCAACCAC CTCGACATCG AGGGCATCGC CTGGCTCGCC
GAGCACCTGC GCGCCCGTCC CGAGGCGCTC GTGGTCGTCA CCCACGACCG CTGGTTCCTG
GACGCCGTCA CCACCCGCAC CTGGGAGGTC GGACGCGGCG CGGTCGAGCG CTACGAGGGC
GGTTACGCCG CCTACGTCCT GGCCAAGGCC GAGCGCGAGC GCCAGGCGGC CGCCGCCGAG
GAGCGCCGCC AGAACCTCAT GCGCAAGGAG CTGGCCTGGC TGCGCCGCGG CGCCCCGGCC
CGCACCTCCA AGCCCAAGTT CCGCATCGAG GCGGCCAACC AGCTCATCGC CGACGAGCCG
CCGCCCCGCG ACACCGTCGA ACTGGTCAAG TTCGCCAGCT CCCGGCTGGG CAAGACCGTC
ATCGACCTCA AGAACGTGTC CGCGTCCGTC CCCGACCGGA CCCTGCTCGA CCACCTCACC
TGGCAGCTGG GCCCCGGCGA CCGGGTCGGC CTGGTCGGGG TCAACGGCGC GGGCAAGTCC
ACCCTGCTCA GGATCCTGGC CGGGGAGCGC GAGCCCGACT CCGGCAGCGT GCGCACCGGC
CGGACCGTCA AGCTCGCCCA CCTGTCCCAG AACGTCGCCG AACTCGACCC CGCCGCCCGC
CCCCTCCAGG CGGTCATGGA CGTGCGCGAG CACGTCACCA TCGGCAAGCG CGACTACACC
GCCAGCCAGA TGCTGGAGCG GTTCGGGTTC CGCGGGGAGC GCCAGTGGAC CCCGATCGGC
GACCTGTCCG GCGGTGAGCG ACGCCGCCTC CAGCTGCTGC GGCTGCTCAT GGACGAGCCC
AACGTGCTGC TGCTGGACGA GCCCACCAAC GACCTGGACA TCGAGACGCT CACCGAGCTG
GAGGACCTGC TCGACGGCTG GCCCGGCTCC CTGGTGCTGG TCAGCCACGA CCGCTACTTC
CTGGAGCGCA TCACCGACCG CGTGCTGGCT CTGATGGGCG ACGGCGGCCT GGCCTTCCTG
CCCGGCGGCG TGGACGAGTA CCTCCAGCGC CGCGCGGCCA GCGCGGGCGA GGCCACCGTG
CCGCTGGGCG CCTCCGAGGC CGCCGCGGCC CAGGCCCCGG CGGAGCAGCC CGCGGTGTCG
GCGGCCGAGC GGCGCGCCGC GCAGAAGGAG ATGCAGCGCG TCGAGCGGCG CATGGACCGC
ATCGCCAAGC GGGAGGCGGA GCTGCACGAG CTGATGGCCG CCGCGGCCGA GGACTACACC
CGGCTCGCCG AACTCGACGC AGAGGCCAAG GCCCTGGCCG TCGAGCGCGG GGAGCTGGAG
GAGGTCTGGC TGGAGCAGGC CGAACTCGTC GGGGAGTGA
 
Protein sequence
MNLVNLQDVS LAYGPLVLLD KVSLGVDEGE RIGVVGRNGG GKSTLISVLS GITRPDSGRV 
VHNRGLRIGF LHQRDTFPDS TVGEFVLGDR AEHEWAGDAR VRDILRGLLG GWALDTPMSG
LSGGERRRSG LARLLVGDHD LIVLDEPTNH LDIEGIAWLA EHLRARPEAL VVVTHDRWFL
DAVTTRTWEV GRGAVERYEG GYAAYVLAKA ERERQAAAAE ERRQNLMRKE LAWLRRGAPA
RTSKPKFRIE AANQLIADEP PPRDTVELVK FASSRLGKTV IDLKNVSASV PDRTLLDHLT
WQLGPGDRVG LVGVNGAGKS TLLRILAGER EPDSGSVRTG RTVKLAHLSQ NVAELDPAAR
PLQAVMDVRE HVTIGKRDYT ASQMLERFGF RGERQWTPIG DLSGGERRRL QLLRLLMDEP
NVLLLDEPTN DLDIETLTEL EDLLDGWPGS LVLVSHDRYF LERITDRVLA LMGDGGLAFL
PGGVDEYLQR RAASAGEATV PLGASEAAAA QAPAEQPAVS AAERRAAQKE MQRVERRMDR
IAKREAELHE LMAAAAEDYT RLAELDAEAK ALAVERGELE EVWLEQAELV GE