Gene Ndas_3351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3351 
Symbol 
ID9247215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4004083 
End bp4005933 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content72% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003681263 
Protein GI297562289 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.46018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGATC CATCGAAAGT AGACACCGGC TCGAAACCGG GTGCGCCCGA ACCCCGGTTC 
GCCCAGCTCC GGGTGCTGTG GTCGTTCGTG CGCCCGCACC GGAACAAGCT GGCGCTGGGC
CTGGTGCTGG CCCTGTTCGG CTCGGCGCTC GAACTCGCCA ACCCGATGGT GATCAAGCTG
GTCCTGGACA CCGTCTCCGG CGGGGGCGGC CTGCTCGTGC CGATCGCCCT GCTGCTGGGC
CTGTTCGTGC TGGGCACGGT GTCCGGCCTG TGGCACTGGA TCCTCCTGGG CACCGTCGCC
GAGAAGGTGG TGCTCGACGC GCGCACCTCG CTGGTGCGCC GCTACTTCCG CGCAGCGCTC
ATCCCGCTGT CGCGCCGCTC CTCCGGCGAG CTCGTCACCC GGGCGACCTC CGACACGGTC
CTGCTGCGCG AGGCCGCCTC CAGCAGCGTC ATCAGCCTCA TCAACGGCGG CGTGCTGCTG
GTGGGAACGC TGGTCATGAT GGGCGTGCTG GACCTGTTCC TGCTCACGGT CACCTTCGTC
GCGGTGCTCG TGGTCACCGT CCTGTTCCTG ACGCTGATGC CCGCCCTGGC CAAGGCGCAG
GAGAGGGCGC AGAACTCCCT GGGCCTGATG GGCGGCATGC TCGACGGCGC GCTGCGCGCG
GTCCGCACGG TCAAGGTCAG CCGGGCCGAG GAGCGCCTGA GCGGCCAGAT CCTCGAACAC
GCGCGGGAGT CCGCGCGGCA CGGCGTGCGC TCGGTGCGGC GCGAGGCGGT CGCCTGGACG
ATCGCGTTCA GCGGGATCCA GCTCGCCATC ATCTCCATCC TGGGCGTGGG CGCGCTGCGG
GTGTCCTCGG GCGCGATCGA GGTCTCCACC CTCATCGCCT TCCTGCTCTA CGCGTTCACC
CTGATGACCC CGGTCATGGA GCTGTCCCAG AGCGTCACCA CCCTCCAGTC GGGCGTGGCC
GCGGCCAAGC GCATCCGCGA GGTGGAGGCC ATTCCGCTCG AACCCTCCTC CGAGGCGGCG
GACACGGACG CGCCGGTCCC CTCCCCGGAC GGGGACCGTT CGGGCGCGCT GCTGGAACTG
CGCGGGGTCA CCGCGCGGTA CGCGCCCGGC GCCGAGTCCG CGCTGGACGG CGTGGACCTG
GCCGTCCCCC GGCGCGGGCA CACCGCGATC GTGGGGCCCT CCGGCGCGGG CAAGACCACC
GTGTTCTCGC TGCTGCTGCG CTTCCTCGAA CCCGAGGAGG GGCAGCTGTT CCTGGACGGG
ACCCCCTACC GGGAGCTCAC TCCCGGGCAG GTGCGCGGCC GCTTCGCCTA CGTCGAGCAG
GACACCCCGG TCGTCCCCGG CACCATCCGG GAGAACCTGC TGTTCAGCCA CCCCGACGCC
ACCGAGGAGG AGGTGCGCCG GGTCCTGGGC CAGGTGCGGC TGGCCGACAA GATCGACGCC
CTGGAGGAGG GGCTGGACAC CCCGCTGGAC GCCACGTCCT TCTCCGGGGG CCAGCGCCAG
CGCATCGCCC TGGCCCGCGC CCTGCTGCGC TCGCCGGACG TGCTGCTGCT GGACGAGGCC
ACCTCGCAGG TGGACGCGAT CACCGAGGCC GCCATCACCG AGAGCGTGCG CGCCCACGCC
GCGCGGGCGG CCGTGGTGAC CATCGCGCAC CGGCTGTCCA CCGTGATCCA CGCCGACACC
ATCGTGCTGA TGGAGGACGG ACGGGTGCGG GCCAGGGGCA CGCACCGGGA GCTGATGGAC
CGGGACGACC TGTACCGGGA GCTGGTCACG GCACTGCACA TCGCCGAGTC CGGGGCTCCG
GACCCGGGCG GTGACCGGGC CGAGGCGGAC CGGGTGACGC CGGTCACGTG A
 
Protein sequence
MTDPSKVDTG SKPGAPEPRF AQLRVLWSFV RPHRNKLALG LVLALFGSAL ELANPMVIKL 
VLDTVSGGGG LLVPIALLLG LFVLGTVSGL WHWILLGTVA EKVVLDARTS LVRRYFRAAL
IPLSRRSSGE LVTRATSDTV LLREAASSSV ISLINGGVLL VGTLVMMGVL DLFLLTVTFV
AVLVVTVLFL TLMPALAKAQ ERAQNSLGLM GGMLDGALRA VRTVKVSRAE ERLSGQILEH
ARESARHGVR SVRREAVAWT IAFSGIQLAI ISILGVGALR VSSGAIEVST LIAFLLYAFT
LMTPVMELSQ SVTTLQSGVA AAKRIREVEA IPLEPSSEAA DTDAPVPSPD GDRSGALLEL
RGVTARYAPG AESALDGVDL AVPRRGHTAI VGPSGAGKTT VFSLLLRFLE PEEGQLFLDG
TPYRELTPGQ VRGRFAYVEQ DTPVVPGTIR ENLLFSHPDA TEEEVRRVLG QVRLADKIDA
LEEGLDTPLD ATSFSGGQRQ RIALARALLR SPDVLLLDEA TSQVDAITEA AITESVRAHA
ARAAVVTIAH RLSTVIHADT IVLMEDGRVR ARGTHRELMD RDDLYRELVT ALHIAESGAP
DPGGDRAEAD RVTPVT