Gene Ndas_2586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2586 
Symbol 
ID9246437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3078872 
End bp3080683 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content71% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003680510 
Protein GI297561536 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.653336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00337365 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
TTGACCACCG ACCACCACAC AGGCGCGGCG CCCTCGCGGC GCCGCGGACG CCGCCGGACG 
ACCGCGACCC TGGCCGCGGG CACGGCCGCG GCGCTGCTGC TGGCCGCCTG CGGGGGCGCG
GACGGAGGCG GGGGCGAAGG CGGCCCGGTC GACGCCGAGT TCAACCAGGG CGAGACCGAG
GTGGTCAACC CCTCCGACCA GACCGGCGGC ACCCTGCGCT ACGCCATCTC CTCGGACTTC
GACTCACCCG ACCCCGGCAA CACCTACTAC GCGTTCGGCT GGAACTTCAG CCGCTACTAC
GCGCGCACCC TGCTCACCTA CGTCGGGGCG CCCGGCGCGG AGGGCACCGA ACTCCAGCCC
GACCTGGCCG CGGAGATGCC CAGGCCCAAC GAGGACCTCA CCGAGTGGAC GGTGCCCATC
AAGCAGGGGC TGCGCTACGA GGACGGCTCC GAGATCACCG CGCCCGACAT CGAGTACGCC
ATCGCGCGCG GCAACTTCGG CGCCCAGGCC CTGCCCAACG GCCCCAAGTA CTTCCAGGAC
CTGCTGGCCG ACAGCGACGA CTACGAGGGC CCCTACGCCG ACGAGGACGA CCCGCTGGCC
GGGTTCGACG GGATCGAGAC CCCCGACGAC CACACCCTGG TCTTCCACCT CAAGGACCCC
TTCGCGGAGT TCCCGTACCT GCTCATCCAG CCGCAGACCG CGCCGGTCCC GCCCGAGGCC
GACCGCGGTG AGCAGTACCA GAGCCGCGTC GTCTCCTCCG GGCCCTACAA GTTCGACGGC
GAGTACCGGC CCGGCGTCTC CCTGAACCTG GTGCGCAACG ACCAGTGGGA CCCCGCCACC
GACCCGACCC GCGAGGCGCT GCCGGACCGG GTCGAGGTGC AGCTGGGAGT GGACCAGAAC
GAGATCGACC AGCGCCTGGC CAGCGGCGAC CTGGACGTGG ACCTGGCCGG GGCCGGGGTG
GGACCCGCCA TGCGGGGCAC CCTGCTCACC GACGAGGCGC GCAAGAACAG CGTGGACAAC
CCGCAGAGCA ACACGCTGCG CTACGTCAAC ATCAGCACCG TCCTGGAACC CCTGGACGAC
CTGGCCTGCC GTGAGGCGGT CATGTACGCG GCCGACCGCG ACGCCCTCCA GCGGGCCTGG
GGCGGCGACA CCGGCGGCGA CATCGCCACC CAGATCATGC CCGCGTCGCT GCCGGGGGCC
GATCCCGGCA TCGACCTGTA CCCCTCACAG GACAACCAGG GCGACCTGGA CAAAGCCCGG
CAGAAGCTGG AGGAGTGCGG CGAGCCCGAC GGGTTCTCCA CCTCCATCGG CGTCCGGGCC
GACCGGCCCT CCGAGGTGAG CACCGCCGAG GCGCTCCAGC AGGCCCTGGC CAGGGTGGGC
ATCGAGACGC GGATCAAGCA GTACCCCTCG GACACCTACA CCAACACCCA GGCCGGGTCG
CCGTCCTTCG TGGAGGACAA CGACCTGGGC CTGACCGTGT ACGGGTGGGC CCCGGACTGG
GCGAGCGGCT ACGGCTTCAT GAGCAAGATC CTGGACGGCG ACGCCATCCA GGACGCGGGC
AACGCCAACA TCTCGGAGCT GGACGACGAG CGGATCAACG GCTGGTTCGA CGAGGTCATC
ACCGTGCGGG ACCCCGAGGA GCGCGCCTCG ATCTACACCC GGATCGACCG GCGGGCGATG
GAGCAGGCGG CGATCCTGCC CGCGGTGTTC GAGCGCACGG TGCTCTACCG GCCGCCGAAC
CTGACCAACG TGTACTACCA CTCGGGCTAC TCCATGTACG ACTACATGGC GCTCGGCACC
ACCCGGGAGT GA
 
Protein sequence
MTTDHHTGAA PSRRRGRRRT TATLAAGTAA ALLLAACGGA DGGGGEGGPV DAEFNQGETE 
VVNPSDQTGG TLRYAISSDF DSPDPGNTYY AFGWNFSRYY ARTLLTYVGA PGAEGTELQP
DLAAEMPRPN EDLTEWTVPI KQGLRYEDGS EITAPDIEYA IARGNFGAQA LPNGPKYFQD
LLADSDDYEG PYADEDDPLA GFDGIETPDD HTLVFHLKDP FAEFPYLLIQ PQTAPVPPEA
DRGEQYQSRV VSSGPYKFDG EYRPGVSLNL VRNDQWDPAT DPTREALPDR VEVQLGVDQN
EIDQRLASGD LDVDLAGAGV GPAMRGTLLT DEARKNSVDN PQSNTLRYVN ISTVLEPLDD
LACREAVMYA ADRDALQRAW GGDTGGDIAT QIMPASLPGA DPGIDLYPSQ DNQGDLDKAR
QKLEECGEPD GFSTSIGVRA DRPSEVSTAE ALQQALARVG IETRIKQYPS DTYTNTQAGS
PSFVEDNDLG LTVYGWAPDW ASGYGFMSKI LDGDAIQDAG NANISELDDE RINGWFDEVI
TVRDPEERAS IYTRIDRRAM EQAAILPAVF ERTVLYRPPN LTNVYYHSGY SMYDYMALGT
TRE