Gene Ndas_1553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1553 
Symbol 
ID9245403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1903193 
End bp1904956 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content68% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003679488 
Protein GI297560514 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.525667 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.106292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGGCA GTTCCTTGAG GAAACGCACC CTCGGCTTCG TCGCGCTCGG CGCGGCGGCT 
TCCATCCTCC TTTCGGCGTG CGGGGGCGGT GGCTCCAACG GCGGAGACTC CGCGTCCGAC
GCCGCCTTCG ACCAGGGCTC GACCGAGGTC GTCAACGCGT CCGACCAGAC CGGCGGCACC
CTGCGCTACG CCATCTCCTC GGACTTCGAC TCCACCGACC CGGGCGACAC CTACTACGGG
TTCAGCTGGA ACTTCACCCG CTACTACGCC CGTACGCTGC TGGCCTTCAC CCCGGCGCCG
GCCCAGGAGT CCACCGAGCT GACCACGGAC ATGGCCGCTG GCATGCCCGA GCCCAACGAG
GACTTCACCG AGTGGACCAT CAAGATCCAG GAGGGCCTCA AGTACGAGGA CGGCTCCGAG
ATCACGGCGC AGGACATCAA GTACGCCATC GCGCGCTCGA ACTACAACGG CGGTGAGCTG
CCCAACGGTC CGCGCTACTT CGAGGCGCAC CTGGACCAGG AGCCCTTCAA CGTCTACGAG
GTCGACGACC CGCTGGAGAC CTTCACCGCG GTCGAGACCC CGGACGACTA CACCCTGGTC
TTCCACCTCA AGGACCCCTT CTCCGAGTTC CCCTACGTGC TGACCCAGCC GCAGACCGCC
CCGGTGCCCG TGGAGGCCGA CCGCGGCGCG CTGTACAAGG AGAAGGTCCT CTCCTCGGGC
CCGTACAAGT TCGAGGGCAA CTACGAGCCC GGCGTCCAGC TCAACCTGGT CCGCAACGAG
CACTGGGACG CCGAGACCGA CCCGATCCGC CCGGCCCTGC CGGATGAGGT CACCGTCCAG
ATCGGCATCG ACCAGGACGA GATCGACCAG CGCCTGGTCA ACGGCGACCT CGACGTCGAC
CTGTCCGGCG TCGGCGTCGG CCCGGCGATG AAGAGCAGCC TGCTCACCGA CGAGGACGCC
CAGGCCAACC TGGACAACCC GTACTCGGGC GCCCTGCGCT ACGTCAACAT CCACACCCCG
GTCATCGAGG ACGTGGCCTG CCGCCAGGCG ATCATGTACG CGGCCGACCG GGACAGCCTG
CACCGCGCCT GGGGCGGCGA GACCGGCGGC GACATCGCCA CCAACCTGCT CCCGCCGACC
ATCCAGGGCT CGAACCCGGA GTCGGACCTG TACCCCTCCG ACGACGACAA GGGCGACCTG
GCCGCCGCCG AGGCCAAGCT GGAGGAGTGC GGCGAGGCCG AGGGCTTCAG CACCACCATC
GCGGTCCGCG ACGGCCGTCC CAACGACATC GCCACCGCGG AGTCCCTCCA GGAGTCCCTC
AAGCGCGTGG GCATCGAGGT CGACATCCAG ACCTTCCCGG CCGAGGACTT CTTCGCCCAG
TACGCGGGCT CGCAGGACTA CGTCCGTGAG AACAACATCG GCCTGAGCGT CTCCGGCTGG
ATCCCCGACT GGGCCACCGG CTACGGCTTC GCCTCCAAGA TCACCGACGG CGACGCCATC
CAGGCCACGG GCAACTACAA CACCTCCGAG CTCGACGACC CGGAGATCAA CGCCCTGTGG
GACGAGGCGC TGGCCACCGA GGACCCGGAC GAGCGCGCCA GCATCTACGA GCAGATCGAC
ACCCTGGTCA TGGAGCAGGC GGCCATCCTG CCGGTCGTCT TCGACCGCGC GCTGTTCTAC
CGCTCGGACG AGCTGACCAA CGTCTACTAC ACGTCCTCGT ACGCGATGTA CGACTTCATG
GCCCTGGGCG TGGACCGGGG TTAG
 
Protein sequence
MRGSSLRKRT LGFVALGAAA SILLSACGGG GSNGGDSASD AAFDQGSTEV VNASDQTGGT 
LRYAISSDFD STDPGDTYYG FSWNFTRYYA RTLLAFTPAP AQESTELTTD MAAGMPEPNE
DFTEWTIKIQ EGLKYEDGSE ITAQDIKYAI ARSNYNGGEL PNGPRYFEAH LDQEPFNVYE
VDDPLETFTA VETPDDYTLV FHLKDPFSEF PYVLTQPQTA PVPVEADRGA LYKEKVLSSG
PYKFEGNYEP GVQLNLVRNE HWDAETDPIR PALPDEVTVQ IGIDQDEIDQ RLVNGDLDVD
LSGVGVGPAM KSSLLTDEDA QANLDNPYSG ALRYVNIHTP VIEDVACRQA IMYAADRDSL
HRAWGGETGG DIATNLLPPT IQGSNPESDL YPSDDDKGDL AAAEAKLEEC GEAEGFSTTI
AVRDGRPNDI ATAESLQESL KRVGIEVDIQ TFPAEDFFAQ YAGSQDYVRE NNIGLSVSGW
IPDWATGYGF ASKITDGDAI QATGNYNTSE LDDPEINALW DEALATEDPD ERASIYEQID
TLVMEQAAIL PVVFDRALFY RSDELTNVYY TSSYAMYDFM ALGVDRG