Gene Ndas_0487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0487 
Symbol 
ID9244328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp585245 
End bp587023 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content69% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003678440 
Protein GI297559466 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.183361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGACC TGAGCCGCAG GCGCCTGCTC CGGACGATCG GCCTCGGCGC GGCGGGCGTC 
GCCGGCGCCG GAGTGCTGTC CTCCTGCGCG GGCGGCGACA ACGCCGAGAG CGGCGCCACG
CAGTTCACCG GCGTCTTCGA CTTCGACCTG GCCGCCCAGA CGCGCAACGT CGCCGTGGAG
GACGGCGCGC TGCTGATGAA CTCCGTGTAC GCCGACCTCT TCCTGCCCGC GGGCGCGTTC
TACAACTGGG AGACCCACGA GTGGGACTAC CTGCTCCTGG AGAACAGCGC CTGGGAGGGC
GACGACCTCG TCGTGACCCT GCGCCCGGGC CTGAAGTGGA GCGACGGCAC CGACCTGACC
GCCGAGGACC TGCACCAGAA CTACGCCATC CGGGTCCTGG AGGCGCCCGC GTGGTCGGTG
GGCTTCCCGC AGATCACCGA GTTGGAGCGG CTGGACGACC TCAGCGTCCG CGCGCGCTTC
GGCAACCCCT TCCCCGGCAT CGAGCTCCAG GTCGTCAAGC ACCGGATCTT CTCCAAGTCG
ACCTACGGCG ACTTCGGGGA GCGCGCGATC ACGATGGTGG CCGACGGCGT CCGCCAGGGC
GACGACGAGC ACACCGAGTT CAACGCCGAG TTCATCGAGT TCAACCCCGA GGAGATCATC
ACCAGCGGGC CCTACACCTT CGACCGGGCC CAGATGTCCG ACGCGCGGAT CACCCTGGTG
CGCGAGAGCA CCGGCTACCG GGGCGAGGAG GTGAACTTCG AGGAGGTGGT CGTCCACAAG
GGTGACAACC GCCAGGCCTC CCTGCTCATC CAGCAGGGCG AGGTCGACTA CTCGACCCTG
GCCACCTCCG CCGCCGACCA GCAGGCGTTC CGCGGCGTCG AGGGGTTCCG GTGGATCGAG
CACCCCGGCT ACGACGGCTG CGGACTGATG TTCAACTACG CGGCCAAGCC CGAGCTCAAG
GACGTTCGGG TGCGCAAGGC GCTCAAGCAC CTGCTCGACA GCGACCAGAT CGGCCAGGTG
GCCCGGGGCG AGGCCTACGA CCGGGTGCGG TACTACTCCG GGCTGGTCGA CCTCCAGACC
GAGCAGATCT TCACCCCCGA GGAGCTCGCG GAGTTCGCCG CCTACGACCA CGACCCGGAC
CGGGCCACCG AACTGCTGGA GGAGGCGGGC TGGACCAAGG AGGGCGGGGT CTGGCACACC
GCCGAGGGTC AGGAGGCCAG CTACGAGATC ATCGGCGTCG CGGGCTGGGG CGACTTCGAG
CTGACCGCCA CCCAGGTGGA GGAGGCCTGG AACGCCTTCG GTATCAAGAC GACCGCGCGC
AACGTGCCCG CGGACAACCC GTGGGGCATC TGGGCCGCAG GTGACTTCGA GGTGGCCGTG
CGCCACTGGG GCAACCCGGA GATCCCGCAG TACTGGGGCG CCTTCCAGAT GAACTTCCTG
GTGGAGAACG CCCGCACCGG CGAGACCCCG GGGCAGGACT TCGACCTGAA GGTGGACAGC
CCCAGCCGGG GCGAGGTGGA CCTGGAGGCG CTGGTCGAGG TCGCCAAGAC CGCGCAGACC
GAGGAGGAGC AGACCGAGGC CCTCAAGACG ATGGCGATCG TCTTCAACGA GCTGCTGCCG
CGCATCCCGA TCTGGACCTA CAAGTACCTG GCCCCGGCCC TGGAGGGCGC GCGGGTGGAG
TCCTTCCCCG AGGACCACCC CGCCGCCCAG AACCAGATCT ACCGGGACAA CCACATCATC
CTGTCGCTGA TGCAGGGCGG CCTGGAGGCT GCCGGGTAG
 
Protein sequence
MRDLSRRRLL RTIGLGAAGV AGAGVLSSCA GGDNAESGAT QFTGVFDFDL AAQTRNVAVE 
DGALLMNSVY ADLFLPAGAF YNWETHEWDY LLLENSAWEG DDLVVTLRPG LKWSDGTDLT
AEDLHQNYAI RVLEAPAWSV GFPQITELER LDDLSVRARF GNPFPGIELQ VVKHRIFSKS
TYGDFGERAI TMVADGVRQG DDEHTEFNAE FIEFNPEEII TSGPYTFDRA QMSDARITLV
RESTGYRGEE VNFEEVVVHK GDNRQASLLI QQGEVDYSTL ATSAADQQAF RGVEGFRWIE
HPGYDGCGLM FNYAAKPELK DVRVRKALKH LLDSDQIGQV ARGEAYDRVR YYSGLVDLQT
EQIFTPEELA EFAAYDHDPD RATELLEEAG WTKEGGVWHT AEGQEASYEI IGVAGWGDFE
LTATQVEEAW NAFGIKTTAR NVPADNPWGI WAAGDFEVAV RHWGNPEIPQ YWGAFQMNFL
VENARTGETP GQDFDLKVDS PSRGEVDLEA LVEVAKTAQT EEEQTEALKT MAIVFNELLP
RIPIWTYKYL APALEGARVE SFPEDHPAAQ NQIYRDNHII LSLMQGGLEA AG