Gene Ndas_1913 details


Gene Information

Locus tag: Ndas_1913
Symbol:
ID: 9245763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2332906
End bp: 2333952
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003679846
Protein GI: 297560872
COG category:
COG ID:
TIGRFAM ID:
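
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2332906
end_bp = 2333952

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the final codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1047 348
```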


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0831866
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.645841
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTCA CCCGCGGACG CGGCGTCCTG AGCGCCGCCG CGATCACCTC CGTCCTCGCC 
ATGACCGCCT GCGGCAACGC CGACCTGCCC CCGGCCGCGG CCACCGGCGA CGGCGGCACC
GTCATCACCT ACAACTCCCC CGCCGAGTGG GGCAACTACG GCGAGGTCCT GGCCGCCTTC
ACCGAGCGGA CCGGGATCCA GGCCCCCAAC GACCCGAAGA ACTCCGGGCA GGCCCTGGCC
GCGCTCCAGG CCGAGAAGGG CGCGCCCGTC GCCGACGTCG CCTACACCGG CATCGCCTTC
GCCGGACAGC TCGTGGAGGC CGGGGTCCTG CAGTCCTACG TGCCCGAGGG CGCGGAGGAG
GTCCCCGAGG ACCTGCGCGA TCCCGACGGG AACTGGACGG CCGTCCACAC CGGCACCATC
GCCTTCATCG TCAACGAGGA CCACCTGGAC GGCGCGCCCG TGCCGAGCAG CTGGGAGGAC
CTCCTCGACC CCGCCTACGA GGGCAAGGTC GGCTACCTCG ACCCCACCCA GGCGGCCGTG
GGCTACTCCG CCGCGACGGC GGTCAACCAC GCGCTCGGCG GCGACCTGAC CGACTGGGGG
CCGGGTCTGG ACTACCTGGC CGAGCTGAAG GAGAACGGCG CCTCCACCTC CGCCCAGACC
GCCACCGCCA AGGTCGCCCA GGGCGAGATC CCCATCCTCA TCGACACCGA CTTCAACGGC
TACAAGCTCC GCGACGAGGG CGCCGACGTC AGCGTCGTCA TCCCCGAGGA GGGATCGCTG
CAGATCCCCT ACATCGTCGG CCTGGTCGAG GGCGCCCCCA ACGCCGACAA CGGCAGGGAG
CTGCTGGACT TCTACTTCTC CGAGCAGGGC CAGGGCCTCT TCGCGGACGG TTACATGCGC
CCGGTGGTCG GCCAGATGCC CGAGGAGCTC GCCGACCGGG TCCTGCCCGA GTCCGACTAC
GAGCGCGCGA TCACCATCGA CTACCTCGAA CAGGGCGAGC GGCAGCAGGA GTTCATCGAC
CTGTACCAGA GCGAGGTCGG CTTCTAG
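
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation such as GC content. The record does not say how its GC content figure was calculated; the sketch below assumes the usual definition, (G + C) divided by total length. The helper names `strip_blocks` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def strip_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as printed.
sample = "ATGACCCTCA CCCGCGGACG CGGCGTCCTG AGCGCCGCCG CGATCACCTC CGTCCTCGCC"
cleaned = strip_blocks(sample)
print(len(cleaned), round(gc_percent(cleaned), 1))
```

Passing the full 1047 bp sequence through the same two functions gives a value to compare against the GC content field.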
 
Protein sequence
MTLTRGRGVL SAAAITSVLA MTACGNADLP PAAATGDGGT VITYNSPAEW GNYGEVLAAF 
TERTGIQAPN DPKNSGQALA ALQAEKGAPV ADVAYTGIAF AGQLVEAGVL QSYVPEGAEE
VPEDLRDPDG NWTAVHTGTI AFIVNEDHLD GAPVPSSWED LLDPAYEGKV GYLDPTQAAV
GYSAATAVNH ALGGDLTDWG PGLDYLAELK ENGASTSAQT ATAKVAQGEI PILIDTDFNG
YKLRDEGADV SVVIPEEGSL QIPYIVGLVE GAPNADNGRE LLDFYFSEQG QGLFADGYMR
PVVGQMPEEL ADRVLPESDY ERAITIDYLE QGERQQEFID LYQSEVGF
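
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration rather than the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts), and it does not handle alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence; both start on a codon boundary.
first_line = "ATGACCCTCA CCCGCGGACG CGGCGTCCTG AGCGCCGCCG CGATCACCTC CGTCCTCGCC"
last_line = "CTGTACCAGA GCGAGGTCGG CTTCTAG"

print(translate(first_line))  # MTLTRGRGVLSAAAITSVLA
print(translate(last_line))   # LYQSEVGF
```

The two outputs match the first 20 and last 8 residues of the protein sequence listed above.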