Gene Ndas_0694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0694 
Symbol 
ID9244536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp854431 
End bp855678 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content70% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003678645 
Protein GI297559671 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.333505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTCC GCAGCGCACG AGCGGTCACG GGCATCGCCG CGATCGCACT GATGGCGACC 
GCGTGCAGTG GTGGCGACGA GGGCGGGGAG GCCGCCGCCG AGGGCAGCCT CGTCATCTGG
TCCGACCCCG AGCGCGCCGA CGCCATCAAG GCGGCCGCCC AGGAGTTCGC CGAGGCCAAC
GGCATCGAGG TCGAGGTCCA GGGCCTGACC TTCGGCGACA TCCAGGGCGA CGTGCTCAAC
GCCCACCAGG CCGGAAACGC CCCCGACGTC TTCATCGGCG CCCACGACTG GACGGGCAAC
CTCGTGCGCA ACGGCGCCGT CCAGCCCATC GAGCTGCCCC AGGACCGCGC CTCCGGCCTG
GACGAGACCT CGTTGCAGGC CCTCAACTAC GACGGCCAGC TCTTCGGTGT TCCCTACTCC
CAGGAGAACA TCTTCCTCAT GCGCAACACC GACCTGGCGC CGGACGCCCC CGCGACCTTC
GAGGAGATGG TCGAGGTCGG CACCGAGCTC AAGGACTCCG GTGAGACCAG CGAGGTCCTG
TCCATGGCCG TGGGCCAGGA GGGCGACCCC TACCGGATGA ACGCCCTGTT CACCTCCGCG
GGCGGCTACC TCTTCGGCCA GGACGAGGAG GGCAACTGGG ACCCGACCGA CCTGGGCGTG
GGCACCGACG AGTCCATCGC GGCCATGGAG AAGGTCGCCG AGTACGGCGA GGCCGGGGAG
GGCGTGCTGC GCCGCTCCAT CACCCTGGAG AACGACGCCT CCCTGTTCTA CGAGGGCGAG
GCCCCCTTCT TCGTCGCGGG TCCGTGGAAC GTCGCCGACG CCAACGAGGC GGGCGTCAAC
TACGAGATCA GCCCCATCCC CGGCTTCGAG GGCGAGGAGC CCGCCAGCCC CTACATCGGC
TACCAGGCGT TCTTCGTCAC CGAGGGCAGC GCCAACAGCG CCCTGGCCCA GGAGTTCGTG
ACCAACTACG TCACCGACAC CGACTTCGTC CTCAGCCTCT ACGAGGCCGA CCCCCGCATG
CCGGTGCAGA CCGAGGCCCT GGAGAGCGTC TCGGCCGACG ACCCCACCAT CGCCGCGATC
TCCGAGGCCG AGGCCGGGGC CGAGGGCATG CCGATGCCCT CCATCCCGGA GATGGGCGAG
ACGTGGGAGC CGCTGGGCAT CGCCCAGGCC GCCGTCATCG CCGGTGAGGA CGTGCGCGAG
GCCATGGAGG CCACCCACGA AACGATCGCC TCGCAGATCG GCGAGTAG
 
Protein sequence
MRFRSARAVT GIAAIALMAT ACSGGDEGGE AAAEGSLVIW SDPERADAIK AAAQEFAEAN 
GIEVEVQGLT FGDIQGDVLN AHQAGNAPDV FIGAHDWTGN LVRNGAVQPI ELPQDRASGL
DETSLQALNY DGQLFGVPYS QENIFLMRNT DLAPDAPATF EEMVEVGTEL KDSGETSEVL
SMAVGQEGDP YRMNALFTSA GGYLFGQDEE GNWDPTDLGV GTDESIAAME KVAEYGEAGE
GVLRRSITLE NDASLFYEGE APFFVAGPWN VADANEAGVN YEISPIPGFE GEEPASPYIG
YQAFFVTEGS ANSALAQEFV TNYVTDTDFV LSLYEADPRM PVQTEALESV SADDPTIAAI
SEAEAGAEGM PMPSIPEMGE TWEPLGIAQA AVIAGEDVRE AMEATHETIA SQIGE