Gene Ndas_1049 details


Gene Information

Locus tag: Ndas_1049
Symbol:
ID: 9244895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1294487
End bp: 1295800
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003678998
Protein GI: 297560024
COG category:
COG ID:
TIGRFAM ID:
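
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the gene sequence listed in this record ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
start_bp = 1294487
end_bp = 1295800
gene_length_bp = 1314
protein_length_aa = 437


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Compare the derived values with the values stated in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

If either check came out false for a record like this one, the usual suspects would be a different coordinate convention, a spliced gene, or a partial CDS without a stop codon.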


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.340866
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTAC CCCCGGGCCG AGTGCGCCGA CACGGCCGCC GCGGCACCGC CCTGTCCGCG 
ACCGCCGCGG CCGCGGCGAC GGTCCTGCTC GCCTCCGCCT GCTCGGGCTC CGACGACGGC
ACCGTCGAGC TGCGCTTCTC CTGGTGGGGC TCCAACGAGC GCCAGGCCAC CATGCTCCAG
GTCATCGAGA ACTTCGAGGC GGACAACCCC GACATCCGGA TCACGGCGGA GACCACCGAC
TGGTCCGCCT ACTGGGACCG CCTGGCCACC ACCACCGCGG CCAACGACTC CCCGGACGTC
CTCATGCAGG AGGAGCGCTA CCTGCGCGAG TACGCCGACC GCGGCGCCCT GCTCGACCTG
GGCGAGGCCG AGGGCCTGGA CCTGTCGCTG ATCGACCCGC TGGTCGCCGA GAGCGGCCAG
CTGGACGGGC AGACCTTCGG CGCGGCCAGC GGCGTCAACG CCTACTCCAT CCACGCCGAC
CCCGAGGCCT TCGCCGCCGC GGGGGTGGAG ATGCCCGACG ACGACACCTG GACCTGGGCG
GACTACGTCG AGATCGCCGG GCAGATCAGC GAGGGCACCG GCGGCGAGAT CGCCGGCGCC
CAGAGCATGA GCTACAACGA GGCCGGTTTC CAGGTCTTCG CCCGCCAGCA CGGGGAGGCG
CTCTACAACG AGGACGGCAG CCTCGGCTTC TCCCAGGAGA CCCTGGAGGC CTGGTACGAG
ATCACCCAGG ACCTGCTGGA GAACGGCGGC CAGCCCAGCG CGGCCCGGAG CGTGGAGATC
CAGGCGGGCG GCATCGACCA GTCGGTCGTG GCCACCGGCG AGGGCGCCAT GGCGCACTTC
TGGAGCAACC AGCTCGGCAA CGTGGTCGAG GCCTCCGGGC GCGAGATCCA GCTCCTGCGC
TACCCCGGGG AGACCGAGTT CGACCGGACC GGCCTGTTCT TCAAACCGGC CATGTTCTAC
TCGATCTCCG CGGGCTCCGA GCACCCCGCG GAGGCGGCCC GCTTCGTCGA CTACATGCTC
AACGACCCGG CGGCGTCCGA GCTGCTCCTG GCCGACCTGG GCCTGCCCGC CAACACCGAG
GTCCGCGAGG CCATCCTCGA CGACCTGCCC GAGTCCGACG CCCGGATGGC CGAGTTCATG
GGCGAGATCG AGGGAACGAT CGTGGACGGC AACCCGCCCG CGCCGATCGG CGCCGGTCAG
GTCGTGGACA TCAGCAGCCG CGTCAGCGAC GGGCTCGCCT TCGGCGACCT CACCCCGGCC
GAGGCCGCCG AACAGTTCAT GACCGAGGTC GAGGCGGCCA TCGAGACCTC CTGA
 
Protein sequence
MRVPPGRVRR HGRRGTALSA TAAAAATVLL ASACSGSDDG TVELRFSWWG SNERQATMLQ 
VIENFEADNP DIRITAETTD WSAYWDRLAT TTAANDSPDV LMQEERYLRE YADRGALLDL
GEAEGLDLSL IDPLVAESGQ LDGQTFGAAS GVNAYSIHAD PEAFAAAGVE MPDDDTWTWA
DYVEIAGQIS EGTGGEIAGA QSMSYNEAGF QVFARQHGEA LYNEDGSLGF SQETLEAWYE
ITQDLLENGG QPSAARSVEI QAGGIDQSVV ATGEGAMAHF WSNQLGNVVE ASGREIQLLR
YPGETEFDRT GLFFKPAMFY SISAGSEHPA EAARFVDYML NDPAASELLL ADLGLPANTE
VREAILDDLP ESDARMAEFM GEIEGTIVDG NPPAPIGAGQ VVDISSRVSD GLAFGDLTPA
EAAEQFMTEV EAAIETS