Gene Ndas_1085 details


Gene Information

Locus tag: Ndas_1085
Symbol:
ID: 9244931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1332883
End bp: 1334238
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003679033
Protein GI: 297560059
COG category:
COG ID:
TIGRFAM ID:
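
The listed lengths follow from the coordinates, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and only serve the illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1332883, 1334238)
print(length)                     # 1356
print(protein_length_aa(length))  # 451
```

Both results agree with the Gene Length and Protein Length fields of the record.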


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.474529
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCACC CACCACGCAG ACCCAGCCCG CCGACCGGGC CGCCTCCGCC GTTCTCGCGG 
CGCCGCGTGC TCCTGGGCAC GGGAGCCCTC GCCCTGGGCT CGGCCCTGGG CGCCGCCGGA
TGCGCCCCCG CCCCCGGTTC GGGATCGACC ACGCAGGTGC GGTTCTGGAG CCTGTTCCAG
GGCGGCGACG GCGCCCGGGT GCAGACCATG CTGGACGCGG TGCGCGAACA GGCCCCGCAC
CTGGACGTCA CCCCCAGCAC ACTGGCCTGG GGACCGCCGT ACTACACCAA GCTGGCGGTG
GCCTCCGTGG GCGGTCGGGC CCCCGAGACG GCCGTGCTGC ACCTGTCCCG CCTGCCCGGG
TACGCCCCCG GCGGGCTGCT CGAACCCTTC GACCTGGACC TGCTGGCCGA GTTCGGGGTC
ACCGCCGAGG ACTTCGTACC CGACCTGTGG GAACGCGGCA TCCACGACGG CGCCACCTAC
GCCGTCCCGC TGGACACCCA CCCGGTGATC GTCTTCTACG ACGCCGAAGT CGCCGACCGG
GCCGGTCTGC TCGACGGGGA CGGGAAGCTG ACCGGGATGG ACTCCCCCGA GGGGTTCCTC
GCGGCCTCCC GGGCGCTGGC CGAGGCCGGG GGCGGCAACG GCGTCTCCTA CGGGCACGTC
AACGACGACT CCCAGGGGTG GCGGCTGTTC TGGATGCTGT ACAACCAGAC CGGCGCGTCC
ATGGAGCTGC CCGGGGGCGG ACCGGCGGTG TTCGACCGCG ACGCGGCGCT GCGCGTGTAC
TCCTTCCTCG CCGAACTGCT CGACGGCCGG ACGTCGGAGC CGGACCTGGA CTACCCCACC
GCCCTGGCGG CCTTCGCCTC GGGGCGCTCG GCGATGCTCG TGTGCGGGGA GTGGGAGCTG
CCCTACCTGT CGGAACACGT GGAGAACCTG GGGGCGGCCC CCTTCCCCAC GGTCTTCGAC
CAGCCCGGCG GGTACGCCGA CTCCCACGCC TTCGTGCTGC CCCGCCAGGG CGACCCCGAC
CCCGCACGGG TGCGCGCCGC CCACGAGTTC GTGGCGCTCA TGGTGCGCAA CAGCCTGATC
TGGGGCGAGG CGGGCCACAT CCCGGCCTAC TCGCCGATCG CCCAGTCGCC GGAGTACCTG
GCGCTGGACC CGCAGTCGGA CTACGCCGCC GCCGGGGAGA CCCCCGTGCT CGACCCCGAG
GTGTGGTTCG CCGGGGCCGG ATCGCGGTTC CACTCCGACG TGAGCGAGGC GCTGCGCACG
GCCCTGACCG GCGACGGACC CGAGGCGGCG GTGGACCACC TGGGCCGGAC CCTGGACTCC
TGGGCCGCCC GCACCAACCC GGGAGGCCAG GAATGA
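
The gene sequence is displayed in space-separated blocks of 10 bases, 60 per line. A GC content figure such as the one in the record can be recomputed from this layout with a minimal sketch like the following. How the database itself derives and rounds its value is not stated, so this is only one plausible calculation, and the function names are hypothetical.

```python
def normalize_sequence(raw: str) -> str:
    """Remove display spaces and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = normalize_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGCCGCACC CACCACGCAG ACCCAGCCCG CCGACCGGGC CGCCTCCGCC GTTCTCGCGG"
print(len(normalize_sequence(first_line)))  # 60
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full multi-line sequence as one string works the same way, because all whitespace is removed before counting.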
 
Protein sequence
MPHPPRRPSP PTGPPPPFSR RRVLLGTGAL ALGSALGAAG CAPAPGSGST TQVRFWSLFQ 
GGDGARVQTM LDAVREQAPH LDVTPSTLAW GPPYYTKLAV ASVGGRAPET AVLHLSRLPG
YAPGGLLEPF DLDLLAEFGV TAEDFVPDLW ERGIHDGATY AVPLDTHPVI VFYDAEVADR
AGLLDGDGKL TGMDSPEGFL AASRALAEAG GGNGVSYGHV NDDSQGWRLF WMLYNQTGAS
MELPGGGPAV FDRDAALRVY SFLAELLDGR TSEPDLDYPT ALAAFASGRS AMLVCGEWEL
PYLSEHVENL GAAPFPTVFD QPGGYADSHA FVLPRQGDPD PARVRAAHEF VALMVRNSLI
WGEAGHIPAY SPIAQSPEYL ALDPQSDYAA AGETPVLDPE VWFAGAGSRF HSDVSEALRT
ALTGDGPEAA VDHLGRTLDS WAARTNPGGQ E
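
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds the codon table from the usual compact TCAG-ordered amino acid string and translates until the first stop codon. It is an illustration, not the database's own pipeline: table 11 also allows alternative start codons, which this sketch ignores because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGCCGCACC CACCACGCAG ACCCAGCCCG"))  # MPHPPRRPSP
```

The output matches the first ten residues of the protein sequence listed above.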