Gene Ndas_0752 details

Gene Information

Locus tag: Ndas_0752
Symbol:
ID: 9244594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 922297
End bp: 923616
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: putative extracellular solute-binding protein
Protein accession: YP_003678703
Protein GI: 297559729
COG category:
COG ID:
TIGRFAM ID:
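
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive (which is consistent with the stated 1320 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 922297
END_BP = 923616


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1320 439
```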


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.31197
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCGACA CCACCCCCCA CCACCCCCCA CCACACGTCG GGCCCGGTCG CCGGGCCCTG 
CTCACCGGCG CCCTGGCCGC GGCCGGACTG GCCGCCGCGG GCTGCGCGCC CCCCGACGCA
CTGCTGACCG GGGACACCCG CCTGCGGCAG TGGAACCTCT TCTCGGGCGG CGACGGCCTG
CGGATGATCG AGATGCACGA CGCCTACCGG GCCGAGCACC CGGAGGTCGA CTTCCGGGCG
ACCACCTTCA CCTGGGGCTC CCCCTTCTAC ACCAAGGTCG CGATGGGCGC GGCGGGAGGG
CGCGGCGCCG ACATCGCCAC CGTGCACGTC TCCCGCCTGG AGAGCCTGGC CCCCGGGAGG
CTGCTGGACC CGGTCGACCC GGCGCTGCTG GCCGAGGCCG GGATCGACGA CACCGTGATC
CCGCCCAACG TCTGGGAGAA GTGCTTCTTC GACGGGCAGC TGTACGCGGT CCCCATCGAC
ACGCACGTCC TGATCCAGTA CCACAACCTC GACGTGTGCC GCGAGGCCGG ACTGCTCGAC
GCCGACGACC GGCTGGTCCA GGTCAGCGGG CTGGACGACT ACATGGCCAT GCTCCGCGAG
ATCAAGGCGG TCACCGGGGC CTACGGCCTG TCCGTCGACA CCTGGCAGCC CTGGCCCAAC
TTCTGGGCGC TCTACCGCCA GCAGGACGGG GAACTCCTCC TGGGCGAGGA CGACTTCACC
ATGGACGACG ACAAGGCCCT GGCGGCCATG GAGGTCATGT ACCGGCTCTC CGAGGAGGAG
CTGGCGCCCC GCCACTCGAT GCTGGCCGAC ACCGCGGCCA ACCTCTCCAA CGGCAGGGCC
GGGCTGATGA TCCACGGCAA CTGGGAGATC CCGACGCTGG AGGCCGCCGG AACGGCCTTC
TCGGCGTCCC AGTTCCCCGA CGTCTTCGGC AACCGCCGCA CCCGAGGCGA CTCGCACTGC
TACGTGTTCC CGCACCAGCG CGACCCCGAC CCCGAGCGGA TCCGGGCCGC CGTCGGATAC
GCCGCATGGA TGCTGCGCCA CAGCCTCACC TGGGCCGGGG GCGGCCACAT CCCCGCCTAC
CGGCCCGTGG TCGAGAGCGC CGAGTACGAG GCGCTGCACC CCCAGTCCGC GTACCGCGAG
GCGGCCGAGA ACGTGCAGTT CGAGCCCGAG GCCTGGTTCA GCGGCTCGGC GGGGCGCCTC
CAGGAGGAGG CCAACGGCCC GCTCACCACC CTCCACCAGG GGACCCAGAC ACCCGAACAG
GCGCTGGAAC AGCTCAAGGG AGCCATCCGC GACCTGCTGA CCGTGCCGTC ACCGGTGTGA
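
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. As a minimal sketch, the Python helpers below strip the layout whitespace and compute the GC fraction. The names `clean_sequence` and `gc_content` are hypothetical. Applying `gc_content` to the full sequence gives a value that can be compared with the GC content field in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, as laid out in the record.
fragment = clean_sequence("GTGCGCGACA CCACCCCCCA")
print(fragment)  # GTGCGCGACACCACCCCCCA
print(f"{gc_content(fragment):.0%}")
```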
 
Protein sequence
MRDTTPHHPP PHVGPGRRAL LTGALAAAGL AAAGCAPPDA LLTGDTRLRQ WNLFSGGDGL 
RMIEMHDAYR AEHPEVDFRA TTFTWGSPFY TKVAMGAAGG RGADIATVHV SRLESLAPGR
LLDPVDPALL AEAGIDDTVI PPNVWEKCFF DGQLYAVPID THVLIQYHNL DVCREAGLLD
ADDRLVQVSG LDDYMAMLRE IKAVTGAYGL SVDTWQPWPN FWALYRQQDG ELLLGEDDFT
MDDDKALAAM EVMYRLSEEE LAPRHSMLAD TAANLSNGRA GLMIHGNWEI PTLEAAGTAF
SASQFPDVFG NRRTRGDSHC YVFPHQRDPD PERIRAAVGY AAWMLRHSLT WAGGGHIPAY
RPVVESAEYE ALHPQSAYRE AAENVQFEPE AWFSGSAGRL QEEANGPLTT LHQGTQTPEQ
ALEQLKGAIR DLLTVPSPV
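
The gene begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the alternative start codons and is read as methionine when it initiates translation. The Python sketch below illustrates this. It is an assumption-labelled illustration and not the database's own pipeline: the codon table is built from the standard NCBI amino-acid string (table 11 shares these assignments), the start-codon set is the one listed for table 11, and `translate_cds` is a hypothetical name.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS, reading a table 11 start codon in first position as M."""
    seq = "".join(text.split()).upper()  # drop the 10-base block layout
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("GTGCGCGACA CCACCCCCCA CCACCCCCCA"))  # MRDTTPHHPP
```

The output matches the first ten residues of the protein sequence listed in the record.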