Gene Ndas_4930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4930 
Symbol 
ID9248817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp65582 
End bp66841 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content73% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003682819 
Protein GI297563846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.265943 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGAAC ACCTGTACCG ACACCCGCAC CGCCCGCTAC GAACGGCGTC GGCGGCCGTC 
TGCGCCGCGG CGCTCGCCCT GACCGGAGTC GGCTGCGCGG CCGAACGCGA CCCGAGCGTG
TACACCATCA TGGACTCCAG CACCGACGAG CCCTACCACA CCTGGGACCA GGAGGCGATG
GACGCCTGCG GGGAACAGGT GGGCGTGGAG GTGCGCCACA TCAGCGTGCC CGCCGACCAG
CTCGTGCCCA AGGCCCTGCG GATGGCCTCC TCCGACTCGC TCCCCGACGT GCTCAACCTC
GACGGCTCGG ACCTGCCGCA GTTCGCGGCG GCCGGGGGCC TGGTACCGCT GGAGGAACTC
GGCATACCCA CGGACGGCCT GTCCGAGGGC GCGCTGTCCA TCGGCAGCTA CGAGGGCGTG
TACTACGGCG CCGCCCGGTC GGTGAACTCC CTCGGCCTCT TCTACAACGC GGCCGCCCTG
GAGGAGGCGG GCATCGACCC GCCCCGGACC TGGGCGGAGC TGGAGGAGGC CGCCGCCGAG
CTGACCGGGG GCGGGCGCTA CGGCCTGGCG ATCAGCGCCC TGGCCACCGA GGACGGCGTC
TACCAGTTCC TGCCGTGGCT GTGGTCCAAC GGCGGCGACG AGAGGGAGCT GGCCTCACCG
GAGTCGGTGG AGGCACTGGA GTACGTCACC TCGCTGGTCG AGGCGGGATC GGTCTCCCCC
TCGGTGGTCA ACTGGACCCA GGCCGACGTC AACGACCAGT TCATCGCGGG CAACGCGGCC
ATGATGGTGA ACGGCCCCTG GCAGCTGCCG GTCCTCCAGG AGCACCCCGA CCTGGAGTGG
GCCGTGGCGG AGATCCCCGT GCCCGAGGCC GGGGACACCT CGGTGGCTCC GATCGGCGGG
ACCACCTTCA CCGTGCCGGT CAACGCCGAG GACCCCGACC GCGAGCGCGT GGCCGCCGAA
CTCGTGGCCT GCCTGACCAC GGCCGAGGCG CAGCTGGACT GGTCCACCAA GGGCAGCAAC
GTGCCCGTGG ACACCGGGGC GGCCGAGCAG TACCGCGACC TGGTCCCGGA ACTGGCCCCG
TTCGTGGACC AGGTGCGCAC GGCGCGCAGC CGGACCGAGC ACGCCGGCAC CGAGTGGAAC
GCCTACTCCC AGGCCATCGG CACCGCGCTC CAGGCGGCGC TCACCGGCGA GGCGAGCCCG
CGGGAGGCCA TGGAACGCGC CCAGGCGCGG GTCGAGGCGG AGCTGGAGGC CCGGTCATGA
 
Protein sequence
MREHLYRHPH RPLRTASAAV CAAALALTGV GCAAERDPSV YTIMDSSTDE PYHTWDQEAM 
DACGEQVGVE VRHISVPADQ LVPKALRMAS SDSLPDVLNL DGSDLPQFAA AGGLVPLEEL
GIPTDGLSEG ALSIGSYEGV YYGAARSVNS LGLFYNAAAL EEAGIDPPRT WAELEEAAAE
LTGGGRYGLA ISALATEDGV YQFLPWLWSN GGDERELASP ESVEALEYVT SLVEAGSVSP
SVVNWTQADV NDQFIAGNAA MMVNGPWQLP VLQEHPDLEW AVAEIPVPEA GDTSVAPIGG
TTFTVPVNAE DPDRERVAAE LVACLTTAEA QLDWSTKGSN VPVDTGAAEQ YRDLVPELAP
FVDQVRTARS RTEHAGTEWN AYSQAIGTAL QAALTGEASP REAMERAQAR VEAELEARS