Gene Ndas_2345 details

Gene Information

Locus tag: Ndas_2345
Symbol:
ID: 9246195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2795205
End bp: 2796479
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003680273
Protein GI: 297561299
COG category:
COG ID:
TIGRFAM ID:
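
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2795205
END_BP = 2796479
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one terminal stop codon, for an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2796479 - 2795205 + 1 gives 1275 bp, and 1275 / 3 - 1 gives 424 aa, which agrees with the listed gene and protein lengths.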


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.822896
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.413258
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGAACCC CCCGCCTGGG CTCCGCCGCC GGAGCGTCGG CGCTCTGCCT GCTCGCCGTC 
ACCGCCTGTT CCGGTGGCGG CGGCGGAGAC GACCGCATGC ACGTGTGGAT GTACCAGGAC
ACCCTGGTCG TGGTGCAGGA GGGCGCCGTC GAGAGGTTCA ACGGGGCCTC CGAGACCGAG
GCGGTCATCG ACGAGGTCCC CGGGGACAGC TACGAGGAGC GCCTGCGCAC GGCGATGGGC
TCCAGCGAGA AGCCCGACGT GTTCTTCAAC TGGGGCGGCG GCAGCATCGA GCCCTACGTC
GAGCAGGACA TGCTCGTCCC CCTGGACGAC ATGCTCGCCG AGAACCCCGA GTTCGCCGAC
TCCTTCATCC CCTCCATCCT GGAGGCGGGC AAGGTCGACG GCGTGCAGTA CGGCATCCCC
CTGCGCGGCA CCCAGCCGGT CATCCTCTTC TACAACGAGA CGGTGTTCGA GGAGGCCGGA
GCGGAGCCCC CCGAGACCTG GCAGGACATC CTGGACCTGG TCGACACCTT CACCGAGGAG
GGCGTCACCC CCTTCGCCCT GGCCGGGGCC GACCCCTGGA CCGAACAGAT GTGGCTCCAG
TACCTCGTGG ACCGCATCGG CGGACCGGAG GTGTTCGCGC GCATCGTGGA GGGCGACTCC
GAGGGCTGGC GCGACCCCGC CGTGCTGGAG GCCGCCCGGA TGGTCCAGGA GCTGGTGGAC
CAGGGCGCGT TCGGCAACTC CTACGCCTCG GTCAGCTACA CCGAGGGCGC GGCCTCGGCG
CTGCTGTCCG AGGGTCGGGC CGCCATGCAC CTGATGGGCT CGTGGGAGTA CTCCACCATC
CTGGACCAGA ACGAGGAGTT CGCGACGAAC GACCTCGGGT ACGTGGCGTT CCCGCCGATC
GAGGGCGGCG AGGGCGACCC CGCCAACGTG GTCGGCAACC CGACCAACTA CTTCTCGGTC
TCCGCCGAGA CCGAGTACAC GGACCAGGCC ATGGAGTTCC TGACGTACAT GTCCCAGGAG
GAGTACGTCG CCGACATGGT GGCGAACGGC GAGGTGCCCA CCACCACCAA CGCCGAGGAG
GTCGTCGCCG ACAGCCCCAG TCCGGACTTC GCCACCTTCC AGTACGAGAT GGTGCGCGAC
GCGCCGCACT TCCAGCTCTC GTGGGACCAG GCGCTGCCGC CGGAGGTGGC CACGCCGATG
GTCACCGAGA TCGAGTCGCT GTTCAACGGT GAGAGCACGC CCGAGCAGTT CGTCGACGCG
CTGGCGGCCC TGTGA
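
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be removed before any computation. The following is a minimal sketch of stripping that layout and computing GC content; the helper names are hypothetical, and only the first displayed row is used as sample input to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence, copied from this record.
first_row = "ATGAGAACCC CCCGCCTGGG CTCCGCCGCC GGAGCGTCGG CGCTCTGCCT GCTCGCCGTC"
seq = clean_sequence(first_row)

print(len(seq))  # 60 bases in this row
print(round(gc_percent(seq), 1))
```

Passing the full 1275 bp sequence through the same two functions should give a value comparable to the GC content field, assuming that field was computed over the gene sequence alone.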
 
Protein sequence
MRTPRLGSAA GASALCLLAV TACSGGGGGD DRMHVWMYQD TLVVVQEGAV ERFNGASETE 
AVIDEVPGDS YEERLRTAMG SSEKPDVFFN WGGGSIEPYV EQDMLVPLDD MLAENPEFAD
SFIPSILEAG KVDGVQYGIP LRGTQPVILF YNETVFEEAG AEPPETWQDI LDLVDTFTEE
GVTPFALAGA DPWTEQMWLQ YLVDRIGGPE VFARIVEGDS EGWRDPAVLE AARMVQELVD
QGAFGNSYAS VSYTEGAASA LLSEGRAAMH LMGSWEYSTI LDQNEEFATN DLGYVAFPPI
EGGEGDPANV VGNPTNYFSV SAETEYTDQA MEFLTYMSQE EYVADMVANG EVPTTTNAEE
VVADSPSPDF ATFQYEMVRD APHFQLSWDQ ALPPEVATPM VTEIESLFNG ESTPEQFVDA
LAAL