Gene Ndas_3658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3658 
Symbol 
ID9247527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4388668 
End bp4389768 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content66% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003681562 
Protein GI297562588 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.720757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATGC TGCGGCTTCC GGCGGCACTC GCGGCCCTGG CCCTCGCGGT CAGCGCCTGC 
TCGGGTGGCG GCGGGAACAG CGACGGCGGT TCGGGTCAGT ACCCCAGGAA CGAGACCCTG
TACACCACGG GTACGGCCTG GGAGGCGCCG ACCAGCTGGA ACCCGATGAT GCGGGGCCAG
TTCGCGGTCG GCACCAACGG CCTGGTCTAC GAGTCGCTCT TCCACTACGA CGCGGACGCG
GGAGAGTACG TCCACTGGCT CGCCGAGAGC GACGAGTGGA CCTCGGAGAC CGAGCACGTG
ATCACCCTGC GCGAGGGCGT CACGTGGAAC GACGGCGAGC CCTTCGTCGC CCAGGACGTG
GTCACCACGC TGGAACTCGG CCAGGTCCCC GGAGTCCCCT ACAGCAACGT CTGGGACTAC
ATCGAGAGCG TCGAGGCCAC CGACGAGCGC ACGGTCACCG TCACCTTCTC GGAGAGCCGT
CCGCAGGAGT GGATGAACTG GGCCTACTCC AACCCCATCG TCCCGGACCA CATCTGGGCC
GGCATGGAGG AGAGCCAGGT CGCCGACAGC CCCAACGAGA ACCCGGTCGG CACCGGCCCC
TACGTCTACG AGTCGCACAC CGACGACCGC ATGGTCTGGG AGCGCAACGA CGAGTGGTGG
GCCATCGAGG CCCTCGACAT GACGATGGAC GCCCGCTACA TCGTCGACAT CGTCAACGCC
TCCAACGAGG TCACGATGGG CATGCTGAAC CAGGGCGAGG TCGACCTCTC CAACAACTTC
CTGCCCGGTA TCGACCAGGT CCTCAACAGC AACGAGACCA TCACCAGCTT CTACGACGGC
CCCCCGTACA TGAAGAGCGC CAACACGGCG TGGCTCATCC CGAACCACAC CCGTGAGCCG
CTCAACGACA CGGCGTTCCG CCAGGCCCTG GCCCACTCGA TCAACATCAC CCAGATCGTC
GAGGGCCCGT ACGCCAACCT GGTCCAGGCG GCCAACCCCA CGGGTGATGA TGCGGGGCCA
GTTCGCGGTC GGCACCAACG GCCTGGTAAC GAGTCATATA TATCTACGAC GAGGTCGCGG
AAGAGAACGT CCACTGGCTA A
 
Protein sequence
MRMLRLPAAL AALALAVSAC SGGGGNSDGG SGQYPRNETL YTTGTAWEAP TSWNPMMRGQ 
FAVGTNGLVY ESLFHYDADA GEYVHWLAES DEWTSETEHV ITLREGVTWN DGEPFVAQDV
VTTLELGQVP GVPYSNVWDY IESVEATDER TVTVTFSESR PQEWMNWAYS NPIVPDHIWA
GMEESQVADS PNENPVGTGP YVYESHTDDR MVWERNDEWW AIEALDMTMD ARYIVDIVNA
SNEVTMGMLN QGEVDLSNNF LPGIDQVLNS NETITSFYDG PPYMKSANTA WLIPNHTREP
LNDTAFRQAL AHSINITQIV EGPYANLVQA ANPTGDDAGP VRGRHQRPGN ESYISTTRSR
KRTSTG