Gene Ndas_3638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3638 
Symbol 
ID9247507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4362559 
End bp4364226 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content68% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003681543 
Protein GI297562569 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.339986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.415005 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAGGA AGCCTTTCCG AAAGCCGCTG GCCCTCCTGG CCTCCGCGGC CGCGCTCACG 
CTGGTGGCCA CGGCGTGCGC CGAGAGTAAC CGCGAGGGCG GGGGGACCGA CGCGTCGGAA
CCCTTCGTCT TCGCCTCCGC GGGCGACATC AAAACCCTCG ACCCCTTCCT CACCAGTGAC
GGTGAGACCT TCCGTTACAG CAGGCAGGTA TTCGAAACCC TTCTCGAACA CGAATCGGGT
GGGACCGAAA TCGTCGGCGG ACTCGCCGAG GACTGGGAGC AGTCCGAGGA CGGCACCGTC
TGGACCTTCC ACCTGCGCGA CGGCGTCCTG TTCCACGACG GCGACGAGTT CAACGCCGAG
GCGGTCTGCG CCAACTTCGA CCGCTGGTAC AACCTCACCG GCGGTTTCCA GAGCTCGAAC
AACTCCTATT ACTGGCAGTC GATCTTCGGC GGCTTCGCGG AGAACGAGAG CGAGGACCTC
GCCGAGTCCC GGTACGTCTC CTGCGAGGCC ACCGACGAGC TGACCGCGGT CATCACGATC
GACGAGTACT CCTCGATCTT CCCCGGCGGC TTCAGCCTCG CCTCGTTCGG CATCATGAGC
CCCAGCACGC TGGAGGCCAT CGCCGACGCC GAGATCACCG GCGAGGAGGG CAACTTCACC
CTCCCCGAGT ACACCCAGAC GGCCGGAACC GTCGCGGGCA CCGGGCCCTT CACCGTCCAG
GAGTGGGACC ACGACCAGGC CGAGGTGACC CTCCAGCGCT TCGACGACTA CTGGGGCGAG
GCCGCGGGCT TCGAGACGAT GATCCTGCGC GCGATCCCCG ACGAGACCGC CCGCCGCCAG
GCCCTGGAGG CGGGTGACAT CCACGGCTAC GACCTGGTCG CCCCCGCCGA CGTCGCCCCC
CTGTCCGAGG CCGGGTTCCA GGTGCCCACC CGCGGCGTGT TCAACGTCCT GTACATGGCC
TACCAGCAGG AGGCCAGCGA GGCGCTCGCC GACCTTGAGG TGCGCCAGGC CCTCGCCCAC
GCCGTGGACC GCCAGCGCAT CGTCGACACG ATCCTGCCCG AGGGCGGCGA GGTCGCGAGC
CAGTTCCACC CCGACACCCT CGACGGCTGG TCCCCGGACG TGCAGACCTA CGAGTACGAC
CCCGAACTGG CCAGGGAGAT GCTGGCGGAC GCCGGGCAGG AGGACCTGAC CCTGGAGTTC
TGCTACCCGA CCGACGTCAC CCGCCCCTAC ATGCCCGCGC CGCGCGACAT CTTCGACGTC
ATCGCCGCGG ACCTGGAGGC GGTCGGCGTC ACCGTGGAGC CGGTCACCTA CGAGTGGACC
GAGTACGTGC CGCGCACCAA CTCGGGTGAG TGCCCGCTGT ACCTGCTCGG CTGGACCGGC
GACTACAACG ACGCCTACAA CTTCATCGGC ACCTGGTTCT CCCAGTACAA CAGCGAGTTC
GGCTTCCGTG ACGAGGACCT GTTCGAGGCC ATGGAGGAGG CGAGCACCAA CCCGAACCAG
GAGGAGCGCG TCGCCGCCTA CCAGGACCTG AACAACCAGA TCATGGACAT CCTGCCGGGG
CTGCCCATCT CCAGCTCCCC GCCGTCCATC GCCTTCTCCG CGAACGTCAA CCCGCCCAAC
GTCAGCCCGC TGACCCAGGA GCAGTTCGCC GAGGCCTCCT GGAAGTAG
 
Protein sequence
MFRKPFRKPL ALLASAAALT LVATACAESN REGGGTDASE PFVFASAGDI KTLDPFLTSD 
GETFRYSRQV FETLLEHESG GTEIVGGLAE DWEQSEDGTV WTFHLRDGVL FHDGDEFNAE
AVCANFDRWY NLTGGFQSSN NSYYWQSIFG GFAENESEDL AESRYVSCEA TDELTAVITI
DEYSSIFPGG FSLASFGIMS PSTLEAIADA EITGEEGNFT LPEYTQTAGT VAGTGPFTVQ
EWDHDQAEVT LQRFDDYWGE AAGFETMILR AIPDETARRQ ALEAGDIHGY DLVAPADVAP
LSEAGFQVPT RGVFNVLYMA YQQEASEALA DLEVRQALAH AVDRQRIVDT ILPEGGEVAS
QFHPDTLDGW SPDVQTYEYD PELAREMLAD AGQEDLTLEF CYPTDVTRPY MPAPRDIFDV
IAADLEAVGV TVEPVTYEWT EYVPRTNSGE CPLYLLGWTG DYNDAYNFIG TWFSQYNSEF
GFRDEDLFEA MEEASTNPNQ EERVAAYQDL NNQIMDILPG LPISSSPPSI AFSANVNPPN
VSPLTQEQFA EASWK