Gene Ndas_3337 details


Gene Information

Locus tag: Ndas_3337
Symbol:
ID: 9247199
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3989551
End bp: 3990834
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003681249
Protein GI: 297562275
COG category:
COG ID:
TIGRFAM ID:
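
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Ndas_3337.
length = gene_length(3989551, 3990834)
print(length)                  # 1284
print(protein_length(length))  # 427
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.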


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.806542
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCGCA GAAGAACCCA CAAGTACGTT CCGCTCCCCG CCGCCGCCGC GGCGCTGGTC 
CTGGCCGCGA CGGCGTGCGG GAGCGGAGGA GGCGGGGCCT CCGGTTCCGA CGGCCTCCTG
GTCTGGATCA TGCAGGGCAC CAACCCCGAC GAGACGGGGT TCTTCGAGGC GGCCAACGCC
GCGTTCACCG AGGAGACCGG CATCGAGGTG GACGTCGAGT TCGTGCCGTG GCAGGACGCC
CAGAACAAGA TCTCCACCGC CATCGCGGGC GGCACCATGC CCGACGTCGC CGAGCTCGGC
AACACCTTCA CCCCCGGTTT CGCCGACGCC GGAGCCCTGC ACGACCTGTC CGGCTACGAC
ATCGACACCT CCCAGTACAT CCCCGGCCTG ATGGAGATGG GCCAGCTCGA CGACGGTGTC
TACGGCGTGC CCTGGTACGC CTCCATCCGC TCCGTCGTCT ACCGCACCGA CGTCTTCGAG
GAGCACGGCC TGGAGGTCCC CGAGAACTGG GAGGAGCTGC GCGAGACCGC CCTGGCCCTG
TCCGAGGCCG AGGAGGACAT GATCGCCTTC CCCGTGCCCG GAGACGCCCA GTACTCGGTC
ATGCCGTGGA TCTGGGGCGG CGGCGGGGAG ATCGCGGTCG AGCAGCCCGA CGGCACCTGG
GTCTCGGAGA TCGACAGCGA GGAGGCCCGT GCCGGGATCG GGTTCTTCAC CGGCCTGGCC
CTGGAGGACA ACACCTCCAC CACCGGCGCC GTCAACTGGA ACGAGATCGC CGTCATGGAG
GCCGTCGCGG AGGAGGAGGC CGCCATGGCC ATCCTCGGCA GCGCCAACCC CAAGGCCATC
CTGGAGGCCA ACCCCGACCT GGAGGGCAGG CTGGGCTCCT TCACCCTGCC CGGCCAGGAC
GGCGGGTACA TGCCCTCCTT CGCGGGCGGC TCGCTGCTGT CGGTCTTCGA GGGCACCGGC
CAGGAGGAGG CCGCCTGGCA GTACGTCCAG CACCTGACCG GCGAGGAGTT CGGCATGCGC
TGGTCCGAGG AGACCGGCTT CTTCCCGGGC GTGGTCGACC GGGTCGACAC CTTCTCCTCC
TCCGCCGACC CCATCCTGGA GCCCTTCGCC GTCCAGCTCA ACGAGGCCAG CCGGGGCGTG
CCCGTCACCC CCGCCTGGAC CCAGGTCGAG GCCGAGAAGG TCCTGGTCGG CATGCAGCAG
GACATCCTCA ACGGCGAGGC CACCGTGGAC GAGGCCACCG AGAACGCGGC CGACGAGATC
GAGCGCATCC TCAACGGGGG GTAG
 
Protein sequence
MARRRTHKYV PLPAAAAALV LAATACGSGG GGASGSDGLL VWIMQGTNPD ETGFFEAANA 
AFTEETGIEV DVEFVPWQDA QNKISTAIAG GTMPDVAELG NTFTPGFADA GALHDLSGYD
IDTSQYIPGL MEMGQLDDGV YGVPWYASIR SVVYRTDVFE EHGLEVPENW EELRETALAL
SEAEEDMIAF PVPGDAQYSV MPWIWGGGGE IAVEQPDGTW VSEIDSEEAR AGIGFFTGLA
LEDNTSTTGA VNWNEIAVME AVAEEEAAMA ILGSANPKAI LEANPDLEGR LGSFTLPGQD
GGYMPSFAGG SLLSVFEGTG QEEAAWQYVQ HLTGEEFGMR WSEETGFFPG VVDRVDTFSS
SADPILEPFA VQLNEASRGV PVTPAWTQVE AEKVLVGMQQ DILNGEATVD EATENAADEI
ERILNGG
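
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC percentage calculation and a basic CDS sanity check. The helper names (`clean_sequence`, `gc_percent`, `looks_like_cds`) are hypothetical, and the stop codon set is assumed to be the one used by translation table 11 (TAA, TAG, TGA).

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in stops


# First line of the gene sequence as printed in the record (six blocks of ten).
first_line = "ATGGCCCGCA GAAGAACCCA CAAGTACGTT CCGCTCCCCG CCGCCGCCGC GGCGCTGGTC"
cleaned = clean_sequence(first_line)
print(len(cleaned))  # 60
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field, and `looks_like_cds` offers a quick check that the sequence ends in a stop codon such as the final TAG shown above.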