Gene Ndas_1926 details


Gene Information

Locus tag: Ndas_1926
Symbol:
ID: 9245776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2346178
End bp: 2347239
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003679859
Protein GI: 297560885
COG category:
COG ID:
TIGRFAM ID:
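
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2346178
END_BP = 2347239


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1062 353
```

Under these assumptions the computed values agree with the listed gene length (1062 bp) and protein length (353 aa).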


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.932979
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCA CCGCACACCC GCGCCGCATG CTCGCCGCGC TCGCCGGGAC GGCGGCGCTG 
GCCCTGGTGA GCGCCTCGTG CGCACCGGTG CGCGAGGACG ACCCCGACAC GCTGGTGGTG
AGCACCTTCG CCTTCGCCAC CGAGGAGTTC ACCGAGGTGG TGGCCGACCC CTTCGAGGCC
GAGACCGGGA TCCGGGTGGT CCTGGACACC GGCAACAACG CCGGGCGGCT CACCAAGCTC
AGGATCAACG CCGAGACGCC CGACACCGAC GTCGTGCTCA TCTCCGACTA CTACGCCCAG
ATCGGCAAGG ACATGGGGCT GTTCGCCCCC GTCGACCCCG CCGACGTGCC CAACCTCGAC
GCCATCCAGC CCTGGGCGGT GGACCCGGAC GGGTACGGCC CCGCCTACAC CTTCCAGCTC
CTGGGCCTGC TCTACCGCAC CGACCTCGTC GAGGAGGCCC CCGACTCCTG GGACGACCTG
TGGGCCGAGC CCGAGGGAGG GTACGTGCTG CCCGACATCT CGGTCTCGGC CGGTCCGATG
TTCGTCCTGG CCGCGGGCGA ACACTTCGGC TCGGGTCCCT CCGACCCCGA CGCCGGCTTC
GAGGCGATGG GCCGGATCGG CGCGGACGCG CTCCAGTTCT ACACCGGCTC CACCGAGCTC
ACCAGCCTGC TCGAACGCGG TGAGATCGCC ATGGCGCCCG GCCTGGACAA CTTCGCCATG
GGCTCGGTGG AGGCCGGGCA GCCGATCGGC TTCGCCGCGC CCGAACAGGG CCGGGTGATG
ACCGCCAACA CCGTCCAGGT GGTCGACGGG GCGCCCAACG AGGCCGGTGC ACTGGCCTTC
GTCGACTTCC TGCTGCGCCC CGAGATCCAG GAGGGGATGG CCGAGGCCCT CTACGACAAG
CCCGTGGCGC TGGAGGCCGA CCCCACCCCG CTCATGGAGC GGGTGTCGGG ACAGGCCGCC
TCCTCCCCGT CCGACAGCGG CTACCACCAG GGCGACCTGG CCCTCATCGC GCAGGAGCGC
TCCACCTGGC TGGACCGCTT CACCGAGGAG GTGGCGCGGT GA
 
Protein sequence
MSRTAHPRRM LAALAGTAAL ALVSASCAPV REDDPDTLVV STFAFATEEF TEVVADPFEA 
ETGIRVVLDT GNNAGRLTKL RINAETPDTD VVLISDYYAQ IGKDMGLFAP VDPADVPNLD
AIQPWAVDPD GYGPAYTFQL LGLLYRTDLV EEAPDSWDDL WAEPEGGYVL PDISVSAGPM
FVLAAGEHFG SGPSDPDAGF EAMGRIGADA LQFYTGSTEL TSLLERGEIA MAPGLDNFAM
GSVEAGQPIG FAAPEQGRVM TANTVQVVDG APNEAGALAF VDFLLRPEIQ EGMAEALYDK
PVALEADPTP LMERVSGQAA SSPSDSGYHQ GDLALIAQER STWLDRFTEE VAR
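
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the nucleotide sequence, and compute a GC fraction. It is an illustration under stated assumptions rather than the method used to produce this record: the helper names are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG).

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence as printed in this record.
first_line = "ATGAGCCGCA CCGCACACCC GCGCCGCATG CTCGCCGCGC TCGCCGGGAC GGCGGCGCTG"
print(translate(first_line))  # prints: MSRTAHPRRMLAALAGTAAL
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.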