Gene Ndas_4535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4535 
Symbol 
ID9248415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5378107 
End bp5379810 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content67% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003682428 
Protein GI297563454 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.494788 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAAAC GAAGGAAGAC GCTCGCGCTC GCCGCGGCGG GCACCTCCGC TTTGATGGTG 
CTGACCGCCT GCAGCGGCGG CGGTGGTGGC GAAGGCGAGC AGGAGAAGGA GGTCACATGG
GTGATCAACA GCCTTCCCGC CGCCTGGGGA GCCATCAGTA GCGCCGGAGG CAGCGTCTAC
GTCATCCAGG CACTCTCCGG CGTCGTGCCC TTCACCGGGC AGTACCAGCC CGACGCGACG
TACGAGTACG ACATGGACGT CCTCGCCGAG GAGCCCACCC TCATCAACGA CAATCCCGAC
GAGGGTCCGT TCCAGTTCAG CTTCACCCTC GCCGAGGACG CCGTGTGGAA CGACGGCACG
CCGATGACCG GCGAGGACCT GCGGGTCACC ATGATGATGT CGGCGTCCCC CACCGAGGGC
TACTGCGACA CCTGTGACTC CCGCGGCACC ACCGGCGCCG ACATGGTCGA GGAGGTCGAG
GTCGACGGCA AGACCGCGAC CTTCACCCTC AAGGAGGGCC TGTCCAACCC CGAGTGGATG
GGCATGTTCG ACGCGCACAG CGTTGGCGGC GGCTTCTACC CGGCGCACCT GGCCGAGGAG
AACGGCTGGG ACGTCGACGA CCCCGCGCAG CTCGGCGAGT ACTACGCCTG GCTGCACGAG
ACGCGCCCCG AGTGGTCCGG CGGCCCCTAC CAGATCGTGG ACGGCGACCT GGAGAACCAG
GTCGTCAAGG AGCCCAACCC CGAGTGGTTC GGTGAGACGC AGCCCGCGCT CGACCGCATC
ATCATGCCGT ACAACACCGA CGAGGGCACC TTCATCCCCG CCTTCCAGAA CGGCGAGATC
GACGGCGCCA ACCCCGCGCA GTACAGCGAG GACATCATCA CCCAGCTCCA GGGGATGGAG
ACCGCCACGC TCACCATCGG CGAGGGCAAC ATCTGGGAGC ACATCGACAT CAACACCGAG
AACGAGTGGC TCTCGGACGT CGAGCTCCGC AGGGCCGTGT TCACCGCGAT CAACCGCGAC
GAGATCGCCA GCCGCAACTT CGAGGCCGGA TACCCCGAGT ACGAGCTGAA GAACAACCAC
ATCTTCGGCA GCGACAGCGA GTACTTCGAG GACCTCGTCT CCGAGTCCGG GCAGGGCAGC
GGCGACGTCG AGGCCGCCAC CGCGATCCTG GAGGAGGCCG GTTACGAGCT CGACGGGGAC
ACCCTCATGC TCGACGGCGA GCAGGTCGGC CCGTTCCGCC TGCGCAGCAC CGACACCGTC
ATCCGCAACA ACTCCGTGCA GCTGATCCAG GCCCAGCTCG CCGAGATCGG CATCGAGACC
ACCATCGAGA TGACCGACGA CCTGGGCACG ATGCTGGCCG AGCAGGACTA CGACATCGTC
CAGTTCGGCT GGAGCGGCAG CCCGTACTTC GCCTCCAGCC CCGAGCAGTT CTGGCACTCC
GAGAGCACCA GCAACTTCGG CGGCTACTCC AACGACGAGG TGGACGAGCA CGCCGAGGCC
ACCGCGACGG CCGCCAACCT GGACGAGGCG GCCGAGCACG CCAACGCCGC CGTGGCCGCC
GTGGTCCCGG ACGCCTACGT CCTGCCGATC GTGGCCGAGC CCAACTACTT CTTCGTGAAC
GACAGGCTCG CCAACGTCGA GGACAACCTC CAGTCCAGCT ACCGCGCCAC CTACAACATC
GGTGAGTGGG ACCTCGCCGA GTAG
 
Protein sequence
MHKRRKTLAL AAAGTSALMV LTACSGGGGG EGEQEKEVTW VINSLPAAWG AISSAGGSVY 
VIQALSGVVP FTGQYQPDAT YEYDMDVLAE EPTLINDNPD EGPFQFSFTL AEDAVWNDGT
PMTGEDLRVT MMMSASPTEG YCDTCDSRGT TGADMVEEVE VDGKTATFTL KEGLSNPEWM
GMFDAHSVGG GFYPAHLAEE NGWDVDDPAQ LGEYYAWLHE TRPEWSGGPY QIVDGDLENQ
VVKEPNPEWF GETQPALDRI IMPYNTDEGT FIPAFQNGEI DGANPAQYSE DIITQLQGME
TATLTIGEGN IWEHIDINTE NEWLSDVELR RAVFTAINRD EIASRNFEAG YPEYELKNNH
IFGSDSEYFE DLVSESGQGS GDVEAATAIL EEAGYELDGD TLMLDGEQVG PFRLRSTDTV
IRNNSVQLIQ AQLAEIGIET TIEMTDDLGT MLAEQDYDIV QFGWSGSPYF ASSPEQFWHS
ESTSNFGGYS NDEVDEHAEA TATAANLDEA AEHANAAVAA VVPDAYVLPI VAEPNYFFVN
DRLANVEDNL QSSYRATYNI GEWDLAE