Gene Ndas_5255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5255 
Symbol 
ID9249153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp417340 
End bp419295 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content69% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003683141 
Protein GI297564168 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.313455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGAC GAACGCCAGG CGGCGCGAGG CGGTGGCTGG GCGCCGCCGC GGGAGCGGCG 
GCGGTCGTGG TGACCGCCGG GTGCACCTTC TTCTCCCTCG ATCCCGAGGT CGAGGAGGGC
GAACGGGTGG AGGGCACCGG CGAGGCGCCG ATGCTCACCG CCCTCGTGGA GTCGGGCGAC
CTGCCCCCGC TGGAGGAGCG GCTGCCCGAC GAGCCGCTGG TGGTGGAGCC CCACGACCGC
GTCGGCGTGT ACGGCGGCGA GTGGAACAGC GCCATCCTCG GCGTGGGCGA CTGGCCCTGG
CTCGGCAGGA CCGTGGGCTA CGAGAACCTC ACCCGCTGGG ACCCGGAGTG GCAGGAGGTG
ATCCCCAACC TCGCCGAGTC CTGGGAGTAC AACGAGGACG CCACCGAGCT GACCTTCACC
CTGCGCGCGG GCCTGCGCTG GTCCGACGGC GAGCCGTTCA CCTCCGACGA CGTCGTGTTC
GCCTTCAACG ACATCTTCAA CAACGAGGCG CTGACGCCGG TCGCGGCCGC CGATCCCGGC
ACCGCCGAGA AGGTGGACGA GCGGACCTTC ACCATCACCT TCGACGAGCC CGACGCCCTG
TGGGCCGGGT ACGACCTCCT CCAGTACCAG GTGGTGACCA AGCCCAAGCA CTACCTGGAG
CGGTTCCACA TCGAGTACAA CCCCGACGCC GACGAGCTGG CCGAGGAGGA GGGGTACGCC
GACTGGGTCG AGATGTTCGA GGCCGAGGCG GGCGTGATCG ACAGCTCCCG GTACTGGCAG
AACCCCGACA TCCCCACCAT GTACCCGTGG CGGGTCGTGG AGCCGCTGGC CGACTCCGGG
CGGATGGTGC TGGAGCGCAA CCCCTACTAC TGGAAGGTGG ACACCGAGGG CAACCAGCTC
CCCTACATCG ACCGGGTCGT CTTCGACATC CTCCCGGACG AGGAGGTCAT GCTGGTCAGG
GCGCTCAACG GCGAGTTCGA CATGCACTCG CGGCACTTCA ACACCCTGGA GAACAGGCCC
ACCCTGGCCG AGGCGCGCGA GTCGGGCGGC TACGACTTCT TCGAGCTGCG GCCCGCCGAG
ATGAACACCG CGATGATCTC CCTCAACCTC ACCCACGAGG ACGAGGAGCT GCGCGAGACC
TTCAACGACC GCGACTTCCG GGTGGCGCTC TCACACGCCG TCAACCGCCA GGACATCATC
GACGTCGTCT ACCGCGAACA GGGCGAGCCC TGGCAGGGCG CGCCCCGCGA GGACAGCCCC
TTCCACAACG AGGAGCTGGC CAAGCAGTAC ACCGAGTACG ACCCGGACCT GGCCAACGGG
ATCCTCGACG AGGCGGGCTA CGACGAACGC GACTCCGACG GCTTCCGCAC GAGCCCGCGC
GGCGAGACGG TGCGCTTCAC GCTGTCGGTG CCCACGGACT TTCGCCCCGA CATCGTGGAC
TCGATGGAGA TGGTCGTCGG CTTCTGGCAG GAGCTGGACA TCGACGTGGA GCTCAACACC
GAGGACCGCT CGCTGTGGCA GACCCGCCGG GAGAACAACG AGCACGACGC CAACGTGTGG
TCGGGCGACA ACGGCATGAT GGACGCGATG TACGACCCCC GCTGGTACGC GCCCACCCAG
AGCGGGGAGT CCAACTTCGC CATCCCGTGG GCCCAGTGGT ACGTCTCCGA CGGCGAGGAC
CCGCGCTCCC AGGAGCCGCC CGCCGACGTG CGCGAACACC TGGAGCTGTA CGACGCCGTC
CAGGCCGAGC CCGACCCCGG GGCCCGCGAG GAGCTGATGC GCGAGTTCCT GTCGGTCTCC
CAGGAGCGGT TCTACGCGAT GGGCGTCAGC CTGAGCCCGA CCGGCTACGG GATCGTCGCG
GACGACTTCC ACAACGTGCC CGGGTCGATG CCCTCCTCCG GCAACTACAA CGACCCCGGG
CCGACCAACC CCGAGCAGTA CTTCATCGAG GAGTGA
 
Protein sequence
MRRRTPGGAR RWLGAAAGAA AVVVTAGCTF FSLDPEVEEG ERVEGTGEAP MLTALVESGD 
LPPLEERLPD EPLVVEPHDR VGVYGGEWNS AILGVGDWPW LGRTVGYENL TRWDPEWQEV
IPNLAESWEY NEDATELTFT LRAGLRWSDG EPFTSDDVVF AFNDIFNNEA LTPVAAADPG
TAEKVDERTF TITFDEPDAL WAGYDLLQYQ VVTKPKHYLE RFHIEYNPDA DELAEEEGYA
DWVEMFEAEA GVIDSSRYWQ NPDIPTMYPW RVVEPLADSG RMVLERNPYY WKVDTEGNQL
PYIDRVVFDI LPDEEVMLVR ALNGEFDMHS RHFNTLENRP TLAEARESGG YDFFELRPAE
MNTAMISLNL THEDEELRET FNDRDFRVAL SHAVNRQDII DVVYREQGEP WQGAPREDSP
FHNEELAKQY TEYDPDLANG ILDEAGYDER DSDGFRTSPR GETVRFTLSV PTDFRPDIVD
SMEMVVGFWQ ELDIDVELNT EDRSLWQTRR ENNEHDANVW SGDNGMMDAM YDPRWYAPTQ
SGESNFAIPW AQWYVSDGED PRSQEPPADV REHLELYDAV QAEPDPGARE ELMREFLSVS
QERFYAMGVS LSPTGYGIVA DDFHNVPGSM PSSGNYNDPG PTNPEQYFIE E