Gene Ndas_4050 details


Gene Information

Locus tag: Ndas_4050
Symbol:
ID: 9247922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4841688
End bp: 4845269
Gene Length: 3582 bp
Protein Length: 1193 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003681952
Protein GI: 297562978
COG category:
COG ID:
TIGRFAM ID:
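
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Both are common conventions, but this record does not state them. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 4841688
END_BP = 4845269
GENE_LENGTH_BP = 3582
PROTEIN_LENGTH_AA = 1193


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: the span covers 3582 bp, which is 1194 codons, or 1193 residues plus one stop codon.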


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTTCCTC CCGACACTCC CCCGAGCGCG CCGCTGTCCG TCGGCGAGCA GACCGGCATG 
CTCACCGACG GTGAGAACAA GGTCAGTCCC GCGACCTTCC CCGTCCCCTC CCCCTCCCCC
GACTACATCG AGACCCTGGC CTCCGACCTG CGCAGCGCCG GGGAGTCGGT CGGCGACACC
GGAAACGACA TCACGTCCGC CTGGAGCGGT CTGAGGTCCC ACTACAAGGC TCCCGAGGCG
GAACACCTCT ACTCCGTGCT CGAACCGGTG GCCGCTGACG GCGACACGGT CTCGACCGAC
CTCGGCCACG CAGCGAGCGC GCTGGAGACC TTCGCCGCGG ACCTGCGCGA CATCAAGTCC
CGCTGGTCCA GCCTGAGCGC CGAGGCCTTC GAGTTCCGGG CCCGCATCGA CGCGAAGGGC
GATGACTGGC GCAAGGCCGA GGGCGTGGCC GGTTTCTTCG GGATCGGCGA GAACCCCGAC
GTGGAGGAGA ACCAGCGCCT CATCGACGAG GGCATCCGCA TCATCGAGGA CTACGCGGAC
GCCGAGCTCG CCTGCGCCAA CGCCATCAAC CGGTTCGTCC CGGACCGCAC CCCGTTCGAG
AGGACGCCCT CCGGGGACGG CGCCCTCGAC CCAGACGTCT TCTACCACGG CTACGAGGAG
GACCTGTCGG ACCTTGCCAC CGAGTGGGAC ATGGGCGGGG CCGTCACCGA CGAGAGCTGG
TTGGTCGACG GCTGGGACGC GGTGTGGGAC TTCGGCGTCG GGGCGGTCGA GGGCACCGGC
GCGATGCTGG GCATGCACAG CTCCGAGGGC TGGTTCAACA TGTCCTGGGG CGACGCCCTG
TACGAGTACC ACGAGAGCAA CATCCAGTCC GTGGCCTCCC TGGTGGGCAT GTACGACGCC
GAGTCCGACA GCTACGGCTG GTCGGGCTGG GACACCGTCG GCTCGGCGTG GAAGGACCTG
GCCCACTCGG TGGTGCCCTG GGAGGAGTGG GGTGAGCGGC CCGGCTACGT GATCGGCACC
GCCGTGCTCA ACATCGGCGT CACCGCCCTC GGCGCCGCGC TGTCGGCCAC CGGTGTGGGC
GCGGCGGTCG GCGTCCCGCT GATGGCCTGG CGCGGCATGG CCATCGTGGA CGGTATGGGC
GGTCGCGGCG GTGGCAGCGG ATCCGGCGGA GCGGCGGACG TCGACGTGGA CCTGCCCGCC
AACATCCCCG CCTTCGGGGG CAGCGGTGCC CCCGTCGTGC GGATCGACAC CAGCGTCTTC
GACACCGACG GACTCAGCCC GCAGCAACTG GGTGACCTGA GGGGGTCCTT GGACCGCTTG
CAGGGCATCA CGAACGACCC CGCCGACGGC TCGCACGCGG ACGGGTCTCC TCCGCCGCGG
TCCGCGCCCG TCCAGGGTGA CAGCGATCCC TCCGAATCGA GGAGGCCCAC CGCACGACCG
GTCGCGGACC GGGAGACCGG GGCGAACGAG AGACCCTCCA GGGCCGAGCC CGAAACCGAC
CGTGCCGGGG TGGACGACGA ACAGGCCCTC CTTGAGGAGG TTTCCGGGGA AGAGCGCGGC
AGCTCTCCCG TGCCGGTGAT CGAGGACGCT TCGGACACCT CCCACCGGTC CTCGCCGACC
GAGAGCGGTT ACCAGGACCC CACCGCCGAG CAGCTCAGCG AGAGCGACAG GCTGCTGCGA
CAGGTCAACG GCATGTTCAC GGCTGAGGAC CACGCGGACT TCCAGGTCAA CCAACGCGCC
GAGACCGCCC TGTACGAGGG CGACCGCGCC ACCGTGGACA CGGACTCCTC TTCGCTGAAG
GACTCCCAGG TCGCGGAGAG GTACGGGCTG GACGGCCGCG CGAACGCCGC GTTCTCCGAC
ATGCGTGCGG TGGCGAGCGA CTACCCCCAT GTGAACTGGG AGGGTGGCCC CGGTGACGGA
CGGGACGTGC GGGATGGCCG CGACGAGCCC GAGGCCGACA GGGAGTACGC GACGACGGGG
CCGCGTCCCC ACGCAGACAC CGCGAGCGCG CAGAGAGCGA TCCCCCACAG CCGGGTCCAG
CACGTGGACC TGGGGGACAG CGGAAGCCAC CGGATCGACG CCTCACGGGT CGGTGCACCT
GACGCGCGTA GTGGCTCCGA TCCCGGATTC CGCCACGACC ACGGCGCCGA CACGGGAACC
CATCGTCCGG TCGCCGTCAA CAGGGCCGAC TCCGTGCCAA GCGGCGACGG CGGCTCCGGC
GTTGACACGC CTCTCCGGGG CGACGGTCCC TCCGGCCTCG GCGGATCCGA CCGGACCGAC
TCCGTCGATG CCGTCCCGGC CGTGGTCACC CCGACGCGTG CTGGCGTCCC CGGGTCCTCC
GGCGGCCCGA GCGGAGGTAC CGGCGGCTCC GGGAACCAGA ACGGTCGGAG CGGTCCCGGC
GGCGGCAACA GTTCCAGCGG CATCCCCTCT GACAAGCCGC CGCACGACTC CAACGGAGCA
GGATCCCCCC AGCAGGAACT TCCAAGAAGC AAGGATGCCG ACATCCCCGC CAGGAAGTTC
GACGGAATAT CCAGGGAGGA TCTCAATCGA GAGATCGCCC AAGAGATAGG ACTGAAGCCG
GACACGCCCT ACAAGGTTTT CGCCAAGAAG TTTACGGATT TCCTCAACGA CAAAATCTCG
AACGGAAAGG ATAAATATTT CCTCGAATTT TACAACAGCA ACGGCAGTAG ATTTAGAATC
CAACGCGATG TTTTCGGCTT CAAGATTCCA CAGCTAACCA GCGTCAATGG CTCCGCCCCC
TGGTTCGCCA TGGAGACGCT GCCCGAACCA GACGCGCCCA AATACCGCAC GCTTAACGAC
CGAGTGTACT TCTCTGCCGA AAATAGCAAG AACACGACCT CGGAGATGGA CGACTACGCC
GAACAAAGAA GGCAATCCAT CAAGGATGAT CTTGCCGCGG AGAAAAAACT TAAGGAAAAG
AAGGAGGAAC ACAAGAACTA CCCGGGCGAA CACCCGGAAG TACTCAAGGC AGACGCGGAA
CACAGCCCAC TCCACAAGAA GATGACCAAA GATAGCGAAA CATACGGGGA GGAGGGTGCC
AAGGTAGCGG TCAAGGATCT GTTCGACGGC CATCACTCGC TTCCCGGAAA AGATGGAAAT
GTCATCGATC TTCCTGAAAT CGAGGAAATA AAAGGCTCCA AAAAACCCTA TGAAACTCCG
GGGACACCGA GGAGTGGGAA CTACCAGTTC GACCAGATTT GGCCCCTCAA AGGCGGTGGT
ATCCTCATTG TCGAGGCCAA GAGTAGCCAG TCGACCTCCC TGGGCGAGAG AACCATTCCG
GCAGGCAAAG AACCCAAGCG GGTCTCCCAA GGCACGCCAG AGTACCTCGA AGCAACCACG
AAAGACATGA AGAGGAGGGG GCGCAAGAAT CAACTCGGAG ACCCAACAGA AGAAAGACTT
GCGGAAATGA TTGAAGAAGC CAGGGCCGCC GGCAAGCTTT TCTATGCGGA AGTAAAGGGC
ATCACCCCGG AGCAAAAACA GAAGGCCCTC GGCGAAGACC CAAAGGAAGG AGAAAAGCAC
TCCGGATACA GCATTGGCCT TTTCGACATA CCCAAGAAGT AG
 
Protein sequence
MLPPDTPPSA PLSVGEQTGM LTDGENKVSP ATFPVPSPSP DYIETLASDL RSAGESVGDT 
GNDITSAWSG LRSHYKAPEA EHLYSVLEPV AADGDTVSTD LGHAASALET FAADLRDIKS
RWSSLSAEAF EFRARIDAKG DDWRKAEGVA GFFGIGENPD VEENQRLIDE GIRIIEDYAD
AELACANAIN RFVPDRTPFE RTPSGDGALD PDVFYHGYEE DLSDLATEWD MGGAVTDESW
LVDGWDAVWD FGVGAVEGTG AMLGMHSSEG WFNMSWGDAL YEYHESNIQS VASLVGMYDA
ESDSYGWSGW DTVGSAWKDL AHSVVPWEEW GERPGYVIGT AVLNIGVTAL GAALSATGVG
AAVGVPLMAW RGMAIVDGMG GRGGGSGSGG AADVDVDLPA NIPAFGGSGA PVVRIDTSVF
DTDGLSPQQL GDLRGSLDRL QGITNDPADG SHADGSPPPR SAPVQGDSDP SESRRPTARP
VADRETGANE RPSRAEPETD RAGVDDEQAL LEEVSGEERG SSPVPVIEDA SDTSHRSSPT
ESGYQDPTAE QLSESDRLLR QVNGMFTAED HADFQVNQRA ETALYEGDRA TVDTDSSSLK
DSQVAERYGL DGRANAAFSD MRAVASDYPH VNWEGGPGDG RDVRDGRDEP EADREYATTG
PRPHADTASA QRAIPHSRVQ HVDLGDSGSH RIDASRVGAP DARSGSDPGF RHDHGADTGT
HRPVAVNRAD SVPSGDGGSG VDTPLRGDGP SGLGGSDRTD SVDAVPAVVT PTRAGVPGSS
GGPSGGTGGS GNQNGRSGPG GGNSSSGIPS DKPPHDSNGA GSPQQELPRS KDADIPARKF
DGISREDLNR EIAQEIGLKP DTPYKVFAKK FTDFLNDKIS NGKDKYFLEF YNSNGSRFRI
QRDVFGFKIP QLTSVNGSAP WFAMETLPEP DAPKYRTLND RVYFSAENSK NTTSEMDDYA
EQRRQSIKDD LAAEKKLKEK KEEHKNYPGE HPEVLKADAE HSPLHKKMTK DSETYGEEGA
KVAVKDLFDG HHSLPGKDGN VIDLPEIEEI KGSKKPYETP GTPRSGNYQF DQIWPLKGGG
ILIVEAKSSQ STSLGERTIP AGKEPKRVSQ GTPEYLEATT KDMKRRGRKN QLGDPTEERL
AEMIEEARAA GKLFYAEVKG ITPEQKQKAL GEDPKEGEKH SGYSIGLFDI PKK
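
Both sequences above are displayed in blocks of 10 characters, with 60 characters per full line. A minimal sketch for turning that display layout back into a contiguous string is shown below. The helper name `join_blocks` is hypothetical, and the two sample strings are single lines copied from the listings above, not the full sequences.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used only for display grouping."""
    return "".join(text.split())


# First line of the gene sequence and last line of the protein sequence,
# copied from the listings above.
first_gene_line = (
    "GTGCTTCCTC CCGACACTCC CCCGAGCGCG CCGCTGTCCG TCGGCGAGCA GACCGGCATG"
)
last_protein_line = (
    "AEMIEEARAA GKLFYAEVKG ITPEQKQKAL GEDPKEGEKH SGYSIGLFDI PKK"
)

if __name__ == "__main__":
    gene_part = join_blocks(first_gene_line)
    protein_part = join_blocks(last_protein_line)
    print(len(gene_part), gene_part[:3])
    print(len(protein_part), protein_part[-3:])
```

Applied to the full listings, the same helper should yield strings whose lengths can be compared with the Gene Length and Protein Length fields. The gene sequence begins with GTG, which translation table 11 accepts as a start codon and which is read as methionine at the start of a protein, consistent with the leading M of the protein sequence.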