Gene Ndas_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1402 
Symbol 
ID9245252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1716695 
End bp1719937 
Gene Length3243 bp 
Protein Length1080 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003679340 
Protein GI297560366 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0215001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAC GGCTCACCTA CGCGGACGCG CTCAAGGTCC TCGGCCAGAA CGACTCGGAG 
GTGATGAACC TCGCCGAGAA ACTCGCGGAC GGCGGGCTGG GGCTGGTGGG GGTGCCCGAC
CTCTTCGGCC TGCGCGGTTT CCTGGTGAGC AAGGGCCGCC AGGCCATCGA GGGCATCCGG
GACAAGATCA GCGGTGAGAG CCGCCTGTCC CGGACGGAGA AGATCGAGGC GGCCCACAAG
GTACTGGTCG TCATCTCGTT CTTCGAGGCG GTGGAGGAGT GCCTCGACAG GGCCGAAGCT
CCCTTCTCCC TGGACGCGCT CAGGCTCGGC GACGCCGCAC CGTTCACGTC CTTCGACTCG
GCCCTCCAGG ACCTGGGGGC GCTCCCCGCC CCCGCCGACG TGCACACCGA GGACGTGGAA
CCCTACGAGA TCCTCCAGGT CTTCCACCTC CTCACGGGCA GCTTCAGCGA CATCCTGGTG
GGCCACCCCG AGGCCGTCGC ACACGCTGTG GACCACGACC ACCCCCTCAT GGCACGGCTG
ATCGACGAGG TCCCCGAGGA CGCCGCCCGG CGGCTCACCG AGTACCTGCG CCAGCTCGCC
GCCGAGGTCC CCGAGTTCGG CATGTGGGTG CACCTCTACG AGCACGACCA CACCCGCCAG
GTCCTCGCCC ACGGGTTCGG GGAGCTGGGC AGACTGCTGG AGCTGGCGGG CTCCGGCCGA
CCGGTGGACC AGCGCAGGCG GGAACTCGCC GCCTCCTACC GGGCGGTGCT GCGCCAGCCC
GTGCTGCGCT CGGACGAGGC GTCGTCGGGG CTGGTCCTGC CCGCCCTGGA GGACGCCTAC
GTGCCGCCGT GCGGCCGTGT GCTGAGGGCC GACACCAACG GCTCCCCCTC CACGGAGGAG
GCGTGGCAGG AGGCCGCACT CGTGGACGAC CTCCGGCGCT TCTTCGCGCT GCACCTGATC
CATGCGGACT CCGTCGAGAC CCCCACGGTC GTCCTGGGCC ACCCGGGCGC GGGCAAGTCC
AAGTTCACCG AGATGCTCGC GGCCCACCTG CCCGCCGCCG ACTTCCTCCC CATCCGGGTG
GAGCTTCGCT CGGTCCAGCC GAACGCGCCC ATCCACGCGC AGATCGAGGA AGGACTCGCC
GAGACGCTCC ACACCCGCGT GTCCTGGCGC GAACTCGCCG AGTCGGCGGA CGGCGCGCTG
CCCGTCATCA TCCTGGACGG CTTCGACGAA CTCCTCCAGG CGACCGGTGT GGACCGCTCC
GACTACCTGG AGCGGGTGGA GGAGTTCCAG CACAAGCAGG AGGCTCTGGG GCAGCCCGTG
GCGGTCATCG TCACCAGCCG CACGGTCGTG GCGGACCGGA CGCGCTTCCC CCTGGGCACC
ACCGTGGTCC GGCTGGAACC CTTCACCGAG GCCCAGATCG GACAGATGGT CCGGGTGTGG
AACGAGGCGA ACGCGCGGGC CCTGGCCTCC CGGGGCCTCG AACCGCTGTC CGTCGACTCC
CTCCTGCCCT ACCGCGAACT GGCCGAGCAG CCCCTGCTGC TGCTCATGCT GCTCATCTAC
GACGCCGGGG ACAACGGTCT GCGGCGCGCC TCGCACGCCC TGTCCCACGG GGAGCTGTAC
GAACGGCTGC TGACGATGTT CGCCAGGCGC GAGGTGGACA AGCACCACTC CGGTCTGGGC
AGGAACGACT TCGACGACGC GGTCGAGGAG GAGCTGCGCC GTCTGGAGAT CGCCGCGCTG
GCGATGTTCA CCCGGCGCAA GCAGAGTGTG AGCGCCGACG AGCTGGACCG GGACCTCGCG
GTCCTCATGC CCGACGCCGC GGTGCGCGCG GCCGAGACCG ACCTGCACGG GCGGATCGAC
CCCGCCCACC AGGTGCTGGG CAGGTTCTTC TTCGTCCACG AGTCCCGGGC CAGGCGCGGC
GAGGGCACCG CGAGCGTGTT CGAGTTCCTG CACGCCACCT TCGGCGAGTA CCTGGTGGCC
AGGGCGGTGA TCGTCGCCCT GGACGAGCTG GACGCCTCAC GCGCGCGCTC CTCACGGCGC
CGGGCGCGGA GTACGCGTCC CGACGACGGC GAGCTGTACG CCCTGTCCTC CTTCGCCAGC
TACGCGGGCA GGGAGAAGGT CGTGGACTTC CTCTCCGAAC TCCTGGAACG GCGGTTCACC
GAGGAGCCGG AGGCCCGCGA GGACTACGCG CGGCTGCTCG TGGAGCTGTT CCAGGAGGCG
CCCTTCCCCG CCCCCAACCG CTCGTACGGT GCCTACGAGC CCACGCGGCT GCCGGTGACC
CAGCGCGAGG CCAACTACAC CAGCAACCTC ATGATCCTGC TGTGCCTGGT GCGGGAGGAG
CCGGTGGACG CGCGCGAGCT CTTCCCGGAG GCGCACGAGC CCGACCAGAC CCTCCAGGGC
ACCACCACGA TGTGGCGCAC CCTGCCCGGC GCCGAGTGGT TCAGCATCGT GTCCGTGCTC
CGCGTCCGCC ACCTGGATGG CTGGGAGGCC GACGGCCCCG TGACGGTCAT CGGCAGGGAG
GACGGCGCAC CGGTCAACGT CGGGGAGTGC ATGGGGTTCG AACTGCGCAA GAACAACGAC
GCCGCGCCCT CCGTGACCGA CCCCTACGGC ATCTCCGTGT CCTACGACAC GGTCGCCTCC
CGTCTGCTCC GGTCGATGGC TATGCGCGTC AACGGCACGG CCGCGCGCTT CATGTTCGGG
CTTCTGCCCT ACCTGGGCCA CGTGTCGGAG GACCTGGGCA CCTGGTACTC GGACGTCCAC
TCCGAGGCGA CGTGGACGGA GGCCCACGAG CTCATGCGGC TGCGGCTGGA GCCCGCCGCC
GAAAACCCCG GGGAGCGCCT GCGCACCTAC CGGCGCCTGC TCGCGCAGCG GGCCCTCGGA
CGCCTCGAAC TGGTGGTGCT CAGGCAGGCG GCGGAGGACC TGGCCCTGGT GTCCGAGGCG
TCCGCGTTCG CTACCGAACT CGTGGATGTC GTCAACCTCT ACCTCAACGG TGTGCGGACG
GTCGTGGTCG GGCCCCAGCT CCGGGAGGAG ACGGTCGTCC CGGTCCTGCG GGTCCTGGCC
CCGTACACGC ACGAGGACGT CTTCGAGCGG GTCCTCGGAC TCGCCCGGGC CTCGTGGGCG
TCGGAACACG GCCCCCGTGC CGTCGGCGGT GTCGGGCGGA TCGGAGAACA GGCCGCCGAC
ACCGTCCCCG GGCTGCGCCG AACGCCTTCC CCGACGGGCC GGTACAGCAG CAGCGGGGGC
TGA
 
Protein sequence
MAKRLTYADA LKVLGQNDSE VMNLAEKLAD GGLGLVGVPD LFGLRGFLVS KGRQAIEGIR 
DKISGESRLS RTEKIEAAHK VLVVISFFEA VEECLDRAEA PFSLDALRLG DAAPFTSFDS
ALQDLGALPA PADVHTEDVE PYEILQVFHL LTGSFSDILV GHPEAVAHAV DHDHPLMARL
IDEVPEDAAR RLTEYLRQLA AEVPEFGMWV HLYEHDHTRQ VLAHGFGELG RLLELAGSGR
PVDQRRRELA ASYRAVLRQP VLRSDEASSG LVLPALEDAY VPPCGRVLRA DTNGSPSTEE
AWQEAALVDD LRRFFALHLI HADSVETPTV VLGHPGAGKS KFTEMLAAHL PAADFLPIRV
ELRSVQPNAP IHAQIEEGLA ETLHTRVSWR ELAESADGAL PVIILDGFDE LLQATGVDRS
DYLERVEEFQ HKQEALGQPV AVIVTSRTVV ADRTRFPLGT TVVRLEPFTE AQIGQMVRVW
NEANARALAS RGLEPLSVDS LLPYRELAEQ PLLLLMLLIY DAGDNGLRRA SHALSHGELY
ERLLTMFARR EVDKHHSGLG RNDFDDAVEE ELRRLEIAAL AMFTRRKQSV SADELDRDLA
VLMPDAAVRA AETDLHGRID PAHQVLGRFF FVHESRARRG EGTASVFEFL HATFGEYLVA
RAVIVALDEL DASRARSSRR RARSTRPDDG ELYALSSFAS YAGREKVVDF LSELLERRFT
EEPEAREDYA RLLVELFQEA PFPAPNRSYG AYEPTRLPVT QREANYTSNL MILLCLVREE
PVDARELFPE AHEPDQTLQG TTTMWRTLPG AEWFSIVSVL RVRHLDGWEA DGPVTVIGRE
DGAPVNVGEC MGFELRKNND AAPSVTDPYG ISVSYDTVAS RLLRSMAMRV NGTAARFMFG
LLPYLGHVSE DLGTWYSDVH SEATWTEAHE LMRLRLEPAA ENPGERLRTY RRLLAQRALG
RLELVVLRQA AEDLALVSEA SAFATELVDV VNLYLNGVRT VVVGPQLREE TVVPVLRVLA
PYTHEDVFER VLGLARASWA SEHGPRAVGG VGRIGEQAAD TVPGLRRTPS PTGRYSSSGG